
Deploy search engine#359

Merged
sbesson merged 22 commits into IDR:master from khaledk2:master
May 2, 2022
Conversation

@khaledk2
Contributor

I have added playbooks to deploy the searchengine, the searchengine client and Elasticsearch.
"management-searchengine.yml" is used to configure and run all three applications.
There is a variables file (searchengine_vars.yml) that the user needs to customize before running the playbook.
After deploying the apps with the playbook, another playbook (run_searchengine_index_cache_services.yml) must be run to perform caching and indexing.
As the caching and indexing processes take a long time, there are another two playbooks that enable the user to check whether they have finished, i.e. check_indexing_service.yml and check_caching_service.yml.
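For illustration, a status-check playbook of this kind could poll the state of the indexing container. This is a hypothetical sketch only: the group name, container name and message are assumptions, and the real check_indexing_service.yml may be shaped differently.

```yaml
# Hypothetical sketch of a status-check playbook; the container name
# "searchengine_index" and the hosts group are assumptions.
- hosts: searchengine-hosts
  tasks:
    - name: Inspect the indexing container
      docker_container_info:
        name: searchengine_index
      register: index_info

    - name: Report whether indexing has finished
      debug:
        msg: >-
          {{ 'indexing finished' if not index_info.exists
             or not index_info.container.State.Running
             else 'indexing still running' }}
```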

Comment thread ansible/management-searchengine.yml Outdated
Comment thread ansible/management-searchengine.yml Outdated
Comment thread ansible/management-searchengine.yml Outdated
Comment thread ansible/searchengine_vars.yml Outdated
@khaledk2 khaledk2 requested a review from sbesson January 27, 2022 12:27
@sbesson
Member

sbesson commented Jan 27, 2022

After copying the vars file to match the name of the group and removing the variable pointing to a local path

(idr-ansible) (base) sbesson@ls30630:ansible ((db001bb...)) $ diff group_vars/searchengine_vars.yml group_vars/management-hosts.yml 
11d10
< ansible_python_interpreter: path/to/bin/python

the playbook executed until

TASK [configure elasticsearch  for docker searchengine] **********************************************************************************************************************************************
fatal: [test104-management]: FAILED! => {"ansible_facts": {"discovered_interpreter_python": "/usr/bin/python"}, "changed": false, "msg": "Docker SDK for Python version is 1.10.6 (test104-management.novalocal's Python /usr/bin/python). Minimum version required is 2.1.0 to set auto_remove option. Try `pip uninstall docker-py` followed by `pip install docker`."}

PLAY RECAP *******************************************************************************************************************************************************************************************
test104-management         : ok=12   changed=11   unreachable=0    failed=1    skipped=0    rescued=0    ignored=0   

Possible options to move forward are:

  • drop the auto_remove option for now (does any behavior depend on it?)
  • review how the Docker SDK for Python is installed and upgraded, and/or create a local virtual environment with a recent version of the docker module
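The second option could take a shape like the following sketch: install a recent Docker SDK into a dedicated virtualenv and point Ansible at its interpreter. The paths and version pin here are illustrative, not part of this PR.

```yaml
# Illustrative only: virtualenv path and version pin are assumptions.
- name: Create a virtualenv with a recent Docker SDK for Python
  pip:
    name: docker>=2.1.0
    virtualenv: /opt/searchengine-venv
    virtualenv_command: python3 -m venv

# Then, in the inventory or group_vars for these hosts:
# ansible_python_interpreter: /opt/searchengine-venv/bin/python
```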

@khaledk2
Contributor Author

auto_remove instructs Docker to delete the container after it exits. I think we may comment it out for the time being, what do you think?

@sbesson
Member

sbesson commented Jan 27, 2022

Agreed, let's comment it out and come back to it later in the testing.

@khaledk2
Contributor Author

Sorry, I should have mentioned before that I commented out auto_remove and pushed the playbooks yesterday.
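For illustration, the commented-out option in a docker_container task might look like this; the container name, image and surrounding task are assumptions, not the actual playbook contents.

```yaml
# Illustrative sketch only; names and image are assumptions.
- name: Run the searchengine container
  docker_container:
    name: searchengine
    image: openmicroscopy/omero-searchengine:latest
    state: started
    # auto_remove requires Docker SDK for Python >= 2.1.0;
    # commented out until the SDK on the host is upgraded.
    # auto_remove: yes
```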

Comment thread ansible/group_vars/searchengine_vars.yml Outdated
Comment thread docs/searchengine_deployemnt.md Outdated
Comment thread ansible/group_vars/searchengine_vars.yml Outdated
Comment thread ansible/management-searchengine.yml
Comment thread ansible/check_caching_service.yml Outdated
@sbesson
Member

sbesson commented Feb 1, 2022

Added a minimal configuration allowing port 5567 to be proxied under the /searchengine endpoint:

TASK [ome.nginx_proxy : nginx | proxy config] *************************************************************************************************
--- before: /etc/nginx/conf.d/proxy-default.conf
+++ after: /Users/sbesson/.ansible/tmp/ansible-local-26677fmuyzkn/tmp_tkpwlmz/nginx-confd-proxy.j2
@@ -253,15 +253,6 @@
 
     }
 
-    location ^~ /searchengine {
-        proxy_pass http://searchengine/;
-        proxy_redirect http://searchengine $scheme://$server_name;
-
-
-        proxy_ignore_headers   "Set-Cookie" "Vary" "Expires";
-        proxy_hide_header Set-Cookie;
-    }
-
 
     add_header Access-Control-Allow-Origin $allow_origin;
 
changed: [test104-proxy] => (item={'nginx_proxy_is_default': True, 'nginx_proxy_additional_directives': ['add_header Access-Control-Allow-Origin $allow_origin']})
ok: [test104-proxy] => (item={'nginx_proxy_server_name': 'cachebuster', 'nginx_proxy_listen_http': 0, 'nginx_proxy_ssl': False, 'nginx_proxy_cachebuster_enabled': True, 'nginx_proxy_backends': [{'name': 'omerocached', 'location': '~ /webclient/metadata_*|/webclient/render_*|/webclient/get_thumbnail*|/webgateway/metadata_*|/webgateway/render_*|/webgateway/get_thumbnail*|/webclient/api/*|/webclient/search/*|/api/*|/webclient/img_detail/*|/iviewer/*|/figure/*|/gallery-api/*|/mapr/*', 'server': 'http://omeroreadwrite', 'cache_validity': '1d', 'read_timeout': 900}, {'name': 'omerostatic', 'location': '~ /static/*', 'server': 'http://omeroreadwrite', 'cache_validity': '1d'}, {'name': 'omero', 'location': '/', 'server': 'http://omeroreadwrite'}]})
ok: [test104-proxy] => (item={'nginx_proxy_server_name': 'idr-demo.openmicroscopy.org', 'nginx_proxy_ssl': True, 'nginx_proxy_redirect_map_locations': [], 'nginx_proxy_direct_locations': [{'location': '/', 'redirect301': '$scheme://idr.openmicroscopy.org$request_uri'}], 'nginx_proxy_backends': []})
TASK [ome.nginx_proxy : nginx | proxy upstream servers] ***************************************************************************************
--- before: /etc/nginx/conf.d/proxy-upstream.conf
+++ after: /Users/sbesson/.ansible/tmp/ansible-local-26677fmuyzkn/tmplufdxnu2/nginx-confd-proxy-upstream.j2
@@ -13,6 +13,3 @@
 upstream omeroreadwrite {
   server 192.168.3.22;
 }
-upstream searchengine {
-  server 192.168.3.120:5567;
-}

Following this morning's discussion, we are currently running into two issues:

  • the static files are missing from the endpoint. Khaled is looking into the Nginx configuration
  • the indexing process failed with a statement timeout. Khaled identified the issue as memory-related and reduced the number of rows processed concurrently to allow the indexing to run. The management VM currently used for the deployment is relatively small, with 8GB RAM and 4 VCPUs. When moving to a production state, we might consider hardening this configuration and provisioning the searchengine VM with more resources, similar to the OMERO ro/rw VMs.

@khaledk2
Contributor Author

  • I should have written this before: I have changed one line in the searchengine section of the proxy-default.conf Nginx configuration file to add the "searchengine" subdomain to the HOST header
    proxy_set_header Host $host/searchengine
  • Also, I have made some modifications to the search engine client code so that it works when its URL is not the domain root.
  • I have renamed the "static" folder to "searchengineclientstatic" in the search engine client.
  • I have changed the searchengine code to allow the number of rows to be configured externally (in the app configuration) so it can be customized according to the host machine configuration.
  • I have changed the deployment files: I added a "cache_rows" variable in management-searchengine-hosts.yml to set the number of rows and added an additional step to management-searchengine.yml to set this configuration.
  • I have added searchengine_secret_key and searchengineclient_secret_key variables to set the SECRET_KEYs for the searchengine and the searchengine client, and added steps inside the deployment yml file to configure them.
  • The Docker images for the searchengine and searchengineclient are hosted in my Docker Hub account; they are built and pushed manually. I have created GitHub Actions to build and push them automatically to the openmicroscopy Docker Hub account; they are in the testing stage.
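The cache_rows wiring described above could be sketched as follows. The variable name matches the PR; the configuration file path and key are assumptions, not the actual implementation.

```yaml
# group_vars/management-searchengine-hosts.yml (variable from the PR)
cache_rows: 10000

# management-searchengine.yml — task sketch; the config path and the
# CACHE_ROWS key are assumptions for illustration.
- name: Configure the number of rows processed concurrently
  lineinfile:
    path: /etc/searchengine/app_config.yml
    regexp: '^CACHE_ROWS:'
    line: "CACHE_ROWS: {{ cache_rows }}"
```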

Comment thread ansible/management-searchengine.yml Outdated
Comment thread ansible/management-searchengine.yml Outdated
Member

@sbesson sbesson left a comment


With the last set of commits, I was able to successfully deploy the search engine application and launch the indexing/caching processes on a fresh test104 deployment.

The following local changes to the idr-proxy.yml playbook/variables were also applied to deploy the client under the /searchengine endpoint:

diff --git a/ansible/group_vars/proxy-hosts.yml b/ansible/group_vars/proxy-hosts.yml
index e082a821..63dc5cd8 100644
--- a/ansible/group_vars/proxy-hosts.yml
+++ b/ansible/group_vars/proxy-hosts.yml
@@ -38,6 +38,8 @@ nginx_proxy_upstream_servers:
   servers: "{{ omero_omeroreadonly_hosts_external | map('regex_replace', '^(.*)$', '\\1:4065') | sort }}"
 - name: omeroreadwrite
   servers: "{{ omero_omeroreadwrite_hosts }}"
+- name: searchengine
+  servers: "{{ searchengine_hosts | map('regex_replace', '^(.*)$', '\\1:5567') | sort }}"
 
 # The regex is getting complicated, so unroll it into a list and join
 _nginx_proxy_omero_locations:
@@ -100,11 +102,19 @@ _nginx_proxy_backends_prometheus_federate:
   server: "http://{{ management_host_ansible | default('localhost') }}:9090/federate"
   cache_validity: 15s
 
+_nginx_proxy_backends_searchengine:
+- name: prometheusfederate
+  location: "^~ /searchengine"
+  server: http://searchengine/
+  host_header: "$host/searchengine"
+
+
 nginx_proxy_backends: >
   {{ _nginx_proxy_backends_omero +
      _nginx_proxy_backends_omerowebsockets +
      _nginx_proxy_backends_grafana_render +
-     _nginx_proxy_backends_prometheus_federate
+     _nginx_proxy_backends_prometheus_federate +
+     _nginx_proxy_backends_searchengine
   }}
 
 
diff --git a/ansible/idr-proxy.yml b/ansible/idr-proxy.yml
index edc0db47..adb601f2 100644
--- a/ansible/idr-proxy.yml
+++ b/ansible/idr-proxy.yml
@@ -61,6 +61,12 @@
             idr_environment | default('idr') + '-management-hosts'][0]]
             ['ansible_' + (idr_net_iface | default('eth0'))]['ipv4']['address']
         }}
+      searchengine_hosts: >-
+        {{
+          groups[idr_environment | default('idr') + '-management-hosts'] |
+          map('extract', hostvars,
+            ['ansible_' + (idr_net_iface | default('eth0')), 'ipv4', 'address']) | list
+        }}
     when: groups[idr_environment | default('idr') + '-management-hosts'] is defined
 
   roles:

A few inline comments, and the client probably needs some testing once the indexing/caching has completed.
From a code perspective, I think we are approaching the point where this playbook can be safely merged into the repository. Importantly, as things stand, this app is not included by default and needs to be deployed manually. Probably the biggest question for the IDR team is whether we would consider deploying it on all production deployments as an experimental endpoint and/or the steps to move towards this target.

database_user_password: "{{ idr_secret_postgresql_password_ro | default('omero') }}"
searchenginecache_folder: /data/searchengine/searchengine/cacheddata/
search_engineelasticsearch_docker_image: docker.elastic.co/elasticsearch/elasticsearch:7.16.2
searchengine_docker_image: openmicroscopy/omero-searchengine:latest
Member


For a deployment from scratch this will do the job, but as soon as we want to update the Docker images, it will be preferable to use tagged images rather than latest.
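Pinning the images would only change the variable values; the tag below is an example, not a released version.

```yaml
# Example of pinning a tagged image instead of latest;
# the "0.1.0" tag is illustrative only.
searchengine_docker_image: openmicroscopy/omero-searchengine:0.1.0
search_engineelasticsearch_docker_image: docker.elastic.co/elasticsearch/elasticsearch:7.16.2
```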

searchengine_index: searchengine_index
cache_rows: 10000
# I think that the following two variables should be in secret
searchengine_secret_key: "fagfdssf3fgdnvhg56ghhgfhgfgh45f"
Member

@sbesson sbesson Feb 24, 2022


Proposing to update the value of these keys and migrate them to private variables.
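Migrating the keys to private variables could follow the idr_secret_* lookup pattern already used for the database password in this file; the secret variable names below are assumptions.

```yaml
# Sketch only: the idr_secret_* variable names are assumptions,
# following the pattern of idr_secret_postgresql_password_ro above.
searchengine_secret_key: "{{ idr_secret_searchengine_key | default('change-me') }}"
searchengineclient_secret_key: "{{ idr_secret_searchengineclient_key | default('change-me') }}"
```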

Comment thread docs/searchengine_deployemnt.md Outdated
* If the PostgreSQL database server is located on the same machine that hosts the searchengine, you need to:
* Edit the pg_hba.conf file (one of the PostgreSQL configuration files) and add the two client IPs (i.e. 10.11.0.10 and 10.11.0.11)
* Reload the configuration so that PostgreSQL accepts the connections from the indexing and caching services.
* As the caching and indexing processes take a long time, there are another two playbooks that enable the user to check whether they have finished:
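The pg_hba.conf step above could be automated with an Ansible task like this sketch; the client IPs come from the docs excerpt, while the pg_hba.conf path is distribution-dependent and the auth method is an assumption.

```yaml
# Illustrative task for the pg_hba.conf change; path and auth
# method ("md5") are assumptions.
- name: Allow searchengine clients to connect to PostgreSQL
  lineinfile:
    path: /var/lib/pgsql/data/pg_hba.conf
    line: "host all all {{ item }}/32 md5"
  loop:
    - 10.11.0.10
    - 10.11.0.11
  notify: reload postgresql
```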
Member

@sbesson sbesson Feb 24, 2022


Unlike the service set-up playbook, I am less convinced of the value of running and checking the indexing/caching via Ansible playbooks.

Unless there is an obvious alternative, happy to keep things as they are right now and revisit this behavior in the future. I suspect this will become apparent as we start running these workflows during the app lifecycle, e.g. before a release.

@sbesson sbesson self-requested a review February 24, 2022 20:58
@khaledk2
Contributor Author

I have pushed changes to run on the searchengine-hosts group and removed the hdf5 caching service, as all the cached data is now saved in Elasticsearch.

Comment thread ansible/group_vars/management-hosts.yml
Comment thread ansible/group_vars/management-hosts.yml Outdated
Comment thread ansible/management-searchengine.yml
Comment thread ansible/management-searchengine.yml
@khaledk2
Contributor Author

khaledk2 commented Apr 20, 2022

I have renamed the files and increased cache_rows to 50000; I think we can increase it even further.

@khaledk2
Contributor Author

I have reverted renaming dockermanager-hosts.yml and renamed the files.

Member

@sbesson sbesson left a comment


With the change to the hosts section of idr-searchengine.yml, I was able to deploy the searchengine stack on pilot-idr0000 and start the indexing process which completed in ~12h.

Comment thread ansible/idr-searchengine.yml Outdated
@khaledk2
Contributor Author

khaledk2 commented Apr 28, 2022

These are the modifications to fix the issue of displaying the Swagger documents via the searchengineapi URL:

I have added a variable to "searchengine-hosts.yml"; its value is the URL prefix (searchengineapi)
searchengineurlprefix: "searchengineapi"

It is used to set the script_name when running gunicorn.

Also, I have changed the Nginx configuration in the searchengineapi section:

location ^~ /searchengineapi {
    proxy_pass http://searchengineapi/searchengineapi;
    proxy_redirect http://searchengineapi/searchengineapi $scheme://$server_name;
}
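Passing the prefix through to gunicorn could use the standard WSGI SCRIPT_NAME environment variable; this is a sketch under that assumption, and the container/image names plus how the image consumes the variable are not confirmed by this PR.

```yaml
# Sketch only: container name, image and env handling are assumptions;
# SCRIPT_NAME is the standard WSGI/gunicorn mechanism for URL prefixes.
- name: Run the searchengine container with a URL prefix
  docker_container:
    name: searchengine
    image: openmicroscopy/omero-searchengine:latest
    state: started
    env:
      SCRIPT_NAME: "/{{ searchengineurlprefix }}"
```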

Member

@sbesson sbesson left a comment


With the latest set of changes and #367, I was able to successfully deploy the search engine stack onto a newly created pilot using the new group.

The API is available when forwarding the port 5577 and the client is available when accessing the port 5556.

The playbook is currently set up so that it will only run when executed manually against VMs with the correct groups.

As discussed this morning as part of the weekly IDR call, merging this so that we can make incremental progress towards a production release of the new service via smaller PRs. I will capture the outstanding issues as todos.

@sbesson sbesson merged commit c7bc747 into IDR:master May 2, 2022