[Feature]: Make Elasticsearch service restart after config change more graceful #343
Comments
ouch. I'd say this is a bug and not a feature. It should be easy enough to create a handler for a rolling cluster restart (all the code is in the repo already), but I'm not sure how best to handle it without duplicating most of the code from elasticsearch-rolling-upgrade.yml in a handler. Is there a way to inject tasks into a handler? Or we could have two task files: one with all the tasks to gracefully stop a node, and one with everything needed to bring it back online and wait for the cluster to become green. They could then be included in both a cluster restart handler and the rolling-upgrade file.

So the handler would look something like this:

- name: Gracefully stop node
  ansible.builtin.include_tasks:
    file: cluster_restart_stop_node.yaml

- name: Start node and wait for green cluster
  ansible.builtin.include_tasks:
    file: cluster_restart_start_node.yaml

And the rolling-upgrade file something like this:

- name: Gracefully stop node
  ansible.builtin.include_tasks:
    file: cluster_restart_stop_node.yaml

# Tasks to upgrade packages

- name: Start node and wait for green cluster
  ansible.builtin.include_tasks:
    file: cluster_restart_start_node.yaml
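
For illustration, a rough sketch of what those two task files could contain. This is an assumption, not code from the collection or the rolling-upgrade file; it presumes the Elasticsearch API is reachable on http://localhost:9200 without authentication or TLS, and that the service is managed by systemd under the name "elasticsearch".

# Hypothetical sketch of cluster_restart_stop_node.yaml (not the collection's code)
- name: Disable shard allocation while the node is down
  ansible.builtin.uri:
    url: http://localhost:9200/_cluster/settings
    method: PUT
    body_format: json
    body:
      persistent:
        cluster.routing.allocation.enable: primaries

- name: Flush indices to speed up shard recovery
  ansible.builtin.uri:
    url: http://localhost:9200/_flush
    method: POST

- name: Stop Elasticsearch on this node
  ansible.builtin.systemd:
    name: elasticsearch
    state: stopped

# Hypothetical sketch of cluster_restart_start_node.yaml (same assumptions)
- name: Start Elasticsearch on this node
  ansible.builtin.systemd:
    name: elasticsearch
    state: started

- name: Wait for the node to respond on the HTTP port
  ansible.builtin.uri:
    url: http://localhost:9200/_cluster/health
  register: node_up
  until: node_up.status == 200
  retries: 30
  delay: 10

- name: Re-enable shard allocation
  ansible.builtin.uri:
    url: http://localhost:9200/_cluster/settings
    method: PUT
    body_format: json
    body:
      persistent:
        cluster.routing.allocation.enable: null

- name: Wait for the cluster to become green again
  ansible.builtin.uri:
    url: http://localhost:9200/_cluster/health
  register: cluster_health
  until: cluster_health.json is defined and cluster_health.json.status == 'green'
  retries: 60
  delay: 10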
Good find and sorry for your bad experience. Yes, definitely a bug. We'll look into it. Thanks also for the suggestions, @ivareri.
This problem gave me a real headache. I pushed a draft PR with your idea. That should actually work, but I'm afraid this will need quite extensive testing.
Describe the feature request
We updated our Elasticsearch instances to version 8.15.0 using the rolling update feature of the collection, which works well: the cluster was available the whole time.
With the update we also changed some parameters in the Elasticsearch config. The collection applies the parameter change as part of the "normal" installation process, after all nodes have been updated. This means the config is changed on all nodes and all nodes are restarted at once by a handler. This full cluster restart makes the cluster unavailable for some time.
For us it makes no sense to perform a rolling update with many tasks to make sure the cluster stays available the whole time, and then afterwards perform a full cluster restart, which achieves the opposite.
Please implement a "graceful" cluster restart (with rolling restarts and cluster health checks) after a change to the Elasticsearch config.
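
As a rough sketch of what such a graceful restart could look like (an assumption, not the collection's actual implementation): run the play with serial: 1 so hosts are handled one at a time, and let the config task notify handlers that include the stop/start task files suggested in the comment above. The play, handler and file names below are illustrative.

# Illustrative sketch only -- play, handler and file names are assumptions
- hosts: elasticsearch
  become: true
  serial: 1                      # roll through the cluster one node at a time
  tasks:
    - name: Deploy Elasticsearch configuration
      ansible.builtin.template:
        src: elasticsearch.yml.j2
        dest: /etc/elasticsearch/elasticsearch.yml
      notify: Restart Elasticsearch gracefully

    - name: Restart this node right away if its config changed
      ansible.builtin.meta: flush_handlers

  handlers:
    - name: Gracefully stop node
      ansible.builtin.include_tasks:
        file: cluster_restart_stop_node.yaml
      listen: Restart Elasticsearch gracefully

    - name: Start node and wait for green cluster
      ansible.builtin.include_tasks:
        file: cluster_restart_start_node.yaml
      listen: Restart Elasticsearch gracefully

Because of serial: 1, the next node's config is only touched after the current node has rejoined and the cluster is green again, so the cluster stays available during the config rollout.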