ECK Operator not resetting shard_allocation setting to "all"

phoerious · December 14, 2021, 8:40am

Hi,

I've always had the the problem that the ECK operator does not reset the shard_allocation cluster setting after restarting a node. The result is that after a rolling restart over night, I end up with hundreds of unallocated replica shards and no redundancy, because transient.cluster.routing.allocation.enable is set to primaries. This will also halt the rolling reboot at some point when no more nodes can be restarted without making the cluster red.

This basically means that for each cluster reboot, I have to stand by all the time and issue a

PUT _cluster/settings
{
  "transient": {
    "cluster.routing.allocation.enable": null
  }
}

after each node, only so the operator can reset it to primaries as soon as the next node is restarted.

Is this a known issue? For me this issue has been around since the beginning and no update has solved it so far.

system · January 11, 2022, 8:41am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Restarting nodes with allocation disabled Elasticsearch	2	3486	July 6, 2017
Cluster.routing.allocation.enable behavior (sticky shard allocation not working as expected) Elasticsearch	9	1167	July 6, 2017
Rolling restart Elasticsearch	5	438	July 6, 2017
Stopping a cluster to balance shards when restarting a node Elasticsearch	2	1229	July 5, 2017
Trying to optimize configuration for better cluster restart/recovery Elasticsearch	8	658	July 6, 2017

ECK Operator not resetting shard_allocation setting to "all"

Related topics