Node failure and recovery

wrinehart · November 1, 2017, 4:57pm

Howdy,

I have a cluster in Elasticsearch version 1.4.4 that is currently under-provisioned. Currently, when a node falls off the cluster the other nodes start reassigning shards from the failed nodes to the other nodes. This jacks up the heap/disk space used on the other nodes. I have the gateway.recover_after_nodes set so I thought that it would wait until the node had rejoined and then reinitialize the shards to the node that had failed, but that doesn't seem to be the case (i.e. when you disable shard allocation and restart a node).

Is there a setting that would wait and then reinitialize to the node that had fallen off the cluster and then rejoined?

I have these set:
gateway.recover_after_nodes: 8
gateway.expected_nodes: 8

I am mostly just curious if I'm using these settings wrong or there is a bug in this version. I am working on upgrading the cluster to version 5.x but it won't be immediate so this would be triage in the meantime.

Thanks,
Walt

shanec · November 1, 2017, 9:17pm

What you're looking for is delayed shard allocation (5.x version docs | 1.7 version docs). That functionality was added in 1.7.0: https://www.elastic.co/blog/elasticsearch-1-7-0-and-1-6-1-released

I'd recommend at least upgrading to 1.7 for a variety of reasons, not least of which includes 1.4 has multiple security vulnerabilities associated with it. But really your 5.x plan is much better.

wrinehart · November 2, 2017, 12:06am

Ah yes, I remember that's what it is called now. I definitely have clusters across all the major versions so it's easy for me to mix the settings up.

Thanks!

system · November 30, 2017, 12:06am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Quick recovery after node restart in elasticsearch Elasticsearch	5	2238	July 6, 2017
Unnassigned Shards After Node Restart Elasticsearch	3	518	July 5, 2017
Elasticsearch rolling restart recovery is slow Elasticsearch	3	1239	January 10, 2020
ES Cluster Recovery and Restart Elasticsearch	3	586	July 6, 2017
What's the secret to fast recovery when adding a new node? Elasticsearch	1	137	October 17, 2023

Node failure and recovery

Related topics