What is the allocation process when a primary shard goes down?

yehosef · December 10, 2015, 7:35am

I've seen lots of people talk about wanting primary shards evenly distributed, or kept off certain nodes, and the general answer given is "why? It doesn't matter, the replica does the same work as the primary".

But I'm curious about the effect of a primary shard going down vs. a replica. What happens when a primary shard goes down? What happens to writes that arrive while the cluster is reallocating, and how intensive is the reallocation process?

dadoonet · December 10, 2015, 7:48am

When a primary goes down, the master node automatically promotes one of its replicas to primary.
The master then allocates a new replica elsewhere in the cluster, and the shard data is copied to it over the wire.

Write operations are still possible during this time because you still have a working copy of the shard in the cluster (the index is in yellow state).
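
To make the two steps above concrete, here is a minimal toy model in Python. It is only a sketch of the idea, not how Elasticsearch implements it: the `ShardCopy` class, the function names, and the node names are all hypothetical, and the "first survivor wins" promotion rule is an assumption made for simplicity.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class ShardCopy:
    node: str       # hypothetical node name
    primary: bool   # True for the primary, False for a replica


def health(copies, wanted_replicas=1):
    """Rough analogue of index health for a single shard."""
    if not any(c.primary for c in copies):
        return "red"      # no primary at all
    replicas = sum(1 for c in copies if not c.primary)
    return "green" if replicas >= wanted_replicas else "yellow"


def promote_after_failure(copies, failed_node):
    """Drop copies on the failed node; promote a survivor if the primary was lost."""
    survivors = [c for c in copies if c.node != failed_node]
    if survivors and not any(c.primary for c in survivors):
        # Assumption: simply pick the first surviving replica.
        survivors[0] = replace(survivors[0], primary=True)
    return survivors


def allocate_replica(copies, candidate_nodes):
    """Place one new replica on the first candidate node that holds no copy yet."""
    used = {c.node for c in copies}
    for node in candidate_nodes:
        if node not in used:
            return copies + [ShardCopy(node, primary=False)]
    return copies  # nowhere to put it; shard stays yellow


copies = [ShardCopy("node-a", True), ShardCopy("node-b", False)]
after_failure = promote_after_failure(copies, "node-a")
recovered = allocate_replica(after_failure, ["node-b", "node-c"])
```

In this model the shard is "yellow" right after the promotion and goes back to "green" once the new replica exists on `node-c`.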

yehosef · December 10, 2015, 8:46am

Thanks for the answer!

But I'm still not 100% clear. I have a working cluster with a primary and a replica shard. When the primary goes down, how does the master know? I assume some time passes before that node is marked as "down" and the replica is promoted, no? Otherwise a small network pause would cause failovers. How long is that window, and what happens during it?

dadoonet · December 10, 2015, 9:20am

First, a shard usually becomes unavailable because the node holding it has stopped.
The master regularly pings all nodes to check whether they are still alive, every second by default: https://www.elastic.co/guide/en/elasticsearch/reference/current/modules-discovery-zen.html#fault-detection
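
As a rough illustration of how long detection can take, here is a small Python sketch. The one-second interval is the value mentioned above; the 30-second ping timeout and 3 retries are the zen fault-detection defaults as I recall them from the linked page, so check them for your version. The formula itself is my own simplifying assumption (every ping waits out the full timeout), not something taken from the Elasticsearch source.

```python
from dataclasses import dataclass


@dataclass
class FaultDetection:
    # Mirrors the discovery.zen.fd.* settings; defaults are assumed, verify for your version.
    ping_interval_s: float = 1.0   # how often a node is pinged
    ping_timeout_s: float = 30.0   # how long to wait for each ping response
    ping_retries: int = 3          # failed pings before the node is considered down

    def worst_case_detection_s(self) -> float:
        """Upper-bound estimate for a node that silently stops answering.

        Assumption: each of the retries waits for the full timeout, and the
        failure happens right after a successful ping.
        """
        return self.ping_retries * self.ping_timeout_s + self.ping_interval_s


defaults = FaultDetection()
aggressive = FaultDetection(ping_timeout_s=5.0)
```

With these assumed defaults the estimate is about a minute and a half, which is why a short network pause does not immediately trigger a failover.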

When you send an index request, you send it to a coordinating node. This node first tries to reach the primary shard. If the node holding the primary is down, the coordinating node keeps retrying for up to one minute by default. See https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-index_.html#timeout
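
The retry-until-timeout behaviour can be pictured with the following Python sketch. It is a toy model under stated assumptions: `index_with_retry`, `primary_available`, and the fixed one-second polling interval are all hypothetical, and as far as I understand the real coordinating node reacts to cluster state changes rather than polling. Only the one-minute default comes from the `timeout` parameter described on the linked page.

```python
def index_with_retry(primary_available, timeout_s=60.0, retry_every_s=1.0):
    """Simulate waiting for a primary to become available.

    primary_available: callable taking elapsed seconds and returning True
                       once a primary can accept the write (hypothetical).
    Returns a (status, elapsed_seconds) tuple.
    """
    elapsed = 0.0
    while elapsed <= timeout_s:
        if primary_available(elapsed):
            return ("indexed", elapsed)
        elapsed += retry_every_s  # simulated clock, no real sleeping
    return ("timed_out", timeout_s)


# A replica is promoted 5 simulated seconds after the failure.
promoted_after_5s = index_with_retry(lambda t: t >= 5.0)

# No primary ever comes back within the timeout.
never_promoted = index_with_retry(lambda t: False)
```

In the first case the write succeeds once the promotion has happened; in the second the request fails when the timeout expires. Passing a larger `timeout` on the index request would lengthen that window.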

yehosef · December 10, 2015, 12:57pm

Thanks for the additional information - very helpful.

What happens during the minute before the timeout? Is the request queued at the coordinating node? What is the queue size?
