What happens if primary shard fail during publication?

lotabout · August 17, 2019, 2:26pm

As I understand ES's synchronization is kind of like 2 Phase Commit(2PC) and the problem of 2PC is when the "master" node fails during commit.

Quote from Github Reply:

An indexing request goes through the following process:

Written to the translog on the primary

indexed on the primary

written to the translog on each replica shard

indexed on each replica shard

once all replicas have responded, the request returns to the client

So as long as the replicas remain alive, the change will be persisted on the replica.

So what happens if:

Primary dies at phase #2 when all replicas are in old state.
Primary dies at phase #4, when some replicas are in new state and some are in old state.

Thanks!

DavidTurner · August 26, 2019, 2:10pm

ES's synchronisation is not really anything like 2PC, if only because it has a single phase

If the primary dies at phase 2 then a replica is promoted to primary in its place. The operation was indexed on the now-dead primary but not the replica, nor was it acked to the client.

If the primary dies at phase 4 then a replica is promoted to primary in its place. The other replicas then roll back to an earlier state and recover any missing operations from the new primary so as to be sure that they end up in the same state.

In both cases the client might see an exception, or Elasticsearch might retry the operation on the new primary and return a successful response to the client.

lotabout · August 26, 2019, 2:43pm

Thank you very much for clarifying the index process.

I've did some researching after post the question and learnt that ES's synchronization is a partial implementation of the PacificA algorithm. Is it correct?

Thus I'd like to confirm an additional state: Primary dies at phase #3. My assumption is:

A new replica is promoted as the new primary, but there is no guarantee that the request was successfully written to it.
Thus the client would receive exception, but the request might or might not be indexed depends on whether the translog was written to the new primary or not.

Is my understanding correct?

DavidTurner · August 26, 2019, 2:47pm

They're quite closely related.

The client might receive an exception, or Elasticsearch might retry internally on the new primary and return a successful response to the client.

system · September 23, 2019, 2:47pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Primary and Replica Shard Sync Elasticsearch	12	3735	July 5, 2017
What happens when synchronization between the primary and replica shards is lost? Elasticsearch	5	59	October 16, 2024
Elasticsearch data consistency Elasticsearch	3	1627	December 14, 2016
Shard synchronization behaviour Elasticsearch	4	586	May 16, 2017
Restarting ES on a node Elasticsearch	2	385	April 17, 2018

What happens if primary shard fail during publication?

Related topics