Write failure handling in elasticsearch

Vikas_Kumar · October 18, 2016, 9:22am

How are write failures handled in elasticsearch, particularly cases where a write op succeeds on primary, but one or more replicas fail to respond (due to network or any other issue)?

Will the write/update stay on replicas where it succeeded? Even in cases where quorum is not met? How will it impact subsequent searches?

danielmitterdorfer · October 19, 2016, 1:22pm

Hi @Vikas_Kumar,

As you mention quorum, I guess you are referring to the action.write_consistency setting. quorum is the default, so if fewer than a quorum of the shard copies are available, the write is not attempted and is reported as unsuccessful (see a few more details in the Definitive Guide). However, note that this is only a pre-check before the actual replication takes place; it does not guarantee that the write then succeeds on that many copies.
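
To make the pre-check concrete, here is a rough sketch in Python. It follows the quorum formula given in the Definitive Guide, int((primary + number_of_replicas) / 2) + 1, together with the guide's note that quorum is only enforced when number_of_replicas is greater than 1. The function names are made up for illustration; this is not the actual Elasticsearch code.

```python
def required_copies(number_of_replicas: int, consistency: str = "quorum") -> int:
    """Active shard copies needed before a write is attempted.

    Hypothetical helper, modelled on the documented behaviour of
    action.write_consistency (values: one, quorum, all).
    """
    total = 1 + number_of_replicas  # one primary plus its replicas
    if consistency == "one":
        return 1  # the primary alone is enough
    if consistency == "all":
        return total
    if consistency == "quorum":
        # Quorum is only enforced with more than one replica, so that a
        # default index (1 replica) still works on a single-node cluster.
        if number_of_replicas <= 1:
            return 1
        return total // 2 + 1
    raise ValueError(f"unknown consistency level: {consistency}")


def precheck_passes(active_copies: int, number_of_replicas: int,
                    consistency: str = "quorum") -> bool:
    """True if enough shard copies are active to attempt the write."""
    return active_copies >= required_copies(number_of_replicas, consistency)
```

For example, with number_of_replicas set to 2 there are three copies in total, so two of them must be active for the pre-check to pass. What happens to the individual copies after that point is a separate matter.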

Writes are kept on the replicas where they succeeded. If one of the replicas fails, the master assigns a new replica, and the shard in question is then synced to that new replica.
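
The following toy model in Python illustrates that flow. It is a deliberately simplified sketch: the class, its method names and the "new-N" replica naming are all invented for illustration, and it ignores everything a real cluster has to deal with (timeouts, the master, translog, concurrent writes).

```python
class ToyShardGroup:
    """Toy model of one primary shard and its replicas (not Elasticsearch code)."""

    def __init__(self, replica_names):
        self.primary = {}
        self.replicas = {name: {} for name in replica_names}
        self._new_count = 0

    def write(self, doc_id, doc, failing=()):
        """Apply a write on the primary, then forward it to each replica.

        `failing` names the replicas that do not respond to this write.
        """
        self.primary[doc_id] = doc
        for name in list(self.replicas):
            if name in failing:
                # The failed replica is dropped and a fresh copy is
                # allocated, which is synced from the primary.
                del self.replicas[name]
                self._new_count += 1
                self.replicas[f"new-{self._new_count}"] = dict(self.primary)
            else:
                # The write stays on every replica where it succeeded.
                self.replicas[name][doc_id] = doc


group = ToyShardGroup(["r1", "r2"])
group.write("1", {"user": "vikas"}, failing={"r2"})
```

After this write, r1 still holds the document, r2 is gone, and the replacement copy has the same content as the primary.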

However, consistency in distributed systems is a hard problem and we document known edge cases and Elasticsearch's behavior / the status of our fixes on the resiliency page.

Daniel
