Will ES discard redundant replicas?

haizaar · February 24, 2016, 10:35am

Suppose I have a cluster with nodes N1 and N2 hosting index I1 with one primary shard S1 and one replica R1.

The scenario is as follows:

Shutdown node N2
Launch new, fresh node N3
Wait until ES creates R3 on S3
Boot N2.

No we have a scenario where I1 has two replicas instead of one.

Will ES be smart enough to discard redundant replica?
What happens if N2 comes back while ES is still in the process of copying S1 to R3? - Will it abort copying and use R2?

I'm talking about ES 2.x. Answers regarding 5.x are welcome as well.

Thank you,
Zaar

nik9000 · February 25, 2016, 4:12pm

Sure.

Its complicated. Newer versions of Elasticsearch can cancel replication in progress and I believe in 2.x its possible for it to do this abort but I could be wrong. I don't know that code super well.

When N2 comes back Elasticsearch has to make sure that R2 is the same as R1. It can do this in two ways:

the files are the same
synced flush

If it doesn't see R2 as the same as R1 then it'll take into account how much of it is the same when determining which shard is "further along".

haizaar · February 25, 2016, 4:24pm

Great. Thank you!

Topic		Replies	Views
3 node ES cluster...one node only holds replicas Elasticsearch	10	2097	July 5, 2017
Shard synchronization behaviour Elasticsearch	4	586	May 16, 2017
Multiple indexes break elasticsearch (2.3.1) cluster replication? Elasticsearch	5	395	January 18, 2019
Need Clarification on Shards Replication Elasticsearch	7	485	July 6, 2017
Assigning Shards in ElasticSearch Cluster Elasticsearch	2	260	March 23, 2022

Will ES discard redundant replicas?

Related topics