What are the best practices around increasing the replica count drastically

UDixit · February 26, 2020, 12:32am

I want to understand the implications of drastically increasing ReplicaCount eg:0->5 on Indexing requests.
Since we have only 1 available copy, and the quorum has suddenly changed to 4 (n/2+1)
the indexing requests are bound to fail.

What are the suggested way to increase the replica count such that minimum indexing requests fail?

DavidTurner · February 26, 2020, 12:59am

Indexing does not use a quorum-based system. Increasing the number of replicas will not cause any indexing requests to fail.

UDixit · February 26, 2020, 1:06am

I keep seeing message of the following type when indexing during a scaleup:

! org.elasticsearch.action.UnavailableShardsException: [indexname][0] Not enough active copies to meet write consistency of [ALL] (have 1, needed Quorum). Timeout: [1m], request: index
! at

Christian_Dahlqvist · February 26, 2020, 5:34am

Which version of Elasticsearch are you using? If I recall correctly default write quorum kicks in once you reach 2 replicas, at least on older versions. If this is the case you may want to increase the replica count gradually.

It looks like this still is tunable in current versions but now defaults to not waiting for a quorum of replicas.

If you are on an old version suffering from this I would however also strongly recommend upgrading.

DavidTurner · February 26, 2020, 7:39am

I stand corrected Since you didn't mention you were using a very old version I assumed you were asking about something recent. This message comes from a version that is well past the end of its life.

The simplest answer is probably to set write consistency to 1.

Christian_Dahlqvist · February 26, 2020, 8:02am

I wonder if the default might have been changed with the introduction of sequence numbers as it makes recovery faster?

DavidTurner · February 26, 2020, 9:14am

The notion of write consistency was removed in 5.0.0 as it doesn't really do what you might expect. IMO the in-sync set mechanism was really the feature that made it unnecessary, although the 6.x series further strengthened the guarantees in this area thanks to sequence numbers.

UDixit · February 26, 2020, 7:54pm

Thanks for the reply guys.
So glad to hear that this is not the case with ES7. I'll start my experiments with ES7.

As per ES7 documentation, the writes are acknowledged as long as the primary is available(wait_for_active_shards=1)

So when a Primary goes down after acknowledging some writes; how do we ensure that:

Stale replicas will not be promoted (since there is no concept of quorum)
Data loss doesn't occur.

DavidTurner · February 26, 2020, 8:04pm

The in-sync set mechanism that I mentioned above is what makes sure we never promote a stale replica.

system · March 25, 2020, 8:04pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
2.3.4 write behaviour with replicas set to 2 Elasticsearch	3	767	September 12, 2017
Error on creating replicas Elasticsearch	4	375	July 5, 2017
Risk associated with action.write_consistency and index.recovery.initial_shards for cluster recovery with a single node Elasticsearch	2	1932	July 5, 2017
Write failure handling in elasticsearch Elasticsearch	2	1122	July 5, 2017
Not enough active copies to meet write consistency of [QUORUM] (have 2, needed 3) Elasticsearch	2	4704	July 5, 2017

What are the best practices around increasing the replica count drastically

Related topics