Delay write to replicas while indexing (not initial data population)

stretchy · October 13, 2022, 2:23pm

Hi,
Is there a way to delay writing to replicas while indexing?
When indexing data continuously, can we reply to the client after writing only to the primary shard, and write to the replicas a little later?

Things to note:

Replicas cannot be reduced to zero, since data is arriving continuously (this is not an initial data population).
I've looked into index.translog.durability: async and increasing index.translog.sync_interval.

Another question: does increasing index.translog.sync_interval mean that data is held in the cache of the primary shard's node and flushed to disk for both primary and replica after this interval, or is the data held in the caches of both the primary and the replica nodes?

Thanks
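
For reference, here is a minimal sketch of the settings body for the two translog options mentioned above. The helper name translog_settings and the 30s value are illustrative assumptions, not recommendations. The body would be sent with the update index settings API (PUT /<index>/_settings), and whether sync_interval can be changed on an open index may depend on the Elasticsearch version.

```python
import json


def translog_settings(durability: str = "async", sync_interval: str = "5s") -> dict:
    """Build an index-settings body for the two translog options (hypothetical helper)."""
    # The documented values for index.translog.durability are "request" and "async".
    if durability not in ("request", "async"):
        raise ValueError("durability must be 'request' or 'async'")
    return {
        "index": {
            "translog": {
                "durability": durability,
                "sync_interval": sync_interval,
            }
        }
    }


# Example body; "30s" is an assumed value chosen only for illustration.
body = translog_settings("async", "30s")
print(json.dumps(body, indent=2))
```

With durability set to async, the translog is fsynced in the background every sync_interval rather than after every request, so writes acknowledged since the last sync can be lost on a hardware failure.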

warkolm · October 13, 2022, 10:45pm

Why do you want to do this?

stretchy · October 14, 2022, 12:26am

To increase the indexing performance of one index.

stephenb · October 14, 2022, 12:47am

That is exactly the default behavior of Elasticsearch on index operations.
See here:

By default, write operations only wait for the primary shards to be active before proceeding (i.e. wait_for_active_shards=1). This default can be overridden in the index settings dynamically by setting index.write.wait_for_active_shards. To alter this behavior per operation, the wait_for_active_shards request parameter can be used.
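
A minimal sketch of the two places the quoted passage mentions: the per-request parameter and the index-level default. The helper names, the index name my-index, and the document ID are assumptions for illustration; only the parameter and setting names come from the quote. Per the documentation, the active-shards check runs before the write operation begins.

```python
from urllib.parse import quote, urlencode


def index_doc_request(index: str, doc_id: str, wait_for_active_shards: str = "1"):
    """Return (method, path) for an index request with the per-operation parameter."""
    path = f"/{quote(index)}/_doc/{quote(doc_id)}"
    query = urlencode({"wait_for_active_shards": wait_for_active_shards})
    return "PUT", f"{path}?{query}"


def index_default_settings(wait_for_active_shards: str) -> dict:
    """Return a settings body that overrides the index-level default."""
    return {"index.write.wait_for_active_shards": wait_for_active_shards}


# Per-request override (hypothetical index name and document ID).
print(index_doc_request("my-index", "1", "1"))
# Index-level default, to be sent with PUT /my-index/_settings.
print(index_default_settings("2"))
```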

Now, that IS different from setting replicas to 0 while indexing...

The difference is that resources will still be used to replicate the data to the replica shards

That is why sometimes it makes sense to set replicas to 0 when indexing... perhaps that is what you are thinking of...

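But then you say this:

"Replicas cannot be reduced to zero. Since data is continuously coming (not initial data population)"

Which I do not understand... data coming in continuously is not a requirement for having replicas...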

All that said, I would look at How to Tune for Indexing Speed and start with that.
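
As a rough sketch of two suggestions from that guide, the settings bodies below lengthen the refresh interval and, optionally, drop replicas for the duration of a load. The helper names and all values are assumptions for illustration. The restore values in particular should be whatever the index normally uses.

```python
def bulk_load_settings(drop_replicas: bool, refresh_interval: str = "30s") -> dict:
    """Settings to apply before a heavy indexing run (hypothetical helper)."""
    settings = {"index.refresh_interval": refresh_interval}
    if drop_replicas:
        # Only appropriate when losing the single copy is acceptable,
        # for example during an initial load that can be replayed.
        settings["index.number_of_replicas"] = 0
    return settings


def restore_settings(replicas: int = 1, refresh_interval: str = "1s") -> dict:
    """Settings to put back afterwards; the defaults here are assumed values."""
    return {
        "index.number_of_replicas": replicas,
        "index.refresh_interval": refresh_interval,
    }


# Continuous ingestion that must keep its replicas: change only the refresh interval.
print(bulk_load_settings(drop_replicas=False))
# Replayable initial load: replicas can be dropped and restored afterwards.
print(bulk_load_settings(drop_replicas=True))
print(restore_settings())
```

Both bodies would be sent with PUT /<index>/_settings. For the case described in this thread, where replicas must stay, only the refresh interval part applies.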

system · November 11, 2022, 12:47am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
