With the default settings, a write only requires the primary shard to be active. During a rolling restart, the node holding that primary can go down while the primary holds the latest data that the replicas have not yet synced. When that happens, one of the replica shards is promoted to primary even though it is missing the latest writes.
The index.write.wait_for_active_shards setting is only checked before the write is forwarded to the replicas. Is there any stronger guarantee that verifies the data has actually been persisted on the replicas after the write is forwarded, before the response is returned?
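For reference, here is a minimal sketch of raising that setting on an index, assuming a hypothetical index named my-index on a local cluster. With "all", the write is rejected up front unless every shard copy is active, but as noted above this is only a pre-flight check, not an acknowledgement from the replicas:

```python
import requests

# Hypothetical index name and cluster address; adjust to your environment.
# "all" means a write is rejected unless every copy (primary + replicas) is
# active when the request arrives. This is checked *before* the write is
# forwarded, so it is not an end-to-end acknowledgement from the replicas.
resp = requests.put(
    "http://localhost:9200/my-index/_settings",
    json={"index": {"write": {"wait_for_active_shards": "all"}}},
)
resp.raise_for_status()
print(resp.json())
```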
Kafka, by comparison, has the min.insync.replicas and acks settings, which strongly guarantee that the data has been written to multiple replicas before the response is returned to the user.
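For comparison, a rough sketch of the Kafka side using the kafka-python client, assuming a hypothetical topic my-topic whose min.insync.replicas is set to 2 on the broker. With acks="all", the send only succeeds once the in-sync replicas have persisted the record:

```python
from kafka import KafkaProducer
from kafka.errors import NotEnoughReplicasError

# Hypothetical broker address and topic; min.insync.replicas=2 is assumed to
# be configured on the topic/broker side. acks="all" makes the broker
# acknowledge only after all in-sync replicas have written the record.
producer = KafkaProducer(bootstrap_servers="localhost:9092", acks="all")
try:
    future = producer.send("my-topic", b"some payload")
    metadata = future.get(timeout=10)  # blocks until the ISR has acknowledged
    print(metadata.topic, metadata.partition, metadata.offset)
except NotEnoughReplicasError:
    # Raised when fewer than min.insync.replicas copies are in sync.
    print("write rejected: not enough in-sync replicas")
producer.flush()
```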
For example: if the in-sync replica group contains only one copy, i.e. only the primary is working, and the node that owns the primary suffers an unrecoverable disaster, that primary's data is lost permanently.
Perhaps a simpler example of independent failures leading to data loss is if you have a primary and a replica on distinct nodes and both of them encounter an unrecoverable disaster at the same time. At least in cases like this Elasticsearch will tell you that data was lost, rather than carrying on regardless. You cannot in general protect against collections of independent failures.