Shard synchronization behaviour

gilbertotcc · April 12, 2017, 12:19pm

I'm learning how ES works, reading the Elasticsearch Reference and experimenting its main features. What I did not find in the guide and on the web is how ES behaves when it has to synchronize two shard in scenarios where a shard is not aligned with its primary.

Here an example. I have two nodes N_1 and N_2 and, for simplicity, one-shard index, . Let assume I shutdown N_2 for whatever reason and index some documents in N_1 shard. When I reboot N_2 how does ES synchronize the replica shard in N_2? Does it transfer only changes (files?, documents?) or the shard at all?

Where I can find more information about that topic (references to source code are valid too)?

Thank you in advance for your help.

s1monw · April 18, 2017, 8:33am

until ES 5.x we try to detect if we need to resync and if so we have to transfer the entire shard. In 6.x there will be improvements made to this to only resync deltas if possible (after a long enough period that won't be possible either)

bleskes · April 18, 2017, 8:50am

If you want to know more about the 6.x work and how it differs from the current approach, I highly suggest watching the 3rd part of this Elastic{ON} talk: https://www.elastic.co/elasticon/conf/2017/sf/consensus-and-reception-in-elasticsearch

It covers this in details.

gilbertotcc · April 18, 2017, 10:43am

Thank you very much. I'll watch the talk you've suggested me.

system · May 16, 2017, 10:58am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Replicas synchronization Elasticsearch	5	3532	July 5, 2017
Shard rebalancing after node restart Elasticsearch	2	771	July 5, 2017
Nodes Out of Sync Elasticsearch	7	3507	January 5, 2018
Will ES discard redundant replicas? Elasticsearch	3	408	July 5, 2017
ES 5.X - Primary and replica shards not in sync Elasticsearch	8	5379	November 1, 2017

Shard synchronization behaviour

Related topics