Why data is rebuild during node restart

Jason_Wee · July 18, 2015, 9:41am

Hi,

When a es node (es 0.90.7) restart, the restarted node will rebuild shards using peers replica. Is the rebuild from size 0byte until it is balance with other peers? Why is this done so? is it because of the stale data even within few second restart?

Another question is, considering full cluster restart during upgrade. What's gonna happen to the data in the cluster? Will the cluster think all the data are stale and wipe all the data in this situation?

Thank you.

warkolm · July 18, 2015, 10:42am

Older versions of ES weren't as smart they could have been with recovery, so they will retrieve the entire shard dataset from the current primary.
If you restart the entire cluster then you just have to wait through recovery, data won't be deleted by ES.

That's a pretty old version by the way, you should really upgrade!

Jason_Wee · July 18, 2015, 11:14am

do you have reference (book or code) for this? would definitely be very helpful.

did that before, but too many exceptions. it's a pity nonetheless.

Topic		Replies	Views
Shard rebalancing after node restart Elasticsearch	2	771	July 5, 2017
What happens if data folder is wiped out on individual nodes Elasticsearch	12	1554	December 25, 2018
Why ES node starts recovering all the data from other nodes after reboot? Elasticsearch	17	510	July 6, 2017
[Elasticsearch 5.5] when node left and rejoin, all data in the node gone Elasticsearch	5	223	May 5, 2022
ElasticSearch cluster restarting/adding node(s) Elasticsearch	11	1251	April 4, 2018

Why data is rebuild during node restart

Related topics