How Elasticsearch snapshot works when segments are merged

bbking · March 18, 2019, 7:55pm

Hi, my current understanding is that snapshot is incremental file/segement by file used by indices. When segments are merged even if no new data are indexed to indices, snapshot will captue the difference due to segment change.

Does that mean it will copy duplicate data to snapshot repository?
When I restore indices by those snapshot that contains duplicate segment, will it result in duplicate documents?

dadoonet · March 18, 2019, 8:46pm

Yes
No. When you restore, only the right segments are restored not the old ones.

bbking · March 18, 2019, 9:03pm

How does it know the right segment to restore?

Say snapshot1 contains segment1 and segment2. Then, segmengt1 and 2 are merged to segment3. Snapshot2 will contain segment3. If I restore based on those two snapshot, how doe it know which is the correct segment to use?

dadoonet · March 18, 2019, 9:21pm

snapshot2 knows that segment3 is used and not segment 1 and 2. So if you restore Snapshot2, only segment 3 is restored. You don't restore 2 snapshots.

Think of a snapshot as a full backup.

bbking · March 18, 2019, 9:57pm

Just want to make sure I understand it correctly. Each snapshot will take a copy of entire cluster but will only copy the delta between the latest and current snapshot. Does that mean we only need the most recent snapshot for backup and it is safe to delete all old snashot?

dadoonet · March 18, 2019, 10:09pm

This is correct.

system · April 15, 2019, 10:09pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Is it possible to only restore the incremental portion of snapshot? Elasticsearch	5	3932	September 8, 2017
Question about Snapshot and Restore Elasticsearch	5	359	October 16, 2020
Basic question about snapshots Elasticsearch	8	1022	February 1, 2017
Elasticsearch Incremental Snapshot Elasticsearch	9	612	July 5, 2017
Incremental Snapshot details? Elasticsearch	6	1768	September 12, 2017

How Elasticsearch snapshot works when segments are merged

Related topics