Snapshot management on S3

robhuang · May 30, 2020, 3:42am

We have ~200tb snapshot stored on S3 now, and I am looking for someways to clean it up.

I read the official doc: Snapshots are taken incrementally. This means that when it creates a snapshot of an index, Elasticsearch avoids copying any data that is already stored in the repository as part of an earlier snapshot of the same index.

Let's say we have snapshot1 for Day1, and snapshot2 for Day2. Does this incrementally mean snapshot2 is relying on snapshot1, where deleting snapshot1 makes snapshot2 unrecoverable?

If snapshot1 and snapshot2 are independent from disk storage perspective, can I simply delete snapshot1 once snapshot2 is created, which is covering 100% data of snapshot1 (assuming ES cluster only ingests, and does not delete data)?

Christian_Dahlqvist · May 31, 2020, 3:26pm

Snapshot 2 will link to ant segments within Snapshot 1 that did not change between the snapshots and will not copy these again. Removing Snapshot 1 will however not remove these segments as Snapshot 2 now uses them This is described quite well in this old blog post.

system · June 28, 2020, 3:26pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
ES Snapshot Delete Elasticsearch	6	836	September 7, 2020
Snapshot backup to AWS s3 Elasticsearch	5	441	March 8, 2019
Elasticsearch snapshot working Elasticsearch snapshot-and-restore	4	281	March 21, 2023
Backup overview and strategy Elasticsearch	3	322	January 16, 2019
Elasticsearch Incremental Snapshot Elasticsearch	9	612	July 5, 2017

Snapshot management on S3

Related topics