We use HDFS as our Elasticsearch snapshot repository for backup and restore. We have already installed the 'elasticsearch-repository-hdfs' plugin on all of the Elasticsearch nodes.
I have the following question:
Since the backup process is incremental, if the snapshot files in HDFS are compressed, for example into 'har' (Hadoop Archive) format, can we still continue backing up new snapshots?
If not, how can we keep backing up when the existing snapshots have been compressed?
Snapshot/restore works only if the underlying storage is not touched. Moving the files, compressing them, etc. changes their content, and at that point ES can no longer use them.
Furthermore, since the snapshots are incremental, each snapshot depends on the previous one: when you delete a snapshot, ES makes the right choice and deletes only the data that is no longer needed. However, this is something ES already handles itself, and any interference with it will screw things up.
Now, you could manage the snapshots manually: create a repository, take some snapshots, then close it and move/compress the files. But that means that before the next snapshot or a restore you have to undo those steps, since otherwise ES will not find the previous snapshots and will do a full backup rather than an incremental one. A sketch of this workflow follows below.
Note that since ES backs up only the delta, new backups are as cheap as they can be.
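Something like the following is a minimal Python sketch of that manual workflow, using the snapshot REST API over HTTP. It assumes a node reachable at localhost:9200; the repository name `my_hdfs_repo`, the HDFS URI, and the paths are hypothetical placeholders, so adjust them for your cluster:

```python
import requests

ES = "http://localhost:9200"

# 1. Register the HDFS repository (assumes the HDFS repository plugin
#    is installed on all nodes, as in the question above).
requests.put(f"{ES}/_snapshot/my_hdfs_repo", json={
    "type": "hdfs",
    "settings": {
        "uri": "hdfs://namenode:8020",  # hypothetical namenode address
        "path": "elasticsearch/repositories/my_hdfs_repo",
    },
}).raise_for_status()

# 2. Take snapshots while the repository files stay untouched.
requests.put(
    f"{ES}/_snapshot/my_hdfs_repo/snapshot_1",
    params={"wait_for_completion": "true"},
).raise_for_status()

# 3. Deregister the repository before archiving/compressing the files.
#    (DELETE removes only the registration; the snapshot data stays in HDFS.)
requests.delete(f"{ES}/_snapshot/my_hdfs_repo").raise_for_status()

# ... compress/move the files in HDFS here, e.g. into a .har archive ...

# 4. Before restoring, or before taking further incremental snapshots,
#    put the files back in their original layout and re-register the
#    repository (repeat step 1); only then can ES read the existing
#    snapshots again and back up just the delta.
```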