Backup large indexes to other locations on a time-by-time basis and clean up the index

Christian_Dahlqvist · January 4, 2024, 7:42am

If your data is immutable I would recommend you switch to using time-based indices, e.g. data streams. With this approach all new data is written to the newest backing index and when they reach a certain size a new backing index is created behind the scenes and writing of new data switches to this. This means that only a small portion of indices are actively written to, which means they can be optimised, e.g. through forcemerge. This approach also allows you to manage retention by deleting complete indices, which is a lot more efficient that deleting data through delete-by-query. You can do this through index lifecycle management.

You can then back up old indices through the snapshot API and use the restore API to load them at a later date if needed.

Topic		Replies	Views
Purge elasticsearch data older than 1month Elasticsearch	4	979	April 11, 2024
Elastic search snap shot and backup of data of particular time interval Elasticsearch	1	357	July 4, 2018
Options to Backup data from ElasticSearch Elasticsearch	3	785	July 5, 2017
How to decrease ES backup duration? Elasticsearch	11	1227	July 5, 2017
Backup Old data Instead of Deleting Elasticsearch	0	22	December 4, 2024

Backup large indexes to other locations on a time-by-time basis and clean up the index

Related topics