Offloading old data to cheaper storage

andywt123 · August 31, 2020, 7:15pm

We are currently using Elasticsearch to aggregate logs shipped by Filebeat. We want to offload data that is older than 60 days and store it in AWS S3. The data does not need to be searchable, but we may need to retrieve it 6 months or a year from now. Once retrieved, the data can be loaded into another cluster or the current one, whichever is easier. Can someone point me to a blog post or documentation? Any advice would be appreciated.

warkolm · August 31, 2020, 9:41pm

We are working towards a solution here called searchable snapshots.

In the meantime, you can take snapshots (i.e., backups) of your indices, store them in S3, and then manually restore them when you need them.
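As a rough sketch of what taking such a snapshot might look like against the snapshot REST API (`PUT _snapshot/<repo>` to register a repository, `PUT _snapshot/<repo>/<snapshot>` to create a snapshot): the cluster address, repository name, and bucket name below are placeholders, not anything from this thread, and the `s3` repository type assumes S3 repository support is installed on every node (on 7.x that means the `repository-s3` plugin) with AWS credentials already configured.

```python
import json
import urllib.request

ES_URL = "http://localhost:9200"   # assumed cluster address
REPO = "logs_archive"              # hypothetical repository name
BUCKET = "my-log-archive-bucket"   # hypothetical S3 bucket name


def register_repo_request(repo, bucket):
    """Build the request that registers an S3 snapshot repository."""
    body = {"type": "s3", "settings": {"bucket": bucket}}
    return "PUT", f"/_snapshot/{repo}", body


def create_snapshot_request(repo, snapshot, indices):
    """Build the request that snapshots only the given indices.

    include_global_state=False keeps cluster-wide state (templates,
    persistent settings) out of the archive snapshot.
    """
    body = {"indices": ",".join(indices), "include_global_state": False}
    path = f"/_snapshot/{repo}/{snapshot}?wait_for_completion=true"
    return "PUT", path, body


def send(method, path, body, base_url=ES_URL):
    """Send one JSON request to the cluster and return the decoded response."""
    req = urllib.request.Request(
        base_url + path,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method=method,
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Something like `send(*register_repo_request(REPO, BUCKET))` followed by `send(*create_snapshot_request(REPO, "logs-2020.06", ["filebeat-*-2020.06.*"]))` would then archive one month of indices, assuming your index names carry the date.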

Steve_Mushero · September 1, 2020, 3:33am

Note that if you are using time-based indices (e.g., one per day or week), you can snapshot just part of your cluster, i.e., only the indices older than 60 days.

That can be separate (different repo) from the normal full-cluster snapshots I hope you are doing already.
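To pick out "the ones older than 60 days", something along these lines could work. It is only a sketch and assumes daily indices whose names end in a `YYYY.MM.DD` date (e.g. `filebeat-7.9.0-2020.08.31`); if your indices are rolled over by ILM and end in a sequence number instead, the name no longer tells you the age and you would need the index creation date.

```python
import re
from datetime import date, timedelta

# Assumed naming scheme: index names end in a YYYY.MM.DD date.
DATE_SUFFIX = re.compile(r"(\d{4})\.(\d{2})\.(\d{2})$")


def index_date(name):
    """Return the date encoded at the end of an index name, or None."""
    match = DATE_SUFFIX.search(name)
    if match is None:
        return None
    year, month, day = (int(part) for part in match.groups())
    try:
        return date(year, month, day)
    except ValueError:
        # Digits matched the pattern but are not a real calendar date.
        return None


def indices_older_than(names, days, today=None):
    """Return the sorted index names whose date is more than `days` days old."""
    if today is None:
        today = date.today()
    cutoff = today - timedelta(days=days)
    selected = []
    for name in names:
        day = index_date(name)
        if day is not None and day < cutoff:
            selected.append(name)
    return sorted(selected)
```

The list of names can come from wherever you already list indices (for example the `_cat/indices` API); the result is what you would pass as the `indices` of the archive snapshot.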

Later, build a big VM running a one-node ES cluster, restore that cold snapshot to it, and search as much as you like. It will take a while, but it is a nice, cheap solution.
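For the restore side, a sketch of the two request bodies that might be used on the one-node cluster. The repository and snapshot names are whatever you chose when archiving; the ones in the usage note are made up. Registering the repository as `readonly` means the restore cluster cannot modify the archive, and overriding `index.number_of_replicas` to 0 through the restore API's `index_settings` avoids unassigned replica shards, since a single node has nowhere to put them.

```python
def readonly_repo_request(repo, bucket):
    """Build the request that registers the archive bucket as a read-only repository."""
    body = {"type": "s3", "settings": {"bucket": bucket, "readonly": True}}
    return "PUT", f"/_snapshot/{repo}", body


def restore_request(repo, snapshot, indices):
    """Build the request that restores selected indices from a snapshot.

    Replicas are forced to 0 because a one-node cluster cannot allocate them.
    """
    body = {
        "indices": ",".join(indices),
        "include_global_state": False,
        "index_settings": {"index.number_of_replicas": 0},
    }
    return "POST", f"/_snapshot/{repo}/{snapshot}/_restore", body
```

Each function returns a `(method, path, body)` tuple to send as JSON to the restore cluster with any HTTP client, e.g. `restore_request("logs_archive", "logs-2020.06", ["filebeat-*-2020.06.*"])`.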

system · September 29, 2020, 3:33am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
