Storage optimization for ElasticSearch storing large data

asatsi · September 29, 2015, 5:58am

Hi,

I'm devising a solution for a data store that ingests about 100TB per month and keeps about a year of logs. I was thinking of keeping the hot data, let's say the most recent month, on locally attached disk (JBOD etc.), and storing the remaining 11 months on a SAN. The idea is that the large SAN-backed store would be used only for retrieving past data, while all current logs are written to local disk for faster write speeds.

Any thoughts, suggestions?

json · September 29, 2015, 6:08am

Hello Satish,

Please have a look at the blog post titled “Hot-Warm” architecture, which describes this kind of setup, and let us know if you have any questions.

asatsi · September 29, 2015, 9:17am

Thanks @json. Interesting article. I have a question:

The article says: "Elasticsearch will automatically migrate the indices over to the warm nodes."

However, there is also a reference to Curator doing the migration. Do we really need Curator for that, or will ES do the hot/warm migration automatically, as the article says?

json · September 30, 2015, 2:01am

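That is a good question. Elasticsearch does not decide on its own when an index should move from hot to warm nodes. You have to trigger the move on a time basis, either with a cron job that calls the REST API, as mentioned in the section labeled "Warm Data Nodes", or with Curator, as mentioned in the example. Once the index's allocation setting is changed, Elasticsearch relocates the shards to the warm nodes by itself.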
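
For the cron-job route, here is a minimal sketch of what such a script might look like. It is not taken from the blog post. It assumes daily indices named `logstash-YYYY.MM.DD`, a cluster reachable at `http://localhost:9200`, and nodes tagged with a custom attribute called `box_type` whose value is `warm` on the SAN-backed nodes. All of those names, the 30-day window, and the helper functions are placeholders to adapt to your own setup. The only Elasticsearch call used is the update index settings API with an allocation filter.

```python
import json
import urllib.request
from datetime import date, timedelta

# Everything below is an assumption for illustration; adjust to your cluster.
ES_URL = "http://localhost:9200"   # assumed cluster address
INDEX_PREFIX = "logstash-"         # assumed daily index naming scheme
NODE_ATTR = "box_type"             # assumed custom node attribute name
HOT_DAYS = 30                      # keep roughly one month on local disk


def index_name_for(day):
    """Return the daily index name for a given date, e.g. logstash-2015.09.29."""
    return INDEX_PREFIX + day.strftime("%Y.%m.%d")


def build_warm_request(index, es_url=ES_URL):
    """Build a PUT <index>/_settings request that requires warm nodes."""
    setting = "index.routing.allocation.require." + NODE_ATTR
    body = json.dumps({setting: "warm"}).encode("utf-8")
    return urllib.request.Request(
        url=f"{es_url}/{index}/_settings",
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


def migrate_aged_index(today=None):
    """Retag the index that just aged out of the hot window.

    Elasticsearch then relocates its shards to matching nodes on its own.
    Returns the HTTP status code of the settings update.
    """
    today = today or date.today()
    index = index_name_for(today - timedelta(days=HOT_DAYS))
    with urllib.request.urlopen(build_warm_request(index)) as resp:
        return resp.status
```

Calling `migrate_aged_index()` once a day from cron would retag the index that has just turned 30 days old. Curator can do the same retagging with its allocation support, which avoids maintaining a script like this yourself.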
