Is there a way to archive old data in Elasticsearch and automatically move it to cheaper storage, while keeping access to that data from the Kibana interface?
Our architecture has a single node that runs Logstash, Elasticsearch, and Kibana.
I went through most of the threads on the forum, and the solutions offered are:
1. Curator with snapshot/restore -> this does not suit me, because I would like something automatic where I do not need to restore an index just to consult the archived data.
2. A hot-warm-cold architecture with Index Lifecycle Management (ILM), which requires a minimum of two nodes -> I only have one node.
Basically, I would like to know whether I can implement solution 2 with a single node.
@Julius
On an ES node you can specify one or more directories as data directories, but once specified there is no way, at least that I know of, to tell ES which shard goes to which directory. So if you use fast and slow (cheaper) storage as data directories on a single ES node, there is no way to achieve the hot-warm architecture.
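For reference, this is roughly what the multi-directory setup looks like (the paths are placeholders). ES decides on its own which path each shard lands on, so this alone cannot be used to pin old indices to the cheap disk:

```yaml
# elasticsearch.yml -- a minimal sketch; /mnt/fast and /mnt/cheap are placeholder paths.
# ES spreads shards across these directories itself; you cannot assign a given shard
# (or index) to a specific path, which is why this does not give you hot/warm tiers.
path.data:
  - /mnt/fast
  - /mnt/cheap
```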
There are a couple of possibilities, each with its own limitations:
If your real constraint is a single host (physical machine) and the host has enough resources, you can run multiple ES nodes on that host. One node can use fast storage and the other can use cheaper storage, and you can size the heap of each node separately. If you use Docker, you get even more control over the CPU and memory used by each node. A rough sketch is below.
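Something along these lines, assuming Elasticsearch 7.x (image version, paths, and heap sizes are placeholders you would adjust to your host):

```yaml
# docker-compose.yml -- a rough sketch only, not a production config.
# The host still needs the usual prerequisites (e.g. vm.max_map_count).
version: "2.2"
services:
  es-hot:
    image: docker.elastic.co/elasticsearch/elasticsearch:7.10.2
    environment:
      - node.name=es-hot
      - cluster.name=single-host-cluster
      - discovery.seed_hosts=es-warm
      - cluster.initial_master_nodes=es-hot,es-warm
      - node.attr.data=hot                  # custom attribute used for shard allocation
      - "ES_JAVA_OPTS=-Xms2g -Xmx2g"
    volumes:
      - /mnt/fast/es-hot:/usr/share/elasticsearch/data    # fast disk
    ports:
      - "9200:9200"                         # Kibana and Logstash keep pointing here
  es-warm:
    image: docker.elastic.co/elasticsearch/elasticsearch:7.10.2
    environment:
      - node.name=es-warm
      - cluster.name=single-host-cluster
      - discovery.seed_hosts=es-hot
      - cluster.initial_master_nodes=es-hot,es-warm
      - node.attr.data=warm                 # cheaper storage tier
      - "ES_JAVA_OPTS=-Xms1g -Xmx1g"
    volumes:
      - /mnt/cheap/es-warm:/usr/share/elasticsearch/data  # cheap disk
```

Both nodes form one cluster, so Kibana keeps seeing all the data no matter which node holds it. With each node advertising a `node.attr.data` attribute, your solution 2 then works with an ordinary ILM policy whose warm phase allocates old indices to the cheaper node, something like this (policy name and thresholds are placeholders):

```
PUT _ilm/policy/hot-warm-demo
{
  "policy": {
    "phases": {
      "hot": {
        "actions": {
          "rollover": { "max_age": "7d", "max_size": "20gb" }
        }
      },
      "warm": {
        "min_age": "30d",
        "actions": {
          "allocate": { "require": { "data": "warm" } }
        }
      }
    }
  }
}
```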
Freeze old indices. Frozen indices are read-only and keep almost nothing in heap memory, so queries against them run significantly slower, but the data remains searchable. You can unfreeze them without going through a restore process. This helps with memory/CPU, but not with storage cost.
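Freezing and unfreezing is just an API call per index, and frozen indices have to be included in searches explicitly (index names below are placeholders):

```
POST /logstash-2020.01.01/_freeze

GET /logstash-*/_search?ignore_throttled=false
{
  "query": { "match_all": {} }
}

POST /logstash-2020.01.01/_unfreeze
```

For the Kibana side, if I remember correctly there is an advanced setting ("Search in frozen indices", `search:includeFrozen`) that makes Discover and dashboards query frozen indices as well.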