Hello. We have a hard limit of 4 TB of disk storage that can be allocated to an individual node in a cluster.
My question is: what happens when we start nearing the 4 TB threshold? Also, is there a way to archive data outside the cluster while keeping it available?
Elasticsearch has thresholds that trigger when a node's disk usage reaches certain defined percentages.
Basically you have the low watermark, the high watermark, and the flood stage watermark.
The low watermark defaults to 85% of disk usage. When a node reaches this stage, Elasticsearch stops allocating new shards to it, but this only affects replica shards, not primary shards.
The high watermark defaults to 90% of disk usage. When a node reaches this stage, Elasticsearch will try to relocate shards away from the node.
And the flood stage defaults to 95%. When a node reaches this stage, every index that has a shard on that node is set to read-only.
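If you want to see or adjust those thresholds yourself, something like the following sketch should work (the values shown are just the defaults restated explicitly; adjust them to your own needs):

```console
# Check current disk usage per node
GET _cat/allocation?v

# Set the watermarks explicitly via the cluster settings API
PUT _cluster/settings
{
  "persistent": {
    "cluster.routing.allocation.disk.watermark.low": "85%",
    "cluster.routing.allocation.disk.watermark.high": "90%",
    "cluster.routing.allocation.disk.watermark.flood_stage": "95%"
  }
}
```

Note that the watermarks can also be set as absolute byte values (e.g. "100gb" of free space) instead of percentages, which may be easier to reason about with a fixed 4 TB limit.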
About archiving data outside the cluster, you can use snapshots for that; this is explained in this part of the documentation. Keep in mind, though, that data in snapshots is not searchable unless you have an Enterprise license (searchable snapshots) — otherwise you need to restore a snapshot back into the cluster before you can query it.
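As a rough sketch of the snapshot workflow: you first register a snapshot repository, then take snapshots into it. The repository name, type, and location below are just examples for illustration (a shared filesystem repository; you could also use a cloud repository type such as S3 if the plugin/configuration is in place):

```console
# Register a shared-filesystem snapshot repository
# ("my_backup" and the path are example values)
PUT _snapshot/my_backup
{
  "type": "fs",
  "settings": {
    "location": "/mnt/backups/my_backup"
  }
}

# Take a snapshot of the cluster into that repository
PUT _snapshot/my_backup/snapshot_1?wait_for_completion=true
```

After the snapshot completes, the archived indices can be deleted from the cluster to free disk space and restored later from the repository if needed.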