Disk threshold

We are trying to implement a single-node cluster of Elasticsearch per installation of our application. Our application is on-prem. I was looking at the index-data-location parameters in the Elastic documentation and learned there are disk-based shard allocator settings with the following parameters:

  1. cluster.routing.allocation.disk.threshold_enabled
  2. cluster.routing.allocation.disk.watermark.low
  3. cluster.routing.allocation.disk.watermark.high
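For reference, the defaults documented in the reference manual look roughly like this, sketched in `elasticsearch.yml` form (confirm the exact values against your Elasticsearch version):

```yaml
# Default disk-based shard allocation settings (per the reference manual).
cluster.routing.allocation.disk.threshold_enabled: true        # enabled by default
cluster.routing.allocation.disk.watermark.low: "85%"           # stop allocating new replica shards to this node
cluster.routing.allocation.disk.watermark.high: "90%"          # try to relocate shards away from this node
cluster.routing.allocation.disk.watermark.flood_stage: "95%"   # mark affected indices read-only
```

These are dynamic cluster settings, so they can also be changed at runtime via the cluster settings API rather than in the config file.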

I would need to know: does Elasticsearch enable the disk threshold by default?

I don’t see the use case for keeping it enabled by default in a single-node cluster where we will have multiple disks on a single node/server. We will write our own disk alerts to monitor the thresholds.

It would be good if someone could explain what the low and high watermarks mean, with some simple examples.
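To illustrate with concrete numbers of my own (not from the docs): on a node with a 1 TB disk and the default percentage watermarks, crossing ~850 GB used (85%) means the node stops receiving new replica shards, and at ~900 GB used (90%) Elasticsearch tries to move shards off the node. The watermarks can also be set as absolute free-space values instead of percentages; the sizes below are hypothetical:

```yaml
# Absolute free-space watermarks (all three must use the same style,
# either percentages or byte values; these sizes are just examples).
cluster.routing.allocation.disk.watermark.low: "100gb"          # act when less than 100 GB free
cluster.routing.allocation.disk.watermark.high: "50gb"
cluster.routing.allocation.disk.watermark.flood_stage: "10gb"
```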

I think the reference manual answers most of these questions so I suggest starting there.

The low and high watermarks are only really useful for logging in a single-node setup, but the flood_stage watermark is still valuable there.
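To spell out what flood_stage does: when a node's disk usage exceeds it, Elasticsearch applies a write block to every index that has a shard on that node. The effect is equivalent to this index setting:

```yaml
# Applied automatically by Elasticsearch to indices with shards on a node
# that has exceeded the flood_stage watermark; writes are then rejected.
index.blocks.read_only_allow_delete: true
```

In recent versions this block is released automatically once disk usage falls back below the high watermark; check your version's reference manual for the exact behaviour.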


Hi @DavidTurner

Could you please let me know how the high watermark is helpful in a single-node cluster setup with "one node, one disk", given that it works by relocating shards across nodes?

In a single-node cluster, I believe the low watermark and flood_stage are the ones that will play an important role?

Kindly confirm!

For a single data node you need to worry only about the flood_stage configuration; the low and high watermarks will have no impact.

The low watermark does not affect the primary shards of new indices, as explained in the reference manual:

This setting has no effect on the primary shards of newly-created indices but will prevent their replicas from being allocated.

Since you have a single node, you do not have any replicas.

You also do not have other nodes, so the high watermark has no impact either, as there is nowhere to move shards to.

The flood_stage watermark is the one that will impact you, since it will block writes on your node.
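If writes do get blocked and your version does not release the block automatically, you can clear it yourself after freeing disk space by resetting the index setting; `my-index` here is a placeholder name:

```yaml
# PUT /my-index/_settings  (body shown in YAML form; my-index is a placeholder)
# Setting the block to null removes it, re-enabling writes.
index.blocks.read_only_allow_delete: null
```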

As David said before, the low and high watermarks are useful just for logging; you should monitor your nodes' logs and use this information to decide when to act and free disk space.

Thank you! This information is helpful.

Elsewhere you are asking about a design in which you have multiple disks, and therefore multiple nodes, all running on a single host, so I think you don't have a single-node cluster as you claim here. In that case the watermarks all work as documented to move shards around to ensure that no node's disk gets too full.
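A sketch of that kind of design, one node per disk on the same host (node names, paths, and ports below are hypothetical, not from this thread):

```yaml
# elasticsearch.yml for the first node, storing data on disk 1
node.name: node-1
path.data: /mnt/disk1/es-data
http.port: 9200

# A second node runs as a separate process with its own config, e.g.:
# node.name: node-2
# path.data: /mnt/disk2/es-data
# http.port: 9201
```

With multiple nodes, the high watermark regains its normal meaning: a node whose disk fills past it can shed shards to the other nodes on the host.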


Thanks David! We are planning to integrate Elasticsearch with an enterprise application. We are heavily dependent on indexing. Our application has use cases for scaling storage vertically as well as horizontally. Existing customers have stored around 10 TB of index data on a single server with the older indexing engine. Now we are seeing a challenge in fulfilling the vertical-scaling use case with Elasticsearch's horizontal-scaling model: with one node and one data path, we won't be able to keep the existing functionality of adding drives and continuing to scale the server vertically.

Anyway, you guys are very prompt in answering and clarifying my doubts!