Shards not allocating based on disk space

Peterson · April 16, 2019, 4:36pm

I have a cluster with 9 nodes, all the same size. The shards are allocating evenly in regards to the number of shards per node, but not in regards to the disk space. In reading the documentation, "Elasticsearch considers the available disk space on a node before deciding whether to allocate new shards to that node or to actively relocate shards away from that node.", but that does not seem to be the case. The nodes range from 31% full to 71% full. I tried using the API to change the rebalancing, but that did not solve anything. Is there a way to allocate the shards based off of disk space (or size of shards) rather than the number of shards? Thank you.

DavidTurner · April 16, 2019, 4:45pm

This is from the page on the disk-based shard allocator and the rest of the page describes this logic in much more detail. The goal is not to balance the disk usage, it is to keep the disk usage below the configured watermarks.

Peterson · April 16, 2019, 4:46pm

Is there a way to balance disk usage?

DavidTurner · April 16, 2019, 4:49pm

No, not really. I'm not sure I understand why you would want to do this. It would potentially lead to a lot of unnecessary shard movement as the shards grow over time. Can you explain in a bit more detail what problem you're looking to solve with this feature?

Peterson · April 16, 2019, 4:55pm

We often run into the issue where one node goes above the watermark which causes the shards to unallocate and then cannot reallocate. When this happens, we have other nodes that are below the watermark.

DavidTurner · April 16, 2019, 5:20pm

This is surprising to me. Shards are not normally deallocated when a node goes above a watermark. If you exceed the low watermark then nothing happens to existing shards; if you exceed the high watermark then shards are moved elsewhere, but they remain allocated on their current node until the relocation is complete; if you exceed the flood stage watermark then the shards are marked as read-only, but they stay allocated to their current node. I'd like to understand the sequence of events that leads from a full disk to an unassigned shard in more detail. Do you have logs of a case where this happened?

system · May 14, 2019, 5:20pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Total_shards_per_node and disk usage too high causes shards to stay unallocated Elasticsearch	5	12	December 18, 2024
Shard allocation based on shard size Elasticsearch	14	937	January 18, 2021
How can I modify the distribution of the shards in nodes with diffent disk capacity? Elasticsearch	5	416	August 2, 2019
Shard reallocation and disk space Elasticsearch	5	765	August 4, 2020
Can es shards allocate to nodes based on disk capacity？ Elasticsearch	3	305	October 8, 2020

Shards not allocating based on disk space

Related topics