Unbalanced cluster - one node running out of space

elssar · June 24, 2016, 1:28pm

I have a cluster with 1 dedicated master node, 1 client node, and 5 data nodes..

While running a particularly heavy aggregation, the cluster turned yellow and a few replica shards got stuck in Initializing state. After the aggregation was ran again, a lot of replica shards (~1/6th ) became unassigned.

Now I see that one of the data nodes has been assigned a bulk of the primary shards and it is running out of disk space fast.

All data nodes have 500gb disks, and the other 4 have between 100 and 150 GB of disk space free. The 5th one has less than 20.

How can I remedy this? Will moving shards away from this node help?

EDIT: I'm using Elasticsearch version 1.7.1

Josh_J_Luo · June 24, 2016, 2:13pm

I would suggest you to first check your _routing field

https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping-routing-field.html

There is a chance that your ids (by default _routing field uses ids) are not well distributed, causing the routing to crowd certain nodes.

elssar · June 24, 2016, 2:27pm

Elasticsearch auto generates ids, so I doubt thats the case

Josh_J_Luo · June 24, 2016, 2:29pm

Do you have parent-child relationship in your index?

elssar · June 24, 2016, 2:31pm

No. Its a standard logstash populated index

Josh_J_Luo · June 24, 2016, 2:59pm

If so, probably you hit a bug (or hit by one). See if this helps https://github.com/elastic/elasticsearch/pull/14494

elssar · June 25, 2016, 5:31am

Thanks @Josh_J_Luo, I'll take a look.

The cluster recovered on its own though. The problematic node went from almost running out of disk space (<1gb) to having over a 100gb free.

Topic		Replies	Views
Unbalanced cluster with nearly half of the shards allocated to a single node Elasticsearch	5	1991	July 5, 2017
Node is in cluster but shards are unassigned Elasticsearch	8	1572	July 12, 2017
Unbalanced disk usage with ES 6.1.3 Elasticsearch	4	2554	May 1, 2018
One node taking much more space than others Elasticsearch elastic-stack-monitoring	2	2714	April 20, 2019
Three Node Elastic Cluster balance issue Elasticsearch	7	216	December 8, 2022

Unbalanced cluster - one node running out of space

Related topics