All primary shards of some indices keep getting assigned to 1 of 14 data nodes

ES version: 7.6.x
Nodes: 3 coordinating, 3 master, 14 data
Indices: 6 primary shards, 1 replica

All primary shards of some indices keep getting assigned to only 1 of the 14 data nodes. So data node 12 holds all 6 primary shards of index x, all 6 of index y, all 6 of index z, etc., while the other 13 nodes look normal.

This results in high JVM heap usage on data node 12 and in unassigned shards (reason: CircuitBreakingException).
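
The skew is easy to see with the cat APIs and node stats (index-x is just a placeholder for one of the affected indices):

    # Which node holds each shard of an affected index
    GET _cat/shards/index-x?v&h=index,shard,prirep,state,node

    # Heap pressure per node
    GET _cat/nodes?v&h=name,node.role,heap.percent,heap.max

    # Per-node circuit breaker stats, to see which breaker is tripping
    GET _nodes/stats/breaker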

Why would this happen for only data node 12?

What I have tried:

  1. Set -XX:CMSInitiatingOccupancyFraction=50
  2. Find each index that has all of its primary shards on data node 12 and set index.routing.allocation.total_shards_per_node on it to 1 or 2 (see the snippet after this list)
    a. This helps, but all primary shards of some newly created indices still get allocated to data node 12
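
For reference, the per-index change in step 2 is just a settings update along these lines (index-x is a placeholder):

    PUT /index-x/_settings
    {
      "index.routing.allocation.total_shards_per_node": 2
    }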

What I have not tried:

  1. Set index.routing.allocation.total_shards_per_node in the index template so new indices pick it up (sketched after this list)
    a. Not sure how this would interact with index.routing.allocation.require._name (I cannot use ILM for some reason, so I wrote the shrink process myself, and I need all primary shards of an index on the same node in order to shrink. See here)
  2. Change circuit breaker settings
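
For option 1, I am picturing something like the legacy template below (the template name and pattern are placeholders; 7.6 still uses _template rather than composable templates), followed by the require._name step my shrink script already does. As far as I can tell the two settings can conflict at shrink time, because the per-node limit would block piling a copy of every shard of the index onto one node, so I would probably have to relax it on that index just before shrinking.

    PUT _template/shard_limit_default
    {
      "index_patterns": ["*"],
      "order": 0,
      "settings": {
        "index.routing.allocation.total_shards_per_node": 2
      }
    }

    # Later, before shrinking (node and index names are placeholders).
    # The per-index total_shards_per_node limit would need to be raised or
    # removed on index-x first, or this relocation cannot complete.
    PUT /index-x/_settings
    {
      "index.routing.allocation.require._name": "data-node-12",
      "index.blocks.write": true
    }

    POST /index-x/_shrink/index-x-shrunk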

EDIT: a coworker found this and it looks like this is the problem.

I kept running into the same issue.

All primary shards of new indices were being assigned to one node (almost always the same hot node).

We solved this problem by scaling up our JVM heap size from 8g to 12g (i.e. -Xmx12g -Xms12g) per data node (and the Kubernetes StatefulSet memory from 16Gi to 24Gi).

Once this change was applied, we waited until the problematic node's shard count caught up with the other nodes, and then the cluster rebalanced itself by relocating the primary shards of the newly created indices off of the problematic node.
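
If it helps anyone in the same situation, the rebalance is easy to follow with the cat APIs:

    # Shard count and disk usage per node, to watch the counts even out
    GET _cat/allocation?v

    # In-flight shard relocations and recoveries
    GET _cat/recovery?v&active_only=true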
