Relocation in loop on Elasticsearch 7.7

dextermorgan · April 18, 2021, 6:50am

We recently upgraded from ES 5.4 to ES 7.7 and are facing this weird issue in production where relocation goes in a loop. Have noticed majorly on large clusters and having hot/cold config. e.g. following is cluster config

3 master nodes
2 nodes with node.attr.tag as hot
30 nodes with node.attr.tag as cold (relocation in loop on these nodes)

Indices having hot and cold inclusion/exclusion in settings

For the sake of debugging and simplifying things, added following in cluster settings

cluster_concurrent_rebalance: 10
node_concurrent_recoveries: 1
balance.index: 0.0f
balance.shards: 1.0f

Enabled trace logs for BalancedShardsAllocator and went through code and noticed that the first time it relocates the shard to the minNode from maxNode and then saw simulate logs which increase number of shards on that minNode on the model due to throttling. So, in next indices run, the shards on minNode increase in theory and that leads to shards moving away from minNode even if (in actual) it has less shards. Attaching logs for same.

Logic and code wise this is the case with ES 5.4 as well but we never faced this issue there. Could someones please help on how can I debug this and also why do we simulate relocation and theoretically increase shards on node?

Ubuntu Pastebin - Node RlZgpH1xTYKuEjn1gyH3CA is the one which has least shards when i enable rebalance shards.

dextermorgan · April 26, 2021, 1:25pm

Gentle reminder if anyone could help me with the query.

system · May 24, 2021, 1:26pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
I see always relocation going on, should I be worried? Elasticsearch	4	43	October 3, 2024
Shards refuse to relocate to different nodes using cluster.routing.allocation.exclude Elasticsearch	3	2208	July 13, 2019
Weird rebalancing strategy Elasticsearch	4	323	October 23, 2021
Shard relocation storms when cluster disk low Elasticsearch	11	2533	July 24, 2018
5 new hot nodes were added, but the relocating shards were all freeze nodes, and the number of freeze nodes was close to equilibrium. ES version 6.8.3 Elasticsearch	1	304	April 14, 2020

Relocation in loop on Elasticsearch 7.7

Related topics