ES shards data rebalancing

gintek · September 9, 2020, 12:29pm

Hello,

I'm wondering how are you guys doing with data rebalancing on large clusters ? We have 3 masters, 3 hot nodes and 6 cold. Some cold nodes have 4T disk and few 2T. ES is rebalancing shards based on number shards per node so time to time we need manually relocated heavy shards between nodes. We are not able control shards size, some are 200G some 10G. We are still on ES 6.8 and we are planning upgrade to 7.x where we can use index lifecycle management.
I read about plugin that enables rebalancing based on disk usage, another option is cron job and script to rebalance data.

What is your approach and why you choose it ?
Thanks,

d.silwon · September 9, 2020, 6:14pm

We had the same problem but with smallest amount of data and from my point of view there is only one way i.e. change shard size. Maybe I'm wrong but I do not find another sollution. Why you can not change the size of shards for new indexes? Do you have implemented any deletion polices for existing indexes?

gintek · September 10, 2020, 7:07am

@d.silwon thanks for response.

We are not using ILM so it's hard control shards size, and we have some "silver tape" solution. To speed up cluster and avoid outage we precreate indexes, we use daily index name convention.

Maybe someone has experience with https://github.com/simplymeasured/tempest ?

system · October 8, 2020, 7:07am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Fundamental question about ES data/shards Elasticsearch	3	417	July 6, 2017
Shard allocation based on shard size Elasticsearch	14	937	January 18, 2021
Shard rebalancing Elasticsearch	3	1756	July 6, 2017
Shard rebalancing after node restart Elasticsearch	2	771	July 5, 2017
Rebalancing of shards Elasticsearch	7	3520	July 6, 2017

ES shards data rebalancing

Related topics