Speedup Migration/shard allocation on hot warm nodes

arvind297 · December 27, 2018, 6:31am

Hi,

While migrating indices older than a given number of days from hot to warm nodes, I believe setting the replica count to 0 (I know it is a bit risky) can speed up the migration. Please suggest whether there is any other way (any parameter changes) to increase the speed of shard allocation to the warm nodes.

DavidTurner · December 27, 2018, 8:32am

The speedup you get by reducing the number of replicas to 0 is due to copying less data: you only have to make one new copy of the shard on a warm node. However, this means you don't have any redundancy, and given that disks are generally a little unreliable I would replace "bit risky" with "guaranteed to lose data in the long run" in what you said.

If you mean to set the number of replicas to 0, perform the migration, and then add replicas again, then you will copy the same amount of data either way, so I don't understand the benefit.
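To make the "same amount of data either way" point concrete, here is a rough sketch of the arithmetic. It assumes every shard copy that ends up on a warm node has to be copied in full, and the 100 GB primary size and the helper name `gb_copied_to_warm` are purely illustrative, not figures or APIs from this thread.

```python
def gb_copied_to_warm(primary_gb: float, replicas_during_move: int, replicas_after: int) -> float:
    """Total GB written to warm nodes (simplified model: every copy is a full copy)."""
    # Copies relocated during the migration: the primary plus any replicas kept.
    moved = primary_gb * (1 + replicas_during_move)
    # Replicas added back afterwards are rebuilt by copying from the primary.
    rebuilt = primary_gb * (replicas_after - replicas_during_move)
    return moved + rebuilt


# Illustrative value only: 100 GB of primary data.
keep_replica = gb_copied_to_warm(100, replicas_during_move=1, replicas_after=1)
drop_then_restore = gb_copied_to_warm(100, replicas_during_move=0, replicas_after=1)
print(keep_replica, drop_then_restore)
```

Under this simplified model both strategies write the same total to the warm tier; dropping the replica only changes when the second copy is made.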

Can you give some more numbers about the problem you're trying to solve? How large is your cluster, how much data are you talking about, and how long does it currently take?

arvind297 · December 27, 2018, 11:19am

Thanks for the update. I was aware that setting replicas to 0 is an option that carries the risk of losing data (I haven't used it).

Cluster size is around 30TB with 5 hot and warm nodes each.

Moving around 100 GB of data takes around 8 to 10 hours, and the shard count is around 150 with replicas set to 1.

Hence I wanted to know if there are any changes that can be made to speed up shard allocation on the warm nodes.

Christian_Dahlqvist · December 27, 2018, 11:23am

How many indices and shards is that spread across?

arvind297 · January 2, 2019, 5:35am

We have around 25,000 shards and 7,146 indices, 5 hot nodes and 7 warm nodes, and the total heap memory allocated to all these nodes (hot + warm) is around 450 GB.

Sometimes it takes around 4 to 5 hours to move around 400 GB of data from hot to warm (we use Curator to move 7-day-old data daily), whereas on other days it takes 10+ hours to move 400+ GB of data. Hence I wanted to check if there is any way to speed up the data movement from hot to warm.
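For a rough sense of scale, the durations above can be turned into average transfer rates. This is only a back-of-the-envelope sketch: it takes 1 GB as 1000 MB, treats the quoted 400 GB as the full amount transferred (whether that figure includes replicas is not stated), and `throughput_mb_per_s` is a hypothetical helper name.

```python
def throughput_mb_per_s(gigabytes: float, hours: float) -> float:
    """Average transfer rate in MB/s, taking 1 GB = 1000 MB."""
    return gigabytes * 1000 / (hours * 3600)


# Durations quoted in the post: 4 to 5 hours on good days, 10+ hours on bad ones.
for hours in (4, 5, 10):
    rate = throughput_mb_per_s(400, hours)
    print(f"400 GB in {hours} h is about {rate:.1f} MB/s across the cluster")
```

These are cluster-wide averages across all nodes involved, so the per-node rate would be lower still.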

Christian_Dahlqvist · January 2, 2019, 5:42am

You have far too many shards given the size of your cluster and data. Please read this blog post for some practical guidelines on recommended shard sizes and sharding practices.

Having so many shards can slow down cluster state updates and propagation that need to happen as shards are moved around. I would expect you to see much better performance with fewer larger shards.
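As a quick illustration of why the shard count looks high, the numbers given earlier in the thread work out as follows. This is a sketch under assumptions: it takes "around 30TB" as 30,000 GB, assumes the data size and shard count are reported on the same basis (both with or both without replicas), and uses the later node counts of 5 hot and 7 warm.

```python
# Figures quoted in the thread (all approximate).
TOTAL_DATA_GB = 30 * 1000   # "around 30TB", taking 1 TB = 1000 GB
TOTAL_SHARDS = 25_000
TOTAL_INDICES = 7_146
TOTAL_NODES = 5 + 7         # 5 hot + 7 warm
TOTAL_HEAP_GB = 450

avg_shard_gb = TOTAL_DATA_GB / TOTAL_SHARDS
shards_per_index = TOTAL_SHARDS / TOTAL_INDICES
shards_per_node = TOTAL_SHARDS / TOTAL_NODES
shards_per_heap_gb = TOTAL_SHARDS / TOTAL_HEAP_GB

print(f"average shard size:   {avg_shard_gb:.1f} GB")
print(f"shards per index:     {shards_per_index:.1f}")
print(f"shards per node:      {shards_per_node:.0f}")
print(f"shards per GB heap:   {shards_per_heap_gb:.1f}")
```

An average of roughly 1.2 GB per shard, with over 2,000 shards per node, is what "too many small shards" looks like in practice. For comparison, a rule of thumb often quoted from Elastic's sharding guidance of that period was to stay at or below roughly 20 shards per GB of heap; that figure is not stated in this thread, so treat it as an assumption.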

system · January 30, 2019, 5:42am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
