How to speed up move es node migration to another machine

dalei2019 · March 29, 2022, 5:05am

Hi team,
I have a 10 node es cluster created by ECK on k8s v1.22 .
And Persistent storage uses local pv .
For some reasons, I want to move one of the es instances to a new node.
Because the index has opened the copy.
I changed node affinity settings. Then delete the node directly.
And wait for eck to reschedule.
But it turned out that the machine had stored 500GB of data, and it took dozens of hours to reschedule calls.

My question is:

Is my approach correct？
How to speed up node migration to another machine？

Any suggestion will be very helpful for me, thanks a lot !

dalei2019 · April 1, 2022, 3:20am

+1 +1

pebrc · April 19, 2022, 9:31am

I am not sure I fully understand the sequence of events your are describing. I am assuming you are running your indices with replicas configured? If so, once you deleted the node the replicas should be promoted to primary, which should be almost instant.
Elasticsearch will then start reallocating replicas for the shards that have been lost with the node you removed. This can indeed take a long time depending on the size of the indices as they recover the replica slowly from the primary.

As to whether this is right approach: local volumes on Kubernetes are tricky to manage. We have written up some thoughts in our documentation here: Storage recommendations | Elastic Cloud on Kubernetes [master] | Elastic

But unless you have a way of copying the local volume to another node, I don't see another way than forcibly removing the node as you did it. This is of course not ideal because if this operation coincides with an unplanned failure of another node that holds the replicas (which are just about to become primaries) you are set up for data loss.

abrx · April 20, 2022, 1:31pm

You may speed up by adding the new node in advance, so it will take some shards as usual.

Then check the _health of your cluster is green (so every index has primary and copy)

Then delete the old node (or first change the shard allocation awareness with a temporary zone to avoid going to yellow when deleting, in this case wait for the relocation of shards from the old node)

system · May 18, 2022, 1:31pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Recommendation for upgrading underlying kubernetes nodes Elastic Cloud on Kubernetes (ECK)	6	903	November 4, 2022
Migrating data from one node setup to new cluster Elasticsearch	6	2844	February 4, 2019
Moving to a different elasticsearch storage using ECK Elastic Cloud on Kubernetes (ECK)	2	155	July 18, 2024
ES migration options Elasticsearch	5	432	November 14, 2019
Increasing shard relocation speed Elasticsearch	7	28700	July 5, 2017

How to speed up move es node migration to another machine

Related topics