Restart node after 15 mins

mukularora89 · April 22, 2024, 1:20pm

Hi,

We want to put the node offline for 15 mins and then bring back into the cluster. So the approach we are following is mentioned below

disable shard allocation except for primaries
Perform flush
Shutdown Elasticsearch service and node
Perform node operations (for 15 mins)
Restart node and Elasticsearch service
Reenable shard allocation to "all" when node joins cluster

Cluster Information

Current ES Version - 8.12.2
Node Data Size - 2.7 TB
No of shards per node - 93
Active Indexing and searching will be happening on the cluster

My understanding is that when we disable shard allocations except primaries, there will be unassigned shards as replica is staying on the node which is getting restarted. When node comes back online, cluster will turn to Green and those replica shards will be available. The node will have only replica shards.

So I have few questions

What is the time in which shard on a node become stale?
I believe that data movement will happen on the restarted node to make replicas in sync with primary due to active indexing. Please confirm.
Is the recovery duration function of data size? How to evaluate the time for unassigned shards to be allocated and data movement?
Is it suggested to set cluster routing to "new_primaries" considering node will be offline for 15 mins and there will be active indexing?

DavidTurner · April 22, 2024, 1:54pm

Marking a shard as stale is not a time-based thing. It happens as soon as the primary processes an operation that isn't on the replica.
Confirmed.
Mostly it's determined by only the size of the data changed while the node was down. So if there were no writes, the recovery should be very fast, whereas if you wrote GiBs of data then it will take longer.
Yes, that's part of the documented process.

Topic		Replies	Views
Rolling restart, replica allocation, cluster.routing.allocation.enable vs. index.unassigned.node_left.delayed_timeout Elasticsearch	2	703	December 27, 2022
Restarting node takes time Elasticsearch	4	1080	July 5, 2017
Restarting of node taking much time Elasticsearch	6	2441	July 6, 2017
Shard allocation on restarted node takes too long Elasticsearch	5	3406	July 5, 2017
Quickly restarting a node Elasticsearch	6	577	April 11, 2019

Restart node after 15 mins

Related topics