Hi,
We want to put the node offline for 15 mins and then bring back into the cluster. So the approach we are following is mentioned below
- disable shard allocation except for primaries
- Perform flush
- Shutdown Elasticsearch service and node
- Perform node operations (for 15 mins)
- Restart node and Elasticsearch service
- Reenable shard allocation to "all" when node joins cluster
Cluster Information
- Current ES Version - 8.12.2
- Node Data Size - 2.7 TB
- No of shards per node - 93
- Active Indexing and searching will be happening on the cluster
My understanding is that when we disable shard allocations except primaries, there will be unassigned shards as replica is staying on the node which is getting restarted. When node comes back online, cluster will turn to Green and those replica shards will be available. The node will have only replica shards.
So I have few questions
- What is the time in which shard on a node become stale?
- I believe that data movement will happen on the restarted node to make replicas in sync with primary due to active indexing. Please confirm.
- Is the recovery duration function of data size? How to evaluate the time for unassigned shards to be allocated and data movement?
- Is it suggested to set cluster routing to "new_primaries" considering node will be offline for 15 mins and there will be active indexing?