Whenever we restart a node (or a few) in a cluster, and also when we do a
full cluster restart, the cluster takes a lot of time to recover from
YELLOW state to GREEN.
What we observed is the cluster is relocating indexes when it goes up, or
when a few nodes restarted. Since our indexes are not small it takes them
time to relocate, and also one relocation triggers another (for balancing
reasons) this becomes a never ending pursuit.
As long as the cluster is up - no relocations happen. This only happens
when a node or few restart, or when the cluster restarts as a whole. But
still - a balance that is preserved while the cluster is up should be
preserved when there's a small disturbance. I will only expect a rebalance
when a game-changer event happens - large index is added or removed, or a
node permanently joins / removed.
I'm aware of the various settings in place that should prevent that
(expected number of nodes, time to wait etc) but obviously they don't play
Our hope is the new allocation decider would help with that, but that issue
seems to be originating from some sort of a bug in the decision of WHEN to
I'll be happy to provide anything that could help pinpoint the issue
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to email@example.com.
For more options, visit https://groups.google.com/groups/opt_out.