Is it required to restart the routing/coordinator process when a data node disconnects from the network and comes back?

dany · March 19, 2016, 7:51pm

I know this may have been discussed in several posts, but I haven't found a solution yet. It happens in some cases (not always): when a data node drops off the network and comes back, it does not re-register with the cluster automatically. I end up restarting the routing node process so that it picks up the data node again.

Is it required to restart the routing/coordinator process when a data node disconnects from the network and comes back?
Or is there a setting I can change to increase the number of checks the cluster makes for nodes after a node drops off the network?

warkolm · March 19, 2016, 11:33pm

It shouldn't need a restart, no.
Assuming you are using zen discovery, take a look at https://www.elastic.co/guide/en/elasticsearch/reference/2.2/modules-discovery-zen.html

dany · March 22, 2016, 8:24pm

Thanks for the response. I use zen discovery. The only zen setting in my config is minimum_master_nodes:

zen:
  minimum_master_nodes: 2

Do I need to set the ping_retries setting (default 3) if the network failure lasts longer than the default retry window of 90s (3 * ping_timeout = 3 * 30s = 90s)?

I mean setting ping_retries to a higher value (e.g. 10 or 20), so that the node rejoins even if the network failure lasts 10 minutes, with:
ping_retries: 20
ping_timeout: 30s
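
For anyone checking the numbers, the arithmetic in this post can be sketched as below. This is only a rough model, assuming the retry window is simply ping_retries * ping_timeout as described above (it ignores ping_interval and any other delays). The function names are made up for illustration and are not part of Elasticsearch, and nothing in this thread confirms that a longer window by itself makes a node rejoin.

```python
import math


def fd_window_seconds(ping_retries, ping_timeout_s):
    """Approximate retry window: number of retries times the per-ping timeout.

    Assumption: simplified model from the post, not Elasticsearch's exact behavior.
    """
    return ping_retries * ping_timeout_s


def retries_needed(outage_s, ping_timeout_s):
    """Smallest ping_retries whose window covers an outage of outage_s seconds."""
    return math.ceil(outage_s / ping_timeout_s)


# Defaults quoted in the post: 3 retries, 30s timeout
print(fd_window_seconds(3, 30))   # 90

# A 10 minute (600s) outage with a 30s timeout
print(retries_needed(600, 30))    # 20
```

Under this model, ping_retries: 20 with ping_timeout: 30s covers an outage of up to 600s, which matches the 10 minute figure above.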
