Timed out while waiting for initial discovery state - timeout: 30s

alexus · August 26, 2018, 3:02am

I'm using Elastic Stack 6.4.0 (did rolling upgrade from 6.3.2 recently) and after stop of entire elasticsearch cluster (master, data, coordinate and ingest nodes), I'm no longer able to start it; I'm getting following messages in logs:

timed out while waiting for initial discovery state - timeout: 30s

and also:

org.elasticsearch.discovery.MasterNotDiscoveredException: null

I use Zen Discovery and publish_host seems to be correct.

Please advise.

alexus · August 26, 2018, 9:24pm

after downgrading Elastic Stack to 6.3.2 (and later rolling upgrade to 6.4.0) everything works as it expected, however starting with 6.4.0 cluster Elastic Stack, causing nodes not see each other...

warkolm · August 27, 2018, 1:09am

Can you share your full config and the logs?

alexus · August 27, 2018, 4:09am

My elasticsearch cluster consist of:

5 master eligible nodes
5 data nodes
2 injest nodes
and few coordinate only nodes

they all share following configuration:

# cat cluster.env 
cluster.name=X
# cat discovery.zen.env 
discovery.zen.minimum_master_nodes=3
discovery.zen.ping.unicast.hosts=esm1,esm2,esm3,esm4,esm5
#

esm = master nodes

unfortunately, I blew everything away to go to plan "b", which I commented earlier...

warkolm · August 27, 2018, 4:10am

Are they dedicated master nodes?

alexus · August 27, 2018, 4:10am

yes, and they have 12G HEAP memory

warkolm · August 27, 2018, 4:12am

You really only need 3, 5 is a bit of overkill.

Without full logs it's a little hard to speculate as to why this is happening.

alexus · August 27, 2018, 4:14am

5 allows me to do rolling upgrades w/ minimal downtime

warkolm · August 27, 2018, 4:15am

You can do the same thing with 3.
Having dedicated masters is great, but having more than 3 is diminishing returns.

alexus · August 27, 2018, 4:19am

I have following line:

discovery.zen.minimum_master_nodes=3

if I only have 3 nodes, and I restart one of them, cluster still green?

warkolm · August 27, 2018, 4:20am

You just reduce that to 2, which is still a majority of 3.

system · September 24, 2018, 4:21am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
ES 6.0 timeout on cluster Elasticsearch	9	1163	January 18, 2018
Cluster failures Elasticsearch	2	284	July 6, 2017
Slow cluster startup with zen discovery and large number of nodes Elasticsearch	4	1122	July 6, 2017
ES 6.2.3 cluster goes down unexpectedly Elasticsearch	3	734	June 7, 2018
Periodically getting time out errors while inserting documents Elasticsearch	6	995	July 5, 2017

Timed out while waiting for initial discovery state - timeout: 30s

Related topics