Master goes down, even after re-election, cluster is unresponsive

Abhilash_Bolla · September 22, 2017, 6:05am

I have a 5 node cluster with 3 master eligible nodes and 2 dedicated data nodes.

In the current cluster state, the current master node left owing to long GC's.

After re-election, master was assigned to some other master eligible node. I get the following exception in my Elasticsearch logs on one of the dedicated slave node.

java.lang.IllegalStateException: cluster state from a different master than the current one, rejecting (received {cls-es-slave1}{Bh9_gR2jRqiLn3IjNfHYpA}{10.240.0.18}{10.240.0.18:9300}{master=true}, current {cls-es-master}{Z1O52E-fRIu2itHXL3l1Xg}{10.240.0.15}{10.240.0.15:9300}{master=true})

Can anyone explain me whats going on?

Thanks in advance.

warkolm · September 22, 2017, 6:06am

What version?

Abhilash_Bolla · September 22, 2017, 6:15am

@warkolm Elasticsearch version: 2.4.5

warkolm · September 22, 2017, 6:15am

Do you have minimum masters set?

Abhilash_Bolla · September 22, 2017, 6:19am

No, But I guess when a new master is elected, the cluster should be healthy again automatically.

If I had minimum master nodes set to 2 and I have 3 master eligible nodes out of which one is the current master. Now if the current master goes down and a new one is elected. The cluuster should be automatically up and healthy again.

Am I missing something here?

warkolm · September 22, 2017, 6:22am

If you don't have min masters then it's possible that you had a split brain.

Abhilash_Bolla · September 22, 2017, 6:29am

Even if I had set the minimum masters to 2, I would have faced this situation right?

As a cluster state was being published from a different master than the current one as know to the dedicated data node.

warkolm · September 22, 2017, 6:38am

Well it looks like you have multiple masters sending out conflicting updates, whereas if you had min masters set then only 1 master would ever be active and there wouldn't be the conflict.

Abhilash_Bolla · September 22, 2017, 6:39am

But if I had the min master set, then the cluster would have been inoperable as the cluster would have waited for those many masters to join. How is this fault tolerant?

warkolm · September 22, 2017, 6:44am

If you have 3 masters then min masters is 2, so you can still lose a master and maintain availability.
If you don't set it then you risk data loss and corruption.

It's a balance for sure, but I'd prefer consistency over availability myself, cause what's the point of having access to the data if it's wrong?

Abhilash_Bolla · September 22, 2017, 7:24am

You mean the above exception might still have occurred even if I had min masters set?

warkolm · September 22, 2017, 7:31am

No. It's saying there that there are multiple masters, so setting min masters would have prevented that.

system · October 20, 2017, 7:31am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch master node election Elasticsearch	5	2563	March 10, 2020
Stale master elected in elastic version < 7 Elasticsearch	8	315	June 1, 2022
Elasticsearch cluster down: no known master node, scheduling a retry Elasticsearch	3	6116	August 14, 2017
Cluster master self-destructs soon Elasticsearch	5	479	July 5, 2017
Prevent setting minimum_master_nodes to more than the current node count Elasticsearch	5	354	March 28, 2023

Master goes down, even after re-election, cluster is unresponsive

Related topics