I would like to know the proper way to handle a split brain situation. I am constantly seeing "failed to send join request to master" messages as the nodes try to connect to a master node that is no longer there. The cluster is dead in the water and does not respond to any requests.
Do you not have discovery.zen.minimum_master_nodes set?
Have you restarted the existing/old master?
What does _cat/master show for each node in the cluster?
How many nodes does your cluster have, and how did you determine that it is a split brain? Do all the nodes work independently without joining the cluster? What mechanism do your nodes use to join the cluster (Zen, or a cloud plugin such as cloud-aws)?
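A rough sketch of how that per-node check could be scripted, in case it helps. It assumes each node exposes HTTP on port 9200 without authentication; the host names in the commented usage are hypothetical placeholders, and a node that cannot find a master will likely return an error or time out, which is recorded as text rather than raised.

```python
import urllib.error
import urllib.request


def fetch_master(node, timeout=5):
    """Return the raw _cat/master line reported by one node, or an error note."""
    url = "http://%s/_cat/master" % node
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.read().decode("utf-8").strip()
    except (urllib.error.URLError, OSError) as exc:
        # No elected master, connection refused, timeout, and so on.
        return "ERROR: %s" % exc


def masters_by_node(nodes, fetch=fetch_master):
    """Map each node address to whatever it reports as the current master."""
    return {node: fetch(node) for node in nodes}


def distinct_masters(report):
    """Set of distinct answers; more than one entry means the nodes disagree."""
    return set(report.values())


# Example usage (hypothetical host names, replace with your own):
# report = masters_by_node(["es-master-1:9200", "es-master-2:9200"])
# for node, answer in sorted(report.items()):
#     print(node, "->", answer)
```

If every node reports the same master line, the cluster agrees on one master; several different lines, or nothing but errors, point to a split or a stalled election.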
Hi, I have the following setting:
The old master in this case is completely gone.
I'm thinking that this is split brain, since none of the masters will actually hold an election as they are still looking for the old master.
We have 30 nodes in total: 3 master, 2 client, and 25 data nodes.
No nodes work independently.
discovery.zen.ping.unicast.hosts: [ List of hosts here ]
We also have the following set:
However, we are not in AWS; we run on an internal cloud solution similar to VMware.
If you have 3 master eligible nodes, you should have discovery.zen.minimum_master_nodes set to 2, not 1, as described in the Definitive Guide.
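For reference, the value comes from the usual quorum rule: a strict majority of the master-eligible nodes, i.e. (master_eligible / 2) + 1 with integer division. A small sketch of the arithmetic follows; the function name is just illustrative and is not part of Elasticsearch.

```python
def minimum_master_nodes(master_eligible):
    """Quorum (strict majority) of master-eligible nodes.

    Illustrative helper only: Elasticsearch does not compute this for you,
    it has to be written into discovery.zen.minimum_master_nodes.
    """
    if master_eligible < 1:
        raise ValueError("need at least one master-eligible node")
    return master_eligible // 2 + 1


# Only master-eligible nodes count, not client or data-only nodes.
print(minimum_master_nodes(3))
```

With 3 master-eligible nodes this gives 2. With the value set to 1, any single master-eligible node that loses contact with the others can elect itself, which is how a split brain becomes possible.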
Yes, I just found that, thank you. We use Chef to write out the config; now to figure out why Chef suddenly started writing a 1 rather than a 2.
I've changed it to 2; however, it's still looking for the old master node. Is there a cache somewhere I need to clear?
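One way to catch this kind of drift is a small check run against the rendered config after each Chef run. The sketch below assumes the setting is written as a flat `key: value` line, as in the unicast hosts line above; the file path in the commented usage is an assumption and may differ on your install.

```python
import re

SETTING = "discovery.zen.minimum_master_nodes"


def read_minimum_master_nodes(config_text):
    """Return the configured integer value, or None if the flat key is absent.

    Commented-out lines are ignored because the key must start the line.
    """
    pattern = re.compile(
        r"^\s*" + re.escape(SETTING) + r"\s*:\s*(\d+)\s*(?:#.*)?$"
    )
    for line in config_text.splitlines():
        match = pattern.match(line)
        if match:
            return int(match.group(1))
    return None


def config_is_safe(config_text, master_eligible):
    """True if the configured value is at least a majority of master-eligible nodes."""
    value = read_minimum_master_nodes(config_text)
    return value is not None and value >= master_eligible // 2 + 1


# Example usage (path is an assumption):
# with open("/etc/elasticsearch/elasticsearch.yml") as fh:
#     print(config_is_safe(fh.read(), master_eligible=3))
```

Wiring a check like this into the deployment would flag a rendered value of 1 before the node is restarted with it.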
I don't believe there is any cache mechanism. You could do a quick reboot of the nodes, though I believe that will mean a good amount of downtime.