Master not discovered yet, this node has not previously joined a bootstrapped (v7+) cluster

DavidTurner · April 11, 2019, 1:46pm

Ok, that looks like it succeeded:

[2019-04-11T13:41:51,674][INFO ][o.e.c.s.ClusterApplierService] [d-gp2-kyles-1] master node changed {previous [], current [{d-gp2-kyles-3}{DFvnIfwDRS-g1Z3nGRuogg}{0HE_55jUTOSQDJImo0GcZA}{10.124.193.72}{10.124.193.72:9300}{ml.machine_memory=4143783936, ml.max_open_jobs=20, xpack.installed=true}]}, added {{d-gp2-kyles-3}{DFvnIfwDRS-g1Z3nGRuogg}{0HE_55jUTOSQDJImo0GcZA}{10.124.193.72}{10.124.193.72:9300}{ml.machine_memory=4143783936, ml.max_open_jobs=20, xpack.installed=true},}, term: 1, version: 1, reason: ApplyCommitRequest{term=1, version=1, sourceNode={d-gp2-kyles-3}{DFvnIfwDRS-g1Z3nGRuogg}{0HE_55jUTOSQDJImo0GcZA}{10.124.193.72}{10.124.193.72:9300}{ml.machine_memory=4143783936, ml.max_open_jobs=20, xpack.installed=true}}

So now the question is, why doesn't this work for you with ${HOSTNAME}?

kyle_che · April 11, 2019, 1:46pm

actually shows it's working now

kyle_che · April 11, 2019, 1:48pm

yeah, that's weird ... and why it didn't work when i was doing my upgrade ... i believe i tried it with both ${HOSTNAME} and the actual name.

DavidTurner · April 11, 2019, 1:49pm

Can you try echo -n $HOSTNAME | xxd to see if there are any weird characters in there that aren't being logged faithfully?

kyle_che · April 11, 2019, 1:49pm

00000000: 642d 6770 322d 6b79 6c65 732d 31 d-gp2-kyles-1

kyle_che · April 11, 2019, 1:50pm

maybe because i was putting the extension and hostname does not have the extension?

kyle_che · April 11, 2019, 1:53pm

do you have to be consistent? i had node.name=hostname and i had network.host=ip_address

kyle_che · April 11, 2019, 1:54pm

this was not an issue in 6.7.1 for me

DavidTurner · April 11, 2019, 1:55pm

As I said above:

DavidTurner:

The exact strings listed here...

must discover master-eligible nodes [d-gp2-es46-1., d-gp2-es46-2., d-gp2-es46-3.]
                                     ^^^^^^^^^^^^^  ^^^^^^^^^^^^^  ^^^^^^^^^^^^^

... must match the strings here in between the first set of braces ...

have discovered [{d-gp2-es46-2}{H2bib1wCSBKGu_Ku_4DgjA}{rzokY9nmRDCBNz0lBMgUYw}{<.....>.166.183}{<.....>.166.183:9300}{ml.machine_memory=4143783936, ml.max_open_jobs=20, xpack.installed=true}
                ,{d-gp2-es46-3}{8KNzmk5uS2mZSZiftNVTDQ}{exvQChr7RPyDlkuJ-FT2Rg}{<.....>.165.141}{<.....>.165.141:9300}{ml.machine_memory=4143783936, ml.max_open_jobs=20, xpack.installed=true}]
                  ^^^^^^^^^^^^

No extensions or anything, they need to be exactly the same.

kyle_che · April 11, 2019, 2:00pm

ok, HOSTNAME gives me the short name ... so i believe the problem is that i was specifying the hostname with the extension, and it was comparing the two and failing. i just went in and put ${HOSTNAME} back and removed the extension everywhere else, and it still works. i will try the upgrade again and let you know my findings.

kyle_che · April 11, 2019, 3:24pm

yes, that was the issue ... the names have to be identical: either node_name.blah.com everywhere or just node_name everywhere. it seems that elasticsearch creates a hashcode on the node_name, so they must be identical. thanks for your help.
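
For reference, a minimal sketch of an elasticsearch.yml that follows this rule, assuming three master-eligible nodes. The name d-gp2-kyles-2 and the IP address are placeholders that do not appear in the thread; the point is only that every entry in cluster.initial_master_nodes is the same short string that ${HOSTNAME} expands to.

```yaml
# elasticsearch.yml on d-gp2-kyles-1 (illustrative sketch, not the poster's actual file)
node.name: ${HOSTNAME}        # expands to the short name, e.g. d-gp2-kyles-1
network.host: 192.0.2.10      # placeholder address

# Same list on every master-eligible node; each entry must equal the
# node.name of one node exactly, with no domain suffix added.
cluster.initial_master_nodes:
  - d-gp2-kyles-1
  - d-gp2-kyles-2             # assumed name, not shown in the thread
  - d-gp2-kyles-3
```

If the hosts reported fully-qualified names instead, the list would need the fully-qualified form on every node as well.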

safderali5 · April 12, 2019, 5:49am

I tried your recommendation, and then it popped up with this error:

org.elasticsearch.transport.RemoteTransportException: [elasticsearch-data-2][172.20.13.119:9300][internal:cluster/coordination/join/validate]
Caused by: org.elasticsearch.cluster.coordination.CoordinationStateRejectedException: join validation on cluster state with a different cluster uuid KkR7myqKQU6x02Qzd5KMIw than local cluster uuid U5B7kS8rQ12gS2sJeFfbIA, rejecting
    at org.elasticsearch.cluster.coordination.JoinHelper.lambda$new$4(JoinHelper.java:147) ~[elasticsearch-7.0.0.jar:7.0.0]
    at org.elasticsearch.xpack.security.transport.SecurityServerTransportInterceptor$ProfileSecuredRequestHandler$1.doRun(SecurityServerTransportInterceptor.java:251) ~[?:?]
    at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37) ~[elasticsearch-7.0.0.jar:7.0.0]
    at org.elasticsearch.xpack.security.transport.SecurityServerTransportInterceptor$ProfileSecuredRequestHandler.messageReceived(SecurityServerTransportInterceptor.java:309) ~[?:?]
    at org.elasticsearch.transport.RequestHandlerRegistry.processMessageReceived(RequestHandlerRegistry.java:63) ~[elasticsearch-7.0.0.jar:7.0.0]
    at org.elasticsearch.transport.TcpTransport$RequestHandler.doRun(TcpTransport.java:1077) ~[elasticsearch-7.0.0.jar:7.0.0]
    at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:751) ~[elasticsearch-7.0.0.jar:7.0.0]
    at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37) ~[elasticsearch-7.0.0.jar:7.0.0]
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_202]
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_202]
    at java.lang.Thread.run(Thread.java:748) [?:1.8.0_202]

DavidTurner · April 12, 2019, 6:12am

@safderali5 would you open another thread about your issue(s) - this thread is marked as resolved, and the problems you're facing are different from the ones we dug into here.

safderali5 · April 12, 2019, 6:17am

OK, I will post it in a different thread.

DavidTurner · April 12, 2019, 1:12pm

We have added a note to the docs to clarify this point lest it catch anyone else out:

The node names used in this list must exactly match the node.name properties of the nodes. By default the node name is set to the machine’s hostname which may or may not be fully-qualified depending on your system configuration. If each node name is a fully-qualified domain name such as master-a.example.com then you must use fully-qualified domain names in the cluster.initial_master_nodes list too; conversely if your node names are bare hostnames (without the .example.com suffix) then you must use bare hostnames in the cluster.initial_master_nodes list. If you use a mix of fully-qualified and bare hostnames, or there is some other mismatch between node.name and cluster.initial_master_nodes, then the cluster will not form successfully and you will see log messages like the following.

[master-a.example.com] master not discovered yet, this node has not previously joined a bootstrapped (v7+) cluster, and this node must discover master-eligible nodes [master-a, master-b] to bootstrap a cluster: have discovered [{master-b.example.com}{...

This message shows the node names master-a.example.com and master-b.example.com as well as the cluster.initial_master_nodes entries master-a and master-b, and it is apparent that they do not match exactly.
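
A sketch of a configuration that would avoid the mismatch in that log message, using the example names from the note. Only the two setting names come from the docs; the file layout is an assumption for illustration.

```yaml
# elasticsearch.yml on the first node (illustrative sketch)
node.name: master-a.example.com

# Entries use the same fully-qualified form as node.name on each node.
cluster.initial_master_nodes:
  - master-a.example.com
  - master-b.example.com
```

The bare-hostname variant is equally valid, as long as node.name is then master-a and master-b on the respective nodes.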

jswid · April 12, 2019, 7:48pm

I am also running on Kubernetes and had an issue bootstrapping the cluster in 7. My solution was to set an environment variable:

- name: cluster.initial_master_nodes
  valueFrom:
    fieldRef:
      fieldPath: metadata.name

I'm not sure this is the best solution; ideally I'd just configure the cluster to require 2 of 3 masters at any time, like in the old ES. With this setting, when the cluster first starts, the first node will already consider itself ready, and there is a risk that the nodes form separate clusters if they don't find the other masters immediately. Hopefully it will only matter once, but I'm not really sure yet how things will work out when updating the cluster.

DavidTurner · April 13, 2019, 7:45am

@jswid your formatting was mangled, but assuming you mean the following:

- name: cluster.initial_master_nodes
  valueFrom:
    fieldRef:
      fieldPath: metadata.name

This is not recommended. From the docs:

You must set cluster.initial_master_nodes to the same list of nodes on each node on which it is set in order to be sure that only a single cluster forms during bootstrapping and therefore to avoid the risk of data loss.

With your suggestion you are configuring cluster.initial_master_nodes differently on each node, and there is a good chance that you will form more than one cluster.

jswid · April 15, 2019, 4:21pm

Yes, thanks.. i even tried to delete my comment, but I guess it didn't take. I ended up doing it a different way. I changed the Deployment to a StatefulSet, which I think is better for two reasons: one is that the node names are constant, which solves the big issue in this thread, but the other is that the masters seem to care more about a cluster uuid now, so I am now mounting a persistent volume so the nodes' data folders are no longer lost when the masters are updated.

I have a public template on github that more or less shows how I am planning on moving to ES7 on Kubernetes here: https://github.com/jswidler/elasticsearch-kubed/blob/master/templates/2_elasticsearch/es-master.yml
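
A rough sketch of the StatefulSet approach described in this post, not copied from the linked template. The names es-master and data, the storage size, and the headless Service assumed to exist under the name es-master are all hypothetical. Because a StatefulSet gives pods stable names of the form name-ordinal, the same fixed list can be set on every pod.

```yaml
# Illustrative only: assumes a headless Service called es-master already exists.
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: es-master
spec:
  serviceName: es-master
  replicas: 3
  selector:
    matchLabels:
      app: es-master
  template:
    metadata:
      labels:
        app: es-master
    spec:
      containers:
        - name: elasticsearch
          image: docker.elastic.co/elasticsearch/elasticsearch:7.0.0
          env:
            # Each pod uses its own stable pod name as node.name.
            - name: node.name
              valueFrom:
                fieldRef:
                  fieldPath: metadata.name
            # Identical on every pod; entries match the pod names exactly.
            - name: cluster.initial_master_nodes
              value: "es-master-0,es-master-1,es-master-2"
            - name: discovery.seed_hosts
              value: "es-master"
          volumeMounts:
            # Persistent data directory, so the cluster UUID survives pod restarts.
            - name: data
              mountPath: /usr/share/elasticsearch/data
  volumeClaimTemplates:
    - metadata:
        name: data
      spec:
        accessModes: ["ReadWriteOnce"]
        resources:
          requests:
            storage: 10Gi
```

Unlike the per-pod metadata.name value for cluster.initial_master_nodes discussed earlier, this keeps the list the same on every node, which is what the docs require.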

Ken_Liao1 · May 3, 2019, 4:50pm

I'm trying to set up an auto-scaling Elasticsearch cluster environment in Docker Swarm, following this page: http://derpturkey.com/elasticsearch-cluster-with-docker-engine-swarm-mode/

version: '3'  
services:  
  elasticsearch:
    image: 'elasticsearch:5'
    command: [ elasticsearch, -E, network.host=0.0.0.0, -E, discovery.zen.ping.unicast.hosts=elasticsearch, -E, discovery.zen.minimum_master_nodes=1 ]    
    volumes:
      - /elasticsearch/data:/usr/share/elasticsearch/data
    deploy:
      mode: 'global'
      placement:
        constraints: [node.labels.app_role == elasticsearch]

It automatically deploys an Elasticsearch node to every host with the label app_role == elasticsearch.
It works fine with Elasticsearch 6.7.0, but I get the same error when I upgrade to 7.0.0.
The container name changes dynamically, so what should I put in cluster.initial_master_nodes to fix it?

Thanks!

DavidTurner · May 3, 2019, 5:09pm

Hi @Ken_Liao1, it's probably best to start a new thread with your question rather than adding it to the bottom of this (rather long) one.
