Thanks for the reply
telnet is working from all 3 servers
also checked and OK:
curl 10.15.20.10:9200
curl 10.15.20.11:9200
curl 10.15.20.12:9200
[elasticsearch@master ~]$ curl -XGET 'http://10.15.20.10:9200/_nodes/transport?pretty'
{
"_nodes" : {
"total" : 1,
"successful" : 1,
"failed" : 0
},
"cluster_name" : "elkcl",
"nodes" : {
"H4pskB4fQOOmJT4m74goIg" : {
"name" : "master",
"transport_address" : "10.15.20.10:9300",
"host" : "10.15.20.10",
"ip" : "10.15.20.10",
"version" : "5.2.0",
"build_hash" : "24e05b9",
"roles" : [
"master",
"ingest"
],
"transport" : {
"bound_address" : [
"10.15.20.10:9300"
],
"publish_address" : "10.15.20.10:9300",
"profiles" : { }
}
}
}
}
[elasticsearch@ptktl-elkdev2 ~]$ curl -XGET 'http://10.15.20.11:9200/_nodes/transport?pretty'
{
"_nodes" : {
"total" : 1,
"successful" : 1,
"failed" : 0
},
"cluster_name" : "elkcl",
"nodes" : {
"H4pskB4fQOOmJT4m74goIg" : {
"name" : "ptktl-elkdev2",
"transport_address" : "10.15.20.11:9300",
"host" : "10.15.20.11",
"ip" : "10.15.20.11",
"version" : "5.2.0",
"build_hash" : "24e05b9",
"roles" : [
"data",
"ingest"
],
"transport" : {
"bound_address" : [
"10.15.20.11:9300"
],
"publish_address" : "10.15.20.11:9300",
"profiles" : { }
}
}
}
}
[elasticsearch@ptktl-elkdev2 ~]$ curl -XGET 'http://10.15.20.12:9200/_nodes/transport?pretty'
{
"_nodes" : {
"total" : 1,
"successful" : 1,
"failed" : 0
},
"cluster_name" : "elkcl",
"nodes" : {
"H4pskB4fQOOmJT4m74goIg" : {
"name" : "ptktl-elkdev3",
"transport_address" : "10.15.20.12:9300",
"host" : "10.15.20.12",
"ip" : "10.15.20.12",
"version" : "5.2.0",
"build_hash" : "24e05b9",
"roles" : [
"data",
"ingest"
],
"transport" : {
"bound_address" : [
"10.15.20.12:9300"
],
"publish_address" : "10.15.20.12:9300",
"profiles" : { }
}
}
}
}
I want to have one master and 2 data machine
configure the yml as follow:
master node:
cluster.name: elastic
node.name: master
node.master: true
node.data: false
bootstrap.system_call_filter: false
network.host: 10.15.20.10
transport.host: 10.15.20.10
transport.tcp.port: 9300
http.port: 9200
network.publish_host: 10.15.20.10
discovery.zen.ping.unicast.hosts: ["10.15.20.10:9300", "10.15.20.11:9300", "10.15.20.12:9300"]
discovery.zen.minimum_master_nodes: 1
discovery.zen.join_timeout: 90s
discovery.zen.ping_timeout: 90s
data 1+2:
cluster.name: elasic
node.name: data1
node.master: false
node.data: true
bootstrap.system_call_filter: false
transport.host: 10.15.20.11
transport.tcp.port: 9300
http.port: 9200
network.host: 10.15.20.11
network.publish_host: 10.15.20.11
discovery.zen.ping.unicast.hosts: ["10.15.20.10:9300", "10.15.20.11:9300", "10.15.20.12:9300"]
discovery.zen.minimum_master_nodes: 1
discovery.zen.join_timeout: 90s
discovery.zen.ping_timeout: 90s
[elasticsearch@master ~]$ curl http://10.15.20.10:9200/_cluster/health?pretty=true
{
"error" : {
"root_cause" : [
{
"type" : "master_not_discovered_exception",
"reason" : null
}
],
"type" : "master_not_discovered_exception",
"reason" : null
},
"status" : 503
}
in the log:
[INFO ][o.e.t.TransportService ] [master] publish_address {10.15.20.10:9300}, bound_addresses {10.15.20.10:9300}
[INFO ][o.e.b.BootstrapChecks ] [master] bound or publishing to a non-loopback or non-link-local address, enforcing bootstrap checks
[WARN ][o.e.n.Node ] [master] timed out while waiting for initial discovery state - timeout: 30s
[INFO ][o.e.h.HttpServer ] [master] publish_address {10.15.20.10:9200}, bound_addresses {10.15.20.10:9200}
[INFO ][o.e.n.Node ] [master] started
[DEBUG][o.e.a.a.c.h.TransportClusterHealthAction] [master] no known master node, scheduling a retry
[DEBUG][o.e.a.a.c.h.TransportClusterHealthAction] [master] timed out while retrying [cluster:monitor/health] after failure (timeout [30s])
[WARN ][r.suppressed ] path: /_cluster/health, params: {pretty=true}
org.elasticsearch.discovery.MasterNotDiscoveredException: null
at org.elasticsearch.action.support.master.TransportMasterNodeAction$AsyncSingleAction$4.onTimeout(TransportMasterNodeAction.java:211) [elasticsearch-5.2.0.jar:5.2.0]
at org.elasticsearch.cluster.ClusterStateObserver$ContextPreservingListener.onTimeout(ClusterStateObserver.java:307) [elasticsearch-5.2.0.jar:5.2.0]
at org.elasticsearch.cluster.ClusterStateObserver$ObserverClusterStateListener.onTimeout(ClusterStateObserver.java:237) [elasticsearch-5.2.0.jar:5.2.0]
at org.elasticsearch.cluster.service.ClusterService$NotifyTimeout.run(ClusterService.java:1157) [elasticsearch-5.2.0.jar:5.2.0]
at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:527) [elasticsearch-5.2.0.jar:5.2.0]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_101]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:1.8.0_101]
at java.lang.Thread.run(Thread.java:745) [?:1.8.0_101]
[INFO ][o.e.d.z.ZenDiscovery ] [master] failed to send join request to master [{master}{H4pskB4fQOOmJT4m74goIg}{zEXhVspzSuy69JgbEZbRoQ}{10.15.20.10}{10.15.20.10:9300}], reason [RemoteTransportException[[master][10.15.20.10:9300][internal:discovery/zen/join]]; nested: NotMasterException[Node [{master}{H4pskB4fQOOmJT4m74goIg}{Y0Bi0UqcT-e0mHFC5OYasg}{10.15.20.10}{10.15.20.10:9300}] not master for join request]; ], tried [3] times
Any suggestions?
Also if I change the discovery.zen.minimum_master_nodes: 2 its still not working
Please advise