Hi guys,
I'm using the following elasticsearch.yaml:
cluster.name: test
node.name: ${HOSTNAME}
network.host: _eth0:ipv4_
discovery.zen.ping.unicast.hosts: ["public_ip_1", "public_ip_2"]
discovery.zen.minimum_master_nodes: 1
discovery.zen.ping_timeout: 10s
One host is successfully elected master, but the second one fails to join it.
Logs from the node that is expected to join (the non-master):
connected to node [{#zen_unicast_2#}{ekYs0cmNQbGPTFwL2ZZuFQ}{172.31.10.156}{172.31.10.156:9300}]
disconnecting from [{#zen_unicast_2#}{ekYs0cmNQbGPTFwL2ZZuFQ}{172.31.10.156}{172.31.10.156:9300}] due to explicit disconnect call
filtered ping responses: (ignore_non_masters [false])
--> ping_response{node [{dcc65eaca77f}{TzGpUkVZTbSpC6JOYczpfw}{bICdACsIRQmHyK8ULAHJGg}{172.17.0.2}{172.17.0.2:9300}], id[564], master [{dcc65eaca77f}{TzGpUkVZTbSpC6JOYczpfw}{bICdACsIRQmHyK8ULAHJGg}{172.17.0.2}{172.17.0.2:9300}], cluster_state_version [2], cluster_name[mrgiggy-cluster-sly]}
--> ping_response{node [{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}], id[1849], master [null], cluster_state_version [-1], cluster_name[mrgiggy-cluster-sly]}
processing [zen-disco-election-stop [{dcc65eaca77f}{TzGpUkVZTbSpC6JOYczpfw}{bICdACsIRQmHyK8ULAHJGg}{172.17.0.2}{172.17.0.2:9300} elected]]: execute
processing [zen-disco-election-stop [{dcc65eaca77f}{TzGpUkVZTbSpC6JOYczpfw}{bICdACsIRQmHyK8ULAHJGg}{172.17.0.2}{172.17.0.2:9300} elected]]: took [0s] no change in cluster_state
disconnecting from [{#zen_unicast_1#}{E_PHzBqzRUuv-8-dK79lHA}{172.31.10.157}{172.31.10.157:9300}] due to explicit disconnect call
processing [zen-disco-node-join[{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}]]: execute
cluster state update task [zen-disco-node-join[{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}]] failed
org.elasticsearch.cluster.NotMasterException: Node [{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}] not master for join request
processing [zen-disco-node-join[{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}]]: took [0s] no change in cluster_state
processing [zen-disco-node-join[{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}]]: execute
cluster state update task [zen-disco-node-join[{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}]] failed
org.elasticsearch.cluster.NotMasterException: Node [{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}] not master for join request
processing [zen-disco-node-join[{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}]]: took [0s] no change in cluster_state
processing [zen-disco-node-join[{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}]]: execute
cluster state update task [zen-disco-node-join[{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}]] failed
org.elasticsearch.cluster.NotMasterException: Node [{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}] not master for join request
processing [zen-disco-node-join[{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}]]: took [0s] no change in cluster_state
failed to send join request to master [{dcc65eaca77f}{TzGpUkVZTbSpC6JOYczpfw}{bICdACsIRQmHyK8ULAHJGg}{172.17.0.2}{172.17.0.2:9300}], reason [RemoteTransportException[[3bd521b16904][172.17.0.2:9300][internal:discovery/zen/join]]; nested: NotMasterException[Node [{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}] not master for join request]; ], tried [3] times
processing [finalize_join ({dcc65eaca77f}{TzGpUkVZTbSpC6JOYczpfw}{bICdACsIRQmHyK8ULAHJGg}{172.17.0.2}{172.17.0.2:9300})]: execute
processing [finalize_join ({dcc65eaca77f}{TzGpUkVZTbSpC6JOYczpfw}{bICdACsIRQmHyK8ULAHJGg}{172.17.0.2}{172.17.0.2:9300})]: took [0s] no change in cluster_state
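To make the ping responses easier to read, here is a rough Python sketch that pulls the node name and transport address out of each ping_response line and groups the names by address. The regex is only my assumption about the log format, based on the lines above, and the sample lines are shortened copies of the two ping responses (the node part only). It shows that both nodes report 172.17.0.2:9300, which is not one of the 172.31.10.x addresses from the connect/disconnect lines. I don't know whether that is the cause.

```python
import re
from collections import defaultdict

# Shortened copies of the two ping_response lines from the log above
# (only the "node [...]" part and the id are kept).
LOG_LINES = [
    "--> ping_response{node [{dcc65eaca77f}{TzGpUkVZTbSpC6JOYczpfw}"
    "{bICdACsIRQmHyK8ULAHJGg}{172.17.0.2}{172.17.0.2:9300}], id[564]",
    "--> ping_response{node [{3bd521b16904}{zRlNhZEHRx-czZnRu8rMAw}"
    "{FgQgX8X0SW2b1QevmCVf6A}{172.17.0.2}{172.17.0.2:9300}], id[1849]",
]

# Assumed layout: {name}{node id}{ephemeral id}{host}{transport address}
NODE_RE = re.compile(
    r"ping_response\{node \[\{([^}]*)\}\{[^}]*\}\{[^}]*\}\{[^}]*\}\{([^}]*)\}\]"
)


def nodes_by_address(lines):
    """Map each reported transport address to the set of node names using it."""
    by_addr = defaultdict(set)
    for line in lines:
        match = NODE_RE.search(line)
        if match:
            name, addr = match.groups()
            by_addr[addr].add(name)
    return dict(by_addr)


def shared_addresses(lines):
    """Return only the addresses reported by more than one node name."""
    return {
        addr: sorted(names)
        for addr, names in nodes_by_address(lines).items()
        if len(names) > 1
    }


if __name__ == "__main__":
    for addr, names in shared_addresses(LOG_LINES).items():
        print(addr, "is reported by", ", ".join(names))
```

The same function can be fed the full log file line by line; lines that are not ping responses are simply skipped.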
Any ideas?