Rolling upgrade from 2.1.0 to 2.1.1 failed. Please help identify the problem


(Makeyang) #1

I stopped one of the nodes, upgraded it, and restarted it. Below is the discovery log; it seems the server can connect to all of the other nodes.
[2015-12-25 10:57:00,313][TRACE][discovery.zen.ping.unicast] [192.168.200.190] [119] received response from {#zen_unicast_3#}{192.168.200.196}{192.168.200.196:9390}: [
ping_response{node [{192.168.200.190}{QDVnI6x7RQ2u3T3zBsKQgw}{192.168.200.190}{192.168.200.190:9390}{max_local_storage_nodes=20, master=false}], id[351], master [null], hasJoinedOnce [false], cluster_name[21test]},
ping_response{node [{192.168.200.190}{QDVnI6x7RQ2u3T3zBsKQgw}{192.168.200.190}{192.168.200.190:9390}{max_local_storage_nodes=20, master=false}], id[352], master [null], hasJoinedOnce [false], cluster_name[21test]},
ping_response{node [{192.168.200.190}{QDVnI6x7RQ2u3T3zBsKQgw}{192.168.200.190}{192.168.200.190:9390}{max_local_storage_nodes=20, master=false}], id[353], master [null], hasJoinedOnce [false], cluster_name[21test]},
ping_response{node [{192.168.200.190}{QDVnI6x7RQ2u3T3zBsKQgw}{192.168.200.190}{192.168.200.190:9390}{max_local_storage_nodes=20, master=false}], id[354], master [null], hasJoinedOnce [false], cluster_name[21test]},
ping_response{node [{192.168.200.190}{QDVnI6x7RQ2u3T3zBsKQgw}{192.168.200.190}{192.168.200.190:9390}{max_local_storage_nodes=20, master=false}], id[355], master [null], hasJoinedOnce [false], cluster_name[21test]},
ping_response{node [{192.168.200.190}{QDVnI6x7RQ2u3T3zBsKQgw}{192.168.200.190}{192.168.200.190:9390}{max_local_storage_nodes=20, master=false}], id[356], master [null], hasJoinedOnce [false], cluster_name[21test]},
ping_response{node [{192.168.200.196}{FzxGsQclQ22izuwoi-wTpQ}{192.168.200.196}{192.168.200.196:9390}], id[1081], master [{192.168.200.196}{FzxGsQclQ22izuwoi-wTpQ}{192.168.200.196}{192.168.200.196:9390}], hasJoinedOnce [true], cluster_name[21test]}]

But it keeps logging: [DEBUG][action.admin.indices.create] [192.168.200.190] no known master node, scheduling a retry
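For what it's worth, a quick way to confirm the node really sees no elected master is to ask it over HTTP using the standard cat and health APIs (the host and http.port 9290 are taken from the config posted below):

# Prints the elected master as this node sees it; empty output means none
curl -s 'http://192.168.200.190:9290/_cat/master?v'
# While there is no master this eventually returns a 503 / master_not_discovered_exception
curl -s 'http://192.168.200.190:9290/_cluster/health?pretty'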


(Makeyang) #2

Cluster topology: 4 nodes; three of them are master+data nodes, the other is a data-only node.
The config is below:
cluster.name: 21test
node.name: 192.168.200.190
node.master: false
node.data: true
path.data: /export/Data/elasticsearch/es219290
path.logs: /export/Logs/elasticsearch/es219290
path.plugins: /export/plugins/elasticsearch/es219290
bootstrap.mlockall: true
network.host: 192.168.200.190
http.port: 9290
transport.tcp.port: 9390
discovery.zen.ping.unicast.hosts: ["192.168.200.191:9390","192.168.200.192:9390","192.168.200.196:9390"]
discovery.zen.minimum_master_nodes: 3
indices.memory.index_buffer_size: 20%
script.engine.groovy.inline.aggs: on
index.translog.sync_interval: 30s
index.translog.durability: async
index.refresh_interval: 180s
index.max_merged_segment: 1g
node.max_local_storage_nodes: 20
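As a sanity check of the roles, the standard cat API on one of the healthy nodes lists which nodes are master-eligible (this assumes 192.168.200.196 also serves HTTP on port 9290, like this node; that port is not shown in the thread):

# master column: * = elected master, m = master-eligible, - = not eligible; node.role: d = data
curl -s 'http://192.168.200.196:9290/_cat/nodes?v'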


(Christian Dahlqvist) #3

If you have 3 master-eligible nodes in the cluster, minimum_master_nodes should be set to 2, not 3. The setting should be a quorum of the master-eligible nodes, (N / 2) + 1 with integer division: with 3 master-eligible nodes that is 2. A value of 3 would however be the correct setting if all 4 nodes were master-eligible.
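A minimal sketch of the fix under that rule (the setting is dynamic in 2.x, so it can also be applied without a restart; the node address and HTTP port in the curl example are assumptions based on the config posted above):

# elasticsearch.yml on each node: quorum of 3 master-eligible nodes = (3 / 2) + 1 = 2
discovery.zen.minimum_master_nodes: 2

# Or at runtime via the cluster settings API (assumes 192.168.200.196 serves HTTP on 9290)
curl -XPUT 'http://192.168.200.196:9290/_cluster/settings' -d '{
  "persistent": { "discovery.zen.minimum_master_nodes": 2 }
}'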

