Patched a rpm based ES 2.3.2 cluster to 2.3.3 last friday, since we're seen multiple events like this:
[2016-05-20 09:23:13,302][INFO ][node ] [d1r1n1] started
[2016-05-20 09:23:26,212][DEBUG][action.admin.indices.create] [d1r1n1] no known master node, scheduling a retry
[2016-05-20 09:23:56,783][DEBUG][action.admin.indices.create] [d1r1n1] no known master node, scheduling a retry
[2016-05-20 09:24:11,558][DEBUG][action.admin.indices.create] [d1r1n1] no known master node, scheduling a retry
[2016-05-20 09:24:26,213][DEBUG][action.admin.indices.create] [d1r1n1] timed out while retrying [indices:admin/create] after failure (timeout [1m])
[2016-05-20 09:24:26,216][WARN ][rest.suppressed ] /_bulk Params: {}
ClusterBlockException[blocked by: [SERVICE_UNAVAILABLE/1/state not recovered / initialized];[SERVICE_UNAVAILABLE/2/no master];]
at org.elasticsearch.cluster.block.ClusterBlocks.globalBlockedException(ClusterBlocks.java:154)
at org.elasticsearch.cluster.block.ClusterBlocks.globalBlockedRaiseException(ClusterBlocks.java:144)
at org.elasticsearch.action.bulk.TransportBulkAction.executeBulk(TransportBulkAction.java:212)
at org.elasticsearch.action.bulk.TransportBulkAction.access$000(TransportBulkAction.java:71)
at org.elasticsearch.action.bulk.TransportBulkAction$1.onFailure(TransportBulkAction.java:150)
at org.elasticsearch.action.support.TransportAction$1.onFailure(TransportAction.java:95)
at org.elasticsearch.action.support.ThreadedActionListener$2.doRun(ThreadedActionListener.java:104)
at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37)
at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
at java.lang.Thread.run(Unknown Source)
tried stopping all nodes and restarting them again,
but cluster only semi seems to work, (data in and out), hints appreciated on howto investigate this further, TIA
After stopping all and restarting again we have:
> [root@d1r1n1 ~]# curl -XGET "http://`hostname`:9200/_cat/nodes"
> <redacted>.170 <redacted>.170 14 99 0.18 d * d1r1n6
> <redacted>.176 <redacted>.176 4 99 0.14 d m d1r1n12
> <redacted>.168 <redacted>.168 10 98 0.67 d m d1r1n4
> <redacted>.165 <redacted>.165 8 99 0.19 d m d1r1n1
> <redacted>.178 <redacted>.178 13 99 0.38 d m d1r1n14
> <redacted>.169 <redacted>.169 18 93 0.41 d m d1r1n5
> <redacted>.175 <redacted>.175 8 99 0.04 d m d1r1n11
> <redacted>.183 <redacted>.183 4 47 0.12 c - kibana/perf
> <redacted>.177 <redacted>.177 7 99 0.45 d m d1r1n13
> <redacted>.171 <redacted>.171 13 99 0.19 d m d1r1n7
> <redacted>.172 <redacted>.172 10 99 0.33 d m d1r1n8
> <redacted>.166 <redacted>.166 8 99 0.24 d m d1r1n2
> <redacted>.174 <redacted>.174 10 99 0.29 d m d1r1n10
> <redacted>.167 <redacted>.167 8 97 0.47 d m d1r1n3