Elasticsearch cluster crashed when 1 node got IO issues

Andriy_Pyshchyk · November 18, 2019, 10:52am

Hi Everyone,

currently I'm using ES 6.6.1, in 48 nodes cluster, replication factor 1, and faced an issue when 1 of nodes had IO issues - whole cluster got to Red state, all indices got red, couldn't execute _cat/nodes. In logs got lots of errors: [2019-11-16T19:50:59,398][WARN ][r.suppressed ] [es1-master-01-...] path: /.kibana/doc/kql-telemetry%3Akql-telemetry, params: {index=.kibana, id=kql-telemetry:kql-telemetry, type=doc}

And errors were not connected to node which actually failed

Only removing sick node helped cluster to recover. Is it any way to tell Elasticsearch to remove more from cluster if it doesn't respond for a while?

system · December 16, 2019, 10:52am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
First steps troubleshooting ES cluster crashes? Elasticsearch	9	3536	March 3, 2018
Recovery from red ES node and red indices Elasticsearch	4	572	July 6, 2017
Help for removing a crashed node? Elasticsearch	5	1054	July 5, 2017
Nodes out of cluster- ES crash- crash logs? Elasticsearch	1	445	August 17, 2017
How to fix elasticsearch red status Elasticsearch	3	779	December 22, 2016

Elasticsearch cluster crashed when 1 node got IO issues

Related topics