Would a node failure take down the cluster?

Jtomage · October 8, 2020, 11:53pm

I'm trying to troubleshoot why a three-node cluster, in which every node is both master-eligible and a data node, would go down. I'm mainly looking for direction on where to look for issues, and for any possible solutions.

Currently running version 7.0.1
Indices - 19
Max Memory - 38.8 GB
Total Shards - 38
Documents - 8,136,787
Data Size - 507.2 GB

Going through the logs, I found a lot of the following exceptions that seem to chain together.

CircuitBreakingException: Data too large, data for [<transport_request>]
Cluster health status changed from [GREEN] to [YELLOW]
failed to list shard for shard_store on node
AlreadyClosedException engine is closed
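
For the first exception, one place to look is the per-node circuit breaker statistics, which Elasticsearch exposes at `GET /_nodes/stats/breaker`. The following is only a minimal sketch of reading that response, not a diagnosis of this cluster: the `localhost:9200` address, the helper names `summarize_breakers` and `fetch_breaker_stats`, and the 0.8 warning ratio are all assumptions made for illustration.

```python
import json
from urllib.request import urlopen


def summarize_breakers(stats, warn_ratio=0.8):
    """Return (node, breaker, used_ratio, tripped) rows for breakers that have
    tripped at least once or whose estimated usage is at or above warn_ratio
    of their limit. `stats` is the parsed JSON body of /_nodes/stats/breaker."""
    rows = []
    for node in stats.get("nodes", {}).values():
        for name, breaker in node.get("breakers", {}).items():
            limit = breaker.get("limit_size_in_bytes", 0)
            used = breaker.get("estimated_size_in_bytes", 0)
            ratio = used / limit if limit > 0 else 0.0
            tripped = breaker.get("tripped", 0)
            if tripped > 0 or ratio >= warn_ratio:
                rows.append((node.get("name", "?"), name, round(ratio, 2), tripped))
    # Breakers closest to their limit come first.
    return sorted(rows, key=lambda row: row[2], reverse=True)


def fetch_breaker_stats(base_url="http://localhost:9200"):
    # Assumed address: adjust host, port, and authentication for your cluster.
    with urlopen(f"{base_url}/_nodes/stats/breaker") as resp:
        return json.load(resp)
```

Calling `summarize_breakers(fetch_breaker_stats())` against a live node would list the breakers worth a closer look. A `parent` breaker that keeps tripping usually points at overall heap pressure on that node rather than at a single oversized request.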

Thank you

DavidTurner · October 9, 2020, 9:12am

The 7.0.x series is pretty old and passes the end of its supported life tomorrow. There have been a good number of resiliency improvements since its release. Before digging deeper I suggest you upgrade to the latest version.

Jtomage · October 9, 2020, 3:57pm

Thanks, will recommend upgrading. Going to see if I can get a server to test upgrading.

