Cluster goes yellow abruptly

We're using Elasticsearch 7.2 in production, and lately we've been observing our cluster going yellow quite often even though none of the nodes left the cluster!

Whenever the cluster has gone yellow, we've seen a sudden drop in indices.store.size_in_bytes on the problematic node. So far it has always been a single node that behaved badly. At the same time, there were a couple of 429 rejections (parent circuit breaker trips). I'm not sure whether the destabilized cluster is the cause of the circuit breaker tripping, or whether the circuit breaker tripping is the cause of the node becoming inaccessible (note that it doesn't look like all the data is being deleted; it just drops by 300 GB or so).
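
To narrow down which comes first, something like the following could be left running while the problem recurs. It's a minimal sketch (host and poll interval are assumptions, adjust for your cluster) that polls the stock cluster health and breaker stats APIs and prints the cluster status next to each node's parent breaker trip counter:

```python
# Minimal sketch: correlate cluster status with parent circuit-breaker trips.
#   GET _cluster/health        -> cluster status (green/yellow/red)
#   GET _nodes/stats/breaker   -> per-node breaker trip counters
import json
import time
import urllib.request

ES = "http://localhost:9200"  # assumption: adjust to your cluster address

def get(path):
    with urllib.request.urlopen(ES + path) as resp:
        return json.load(resp)

while True:
    health = get("/_cluster/health")
    breakers = get("/_nodes/stats/breaker")
    trips = {
        node["name"]: node["breakers"]["parent"]["tripped"]
        for node in breakers["nodes"].values()
    }
    print(time.strftime("%H:%M:%S"), health["status"], trips)
    time.sleep(30)  # assumption: 30s polling is frequent enough here
```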

Regarding the cluster setup:
We have an 8-core, 64 GB machine, and the JVM heap size is 30 GB. We have taken care of https://github.com/elastic/elasticsearch/pull/46169 as well. We make use of AWS NVMe SSDs (which means we lose the data if the instance is stopped, but in this case the instance was up and the node never left the cluster).
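
For reference, a quick way to confirm per node that the heap really is 30 GB and that compressed oops are still in use is the _nodes/jvm API; a small sketch (host is an assumption):

```python
# Sketch: confirm heap size and compressed-oops status per node via GET _nodes/jvm.
import json
import urllib.request

ES = "http://localhost:9200"  # assumption: adjust to your cluster address

with urllib.request.urlopen(ES + "/_nodes/jvm") as resp:
    nodes = json.load(resp)["nodes"]

for node in nodes.values():
    jvm = node["jvm"]
    heap_gb = jvm["mem"]["heap_max_in_bytes"] / 1024 ** 3
    oops = jvm.get("using_compressed_ordinary_object_pointers", "unknown")
    print(f'{node["name"]}: heap_max={heap_gb:.1f} GB, compressed oops={oops}')
```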

I really doubt that our ingestion rate is the problem (around 8k updates per minute). Last week we ran our indexing job, which was ingesting around 1M per minute, and that didn't destabilize the cluster.
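
For what it's worth, the actual rate can be measured rather than estimated. A rough sketch (host and sampling interval are assumptions) that samples the cluster-wide indexing counters twice and diffs them:

```python
# Sketch: measure index/delete ops per minute from GET _stats/indexing.
import json
import time
import urllib.request

ES = "http://localhost:9200"  # assumption: adjust to your cluster address
INTERVAL = 60                 # assumption: sample one minute apart

def indexing_totals():
    with urllib.request.urlopen(ES + "/_stats/indexing") as resp:
        stats = json.load(resp)["_all"]["total"]["indexing"]
    return stats["index_total"], stats["delete_total"]

idx0, del0 = indexing_totals()
time.sleep(INTERVAL)
idx1, del1 = indexing_totals()
print(f"index ops/min:  {(idx1 - idx0) * 60 / INTERVAL:.0f}")
print(f"delete ops/min: {(del1 - del0) * 60 / INTERVAL:.0f}")
```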

Our use case involves a lot of regular updates and periodic batched deletes.
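
For illustration only, a batched delete of the kind described could look like the sketch below; the index name, timestamp field, and retention window are placeholders rather than our real values, and _delete_by_query is just one way to do it:

```python
# Hedged sketch: a periodic batched delete via the standard _delete_by_query API.
import json
import urllib.request

ES = "http://localhost:9200"  # assumption: adjust to your cluster address
INDEX = "my-index"            # hypothetical index name
query = {"query": {"range": {"@timestamp": {"lt": "now-30d"}}}}  # hypothetical field/window

req = urllib.request.Request(
    f"{ES}/{INDEX}/_delete_by_query?conflicts=proceed",
    data=json.dumps(query).encode(),
    headers={"Content-Type": "application/json"},
    method="POST",
)
with urllib.request.urlopen(req) as resp:
    print(json.load(resp))
```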

What do the logs from the master show around that time?

@warkolm I think it's a duplicate of "Shards getting marked as stale frequently causing cluster to go yellow".

I have been able to correlate it with the times when we are indexing/updating huge documents, around 5-10 MB each. In the GC logs I've been seeing humongous allocations.
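
To see how closely these line up with the big documents, a crude sketch that buckets GC-log lines mentioning "humongous" by minute (the log path is a placeholder, and it assumes lines start with a bracketed ISO timestamp, as with the default unified-logging decorators):

```python
# Sketch: count GC-log lines mentioning "humongous" per minute.
import re
from collections import Counter

GC_LOG = "/var/log/elasticsearch/gc.log"  # hypothetical path

per_minute = Counter()
with open(GC_LOG) as f:
    for line in f:
        if "humongous" in line.lower():
            # assumes a prefix like [2019-10-01T12:34:56.789+0000]
            m = re.match(r"\[(\d{4}-\d{2}-\d{2}T\d{2}:\d{2})", line)
            if m:
                per_minute[m.group(1)] += 1

for minute, count in sorted(per_minute.items()):
    print(minute, count)
```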

https://github.com/elastic/elasticsearch/pull/46169 is taken care of, but the IHOP is still adaptive. So isn't it possible that InitiatingHeapOccupancyPercent may grow back up to 70% and we'll face the same issue again?
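
One way to check whether adaptive IHOP is actually still in play is to list the G1/IHOP-related JVM arguments each node was started with (a sketch, host assumed); if -XX:-G1UseAdaptiveIHOP isn't among them, the configured InitiatingHeapOccupancyPercent is only the initial threshold and the collector can move it over time:

```python
# Sketch: list G1/IHOP-related JVM arguments per node via GET _nodes/jvm.
import json
import urllib.request

ES = "http://localhost:9200"  # assumption: adjust to your cluster address

with urllib.request.urlopen(ES + "/_nodes/jvm") as resp:
    nodes = json.load(resp)["nodes"]

for node in nodes.values():
    args = node["jvm"]["input_arguments"]
    relevant = [a for a in args if "G1" in a or "Occupancy" in a]
    adaptive_disabled = "-XX:-G1UseAdaptiveIHOP" in args
    print(node["name"], relevant, "adaptive IHOP explicitly disabled:", adaptive_disabled)
```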
