Elasticsearch Cluster turns Red - Is JVM Heap main culprit?

Nikesh · February 18, 2019, 3:52pm

Hi,

I am indexing at a decent rate.
20 indices each with 10000 fields and 50000 documents continuously indexing through 9 threads.
I have a cluster with a dedicated master node and two data nodes. Each node has 16GB RAM and 8GB Heap size.
No issues were found when only indexing the above mentioned scenario although it was at peak(let's say 6.5 to 7.5 GB). JVM Heap crossed the upper limit when more indexing and search requests were performed. Cluster went to Red Status and OOM error was thrown in logs.
My doubts are :

What contributes to JVM Heap? As I have both text and keyword for a single field.
fielddata stays in in-memory but is not enabled by default and I have not changed this behaviour.
Stored_fields also contributes to JVM Heap.
I have attached Kibana Screenshot at the time of indexing (partly).
What measures can be taken to bring down Heap-Size or rather prevent Heap-Size to reach its maximum?

3.Even though I restarted my cluster, JVM Heap didn't drop down after giving it some time. What can be the causes of this?

DavidTurner · February 18, 2019, 6:58pm

Is that a typo? If not, that seems excessive and is likely to cause issues.

How many shards do you have?

Nikesh · February 19, 2019, 4:53am

Thanks David for the response.
Unfortunately, it's not an typo. It is common case for my users to have 5000-7000 fields per index.
I have configured 2 primary shards and 1 replica per index.

DavidTurner · February 19, 2019, 9:06am

There's a good reason that the default limit in Elasticsearch is 1000 fields per index. I recommend working towards respecting that limit.

I meant in total. How many shards do you have in total? As in, what does GET _cluster/health report?

Nikesh · February 19, 2019, 10:05am

My Cluster consists of 1300 shards!

DavidTurner · February 19, 2019, 12:17pm

This also sounds like too many for your cluster size. See this article for more detail:

In particular:

A good rule-of-thumb is to ensure you keep the number of shards per node below 20 per GB heap it has configured.

Thus with 8GB of heap you should aim to limit yourself to 160 shards per node.

Nikesh · February 28, 2019, 12:09pm

Thanks David for your continuous help!

Is there a similar thumb rule as to how much GB of data a particular shard can hold?

DavidTurner · February 28, 2019, 12:23pm

From the very same article:

Aim to keep the average shard size between at least a few GB and a few tens of GB. For use-cases with time-based data, it is common to see shards between 20GB and 40GB in size.

system · March 28, 2019, 12:23pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Cluster stuck on high JVM heap usage Elasticsearch	4	975	July 5, 2017
35 shards but maxing out JVM heap Elasticsearch	12	4268	April 5, 2018
Jvm Heap Size & Indexing Perfmance Problem Elasticsearch	1	489	March 11, 2020
Elasticsearch Heapsize query Elasticsearch	3	746	February 7, 2020
Elastic Search heap alert Elasticsearch	13	1681	December 17, 2018

Elasticsearch Cluster turns Red - Is JVM Heap main culprit?

Related topics