Using the Bulk Indexing API, if my node crashes, my elasticsearch heap memory does not get freed

sahilc2200 · July 18, 2015, 3:54am

So I am using the Bulk indexing API for ES 1.6. If a OOM occurs, then on restart, the node's heap does not get cleared for some reason. Then, OOM's occur much faster. I am not using any other queries except to index my documents, and I am running a two node cluster with 9GB allocated to each machine, and I have to index about 21M documents.

warkolm · July 18, 2015, 4:00am

If it's still OOMing then you likely have too much data in there, but more information will help. However how much data is in the cluster?

sahilc2200 · July 18, 2015, 5:28am

The cluster actually has like 50 GB worth of data, and I am indexing about 150 documents at a time. Each document is about 4KB on average.

warkolm · July 18, 2015, 6:31am

Then that's very odd.

If you start ES and then check _cat/fielddata, what does it report? Also, what's in your logs?

sahilc2200 · July 18, 2015, 6:59am

It tells me that there are 0b allocated for the fielddata. Is that supposed to happen? I am using ElasticHQ for monitoring the cluster, and it shows me that about 80% of the heap is occupied on node restart as well.

warkolm · July 18, 2015, 7:00am

Are you using parent/child or nesting?

Topic		Replies	Views
ElasticSearch can't automatically recover after a big HEAP utilization Elasticsearch	3	383	July 6, 2017
Garbage collection not kicking in - Heap is growing to 98% Elasticsearch	3	930	June 29, 2017
Continuing after java heap space runs out Elasticsearch	7	1079	July 18, 2017
Elasticsearch sizing Elasticsearch	3	545	January 27, 2018
What's eating node's memory? Elasticsearch	10	2956	July 5, 2017

Using the Bulk Indexing API, if my node crashes, my elasticsearch heap memory does not get freed

Related topics