Help analyzing cause of OOM

Hi,

On one of our test systems we ran into an OutOfMemoryError that required a full restart of both instances to get the cluster up again.

We have heap dumps enabled and are trying to analyze why this led to an OOM instead of tripping a circuit breaker.

Setup: Elasticsearch 2.3.5, 2 instances, 4 GB OS RAM, 2 GB heap allocated to ES.
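For reference, this is roughly how the heap size and the heap-dump flag are set on a 2.x package install; the file location and dump path below are illustrative, not necessarily what we use:

```sh
# /etc/default/elasticsearch (or /etc/sysconfig/elasticsearch on RPM-based systems)

# Heap size per instance; the 2.x startup scripts read this environment variable.
ES_HEAP_SIZE=2g

# Write a heap dump when the JVM hits an OutOfMemoryError (dump path is just an example).
ES_JAVA_OPTS="-XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=/var/lib/elasticsearch"
```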

Looking at the heap dump, I see that a lot of data is held by one of the tasks; there are already more than 900 tasks, likely because work was queuing up (see the snippet below).
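To confirm the queue build-up the next time it happens, I assume checking the search thread pool with the cat API would show it, if I read the 2.x docs right:

```sh
# Show active, queued and rejected search requests per node (2.x cat API column names).
curl -s 'localhost:9200/_cat/thread_pool?v&h=host,search.active,search.queue,search.rejected'
```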

The heap histogram shows many Bucket objects being allocated.

Following the references from there up to the root object shows the following link chain:

So it seems a complex query with aggregations exhausted the heap.

Is there some breaker that should catch this?

Or is there a setting to limit such queries so they don't overwhelm the instances when an aggregation allocates a very large number of buckets?
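The only knob I've found so far is the request circuit breaker, which can be lowered dynamically, but I'm not sure whether it actually accounts for aggregation bucket allocation in 2.3, so please correct me if this is the wrong setting:

```sh
# Tighten the per-request breaker from its 40% default (the 20% value is just an example).
curl -s -XPUT 'localhost:9200/_cluster/settings' -d '{
  "persistent": {
    "indices.breaker.request.limit": "20%"
  }
}'
```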
