Client node crash with OOM exception

rkalhans · April 28, 2016, 5:20am

Hello,

We have set up a cluster with 3 master nodes, 5 client nodes, and 8 data nodes running ES 1.7.1. The client nodes are crashing with an OOM exception. According to the thread dump taken at the time of the crash, nearly 35 threads were active on the node.

Here is the thread dump.

The crash happened on 2 of the 5 client nodes. In total, 24 client threads are bulk indexing in parallel across the entire cluster, and they only contact the client nodes.

danielmitterdorfer · May 2, 2016, 7:23am

Hi,

35 threads is not particularly many for a server process. From the thread dump I can only see that a few HTTP handler threads are allocating buffer memory (presumably for incoming bulk requests). I suggest that you take a closer look at the heap dump instead (for example with Memory Analyzer), which should reveal why the client nodes need so much memory. My guess is that you're sending or buffering too much bulk index traffic for the amount of memory you've allocated. This means you need to either increase the system capacity or reduce the load on the client nodes. You can try to:

Reduce bulk request size
Reduce the number of clients sending bulk index requests
Reduce incoming queue lengths
Increase heap memory on the client nodes
Add more data nodes (based on the premise that they cannot cope with the amount of traffic sent and they put backpressure on the client nodes)

But these are just pointers based on my assumption above. You need to check the heap dump and run further tests. I hope this gets you started.
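To illustrate the first suggestion (reducing bulk request size), here is a minimal sketch in Python of splitting a stream of documents into smaller bulk bodies before they are sent. The function name `chunk_bulk_actions` and the default limits (5 MB, 1000 documents) are assumptions chosen for illustration, not recommended values; the right limits depend on your heap size and document shape. The sketch only builds the newline-delimited bulk bodies and leaves the actual HTTP call to whatever client you already use.

```python
import json
from typing import Dict, Iterable, Iterator


def chunk_bulk_actions(
    docs: Iterable[Dict],
    index: str,
    doc_type: str,
    max_bytes: int = 5 * 1024 * 1024,  # assumed cap per bulk body, tune for your heap
    max_docs: int = 1000,              # assumed cap on documents per bulk body
) -> Iterator[str]:
    """Yield newline-delimited bulk bodies, each within the given limits.

    A single document larger than max_bytes is still emitted, alone in
    its own body, so that no document is silently dropped.
    """
    lines = []
    size = 0
    count = 0
    for doc in docs:
        # Each bulk entry is an action line followed by the document source.
        action = json.dumps({"index": {"_index": index, "_type": doc_type}})
        entry = action + "\n" + json.dumps(doc) + "\n"
        entry_size = len(entry.encode("utf-8"))

        # Flush the current body before it would exceed either limit.
        if count > 0 and (size + entry_size > max_bytes or count >= max_docs):
            yield "".join(lines)
            lines, size, count = [], 0, 0

        lines.append(entry)
        size += entry_size
        count += 1

    # Emit whatever is left over.
    if lines:
        yield "".join(lines)
```

Each yielded string can be posted as one bulk request. Smaller bodies mean each HTTP handler thread on the client node buffers less at a time, at the cost of more requests overall, so it is worth testing a few sizes rather than assuming one.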

Daniel
