Hi, we use the JavaScript bulk helper API to bulk-import data into ES.
The import process randomly crashes with an out-of-memory exception (OOME). We do check for errors from the bulk import, but there aren't any.
I'm familiar with backpressure issues in Node.js stream processing, but I can't see any backpressure control mechanism here for the ES bulk import, even though the input comes from a readable stream.
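For context, our setup follows the documented helper pattern, roughly like this (the node URL, file path, and index name are placeholders for our real values, and flushBytes/concurrency are simply the documented defaults written out explicitly):

```js
const { Client } = require('@elastic/elasticsearch')
const { createReadStream } = require('fs')
const split = require('split2')

async function run () {
  const client = new Client({ node: 'http://localhost:9200' }) // placeholder URL

  const stats = await client.helpers.bulk({
    // NDJSON file split into one parsed document per line (placeholder path)
    datasource: createReadStream('./dataset.ndjson').pipe(split(JSON.parse)),
    onDocument (doc) {
      return { index: { _index: 'my-index' } } // placeholder index name
    },
    onDrop (doc) {
      console.error('dropped:', doc.error)
    },
    flushBytes: 5_000_000, // documented default: flush a bulk body at ~5 MB
    concurrency: 5,        // documented default: max 5 bulk requests in flight
    retries: 3
  })

  console.log(stats) // { total, failed, successful, retry, time, bytes, aborted }
}

run().catch(console.error)
```

As far as I understand, the helper consumes the datasource as an async iterator, so reads from the stream should pause while the maximum number of bulk requests is in flight; but I can't tell whether that's actually happening here.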
I have tried to analyze heap dumps taken shortly before the OOME, but so far they haven't given me a clue. I'm not too familiar with Node.js heap dump analysis, however.
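In case it's relevant, a minimal way to capture such a dump on demand looks like this (SIGUSR2 is an arbitrary choice on my part; newer Node versions also offer the --heapsnapshot-near-heap-limit flag to write a snapshot automatically as the heap approaches its limit):

```js
const v8 = require('v8')

// Write a V8 heap snapshot on demand via `kill -USR2 <pid>`;
// open the resulting file in Chrome DevTools > Memory.
process.on('SIGUSR2', () => {
  // Blocks the event loop while writing Heap.<timestamp>.heapsnapshot to cwd
  const file = v8.writeHeapSnapshot()
  console.log(`heap snapshot written to ${file}`)
})
```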
Another interesting fact: I never get these OOMEs from my dev console, only from deployments, even though both target the same ES cluster and the dev environment has memory limits similar to the deployment's.
I should add that I'm comparing behaviour using the same test data set. For a feature-branch deployment, no other actions run in parallel. I have no insight into the cluster status, as that's outside my dev scope.
What is the size of your bulk requests? What is the specification and configuration of your cluster? Which version of Elasticsearch are you using? Is there possibly a difference in the number of concurrent bulk requests between the two scenarios?
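One quick way to rule out a memory-limit mismatch between the two environments would be to log the effective V8 heap limit at startup in both (a minimal sketch; heap_size_limit reflects --max-old-space-size and platform defaults):

```js
const v8 = require('v8')

const limitMb = v8.getHeapStatistics().heap_size_limit / 1024 / 1024
console.log(`V8 heap limit: ${limitMb.toFixed(0)} MB`)
```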