Elasticsearch Master Node JVM reaching 99%

rsmith013 · July 20, 2020, 2:41pm

I am running a 3 Master 8 Data node cluster.
Elasticsearch Version 7.8.0

The cluster bumps along happily for several days with the regular sawtooth JVM pattern then suddenly ramps up and then hits 99%.

This leads to issues performing normal operations. A restart puts the memory back to normal levels but I can't figure out what causes it. I have seen before that a memory leak in a previous version caused similar issues but I would have thought that the GC would keep the JVM usage within comfortable limits.

DavidTurner · July 20, 2020, 3:50pm

Would you capture a heap dump the next time the heap maxes out like that? Let us know when you've got it and we'll arrange a way to send it in.

rsmith013 · July 30, 2020, 9:28am

@DavidTurner It has happened again on all 3 of my master nodes. I see there is now a 7.8.1, might that fix it?

DavidTurner · July 30, 2020, 9:46am

I doubt it since I don't think anything has changed recently in this area. I asked for heap dumps above -- without them, we can't really say what the problem is, and without knowing that we can't really say anything about a fix.

rsmith013 · July 30, 2020, 9:48am

I have captured one. What is the best way to send it to you.

DavidTurner · July 30, 2020, 10:08am

Great, thanks. I've sent you a direct message with an upload link.

DavidTurner · July 30, 2020, 6:15pm

Looks to me like you're using the third-party ReadonlyREST plugin and it's gone haywire, but I can't say much more than that as it's not something we support. I recommend using the official security features instead, they're included by default, they don't cost anything and AFAIK don't suffer from this kind of problem.

rsmith013 · July 30, 2020, 6:48pm

Thanks for looking into it. I am keeping an eye on the native security. At the moment, we are making use of IP filtering and LDAP integration, which we would lose if we went to stack native security. Although, if this lasts too long, a loss of convenience outweighs a stable cluster.

Thanks,

Richard

Topic		Replies	Views
High node JVM heap cause ES cluster almost stop working Elasticsearch	5	1816	January 27, 2019
Elasticsearch data node JVM Running out of memory Elasticsearch	1	521	April 10, 2020
Elasticsearch 6.8.10 memory leak Elasticsearch	6	1056	November 11, 2020
Persistent high JVM usage issue (all methods tried already) Elasticsearch	0	457	April 28, 2017
Heap memory overflow master nodes (continuous GC) Elasticsearch	3	302	July 11, 2022

Elasticsearch Master Node JVM reaching 99%

Related topics