JVM heap memory leak in Elasticsearch 2.4.5

Hi all, I use Elasticsearch 2.4.5.
I've attached an image of the Elasticsearch heap memory.

Eventually I have to restart my server, and then the heap starts growing again.

I have 3 nodes in my cluster, but one of them looks different from the others:

Memory keeps leaking and I don't know what to do.

One interesting thing is the script cache:

curl -XGET 'http://ip:9200/_nodes/stats'

  "script": {
    "compilations": 68717,
    "cache_evictions": 68617
  }

  "script": {
    "compilations": 3945,
    "cache_evictions": 3845
  }

  "script": {
    "compilations": 70015,
    "cache_evictions": 69915
  }

What's wrong?

What kind of script is this? I have seen this before with groovy scripts that were changed over and over again. If this is the case, could you try moving the parts of the script that are changing into parameters so the actual script text would always remain the same?

You should really upgrade; 2.4 has been end-of-life for nearly a year now - Elastic product end of life dates | Elastic

We're working on it. But until we upgrade, I want to fix this problem :slight_smile:

I think Igor was referring to this issue earlier. Have a look at it and see if this applies to how you are using scripts.

Thanks, will try.

Can I manually clear the script cache without a server restart?

And why is it not growing on just one of the nodes?

  "script": {
    "compilations": 98476,
    "cache_evictions": 98376
  }
  "script": {
    "compilations": 3945,
    "cache_evictions": 3845
  }
  "script": {
    "compilations": 99784,
    "cache_evictions": 99684
  }

We have 2 types of queries with Groovy scripts:

$script = new \Elastica\Script('ctx._source.prefix_name = prefix_name;ctx._source.match_name = match_name;', [], 'groovy');
$script->setParam('prefix_name', strtolower(Transliterator::ruToEn($data['name'])));
$script->setParam('match_name', $data['name']);
$client->updateDocument($itemId, $script, self::INDEX_NAME, self::INDEX_TYPE);

And:

$query->addSort(['_script' => [
    'script' => "if (_source.containsKey('other_properties')) {
        for (item in _source.other_properties) {
            if (item.v_label == '".addslashes($row['v_label'])."' && item.k == ".self::VENDOR_PROPERTY_ID.") {
                return 10;
            }
        }
    }
    return 1;",
    'type'  => 'number',
    'order' => 'desc',
]]);

What do you mean by:

try moving the parts of the script that are changing into parameters so the actual script text would always remain the same?

Not putting dynamic values into the script text, and instead passing them through the params block?
Like in the first query example.

If it is what I think it is, the problem is not in the cache, the problem is in compiled scripts that were already evicted from the cache and in the process they leaked memory. Cleaning the cache is not going to fix that.

Exactly! This way the script will never change (only the parameters will), so it will compile only once and be kept in the cache forever. It should increase performance as well, since script compilation is a pretty heavy process.
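Applied to the second query above, that means moving `$row['v_label']` and the vendor id out of the concatenated Groovy source and into a `params` block, so the script text is a constant string. A minimal sketch in plain PHP (untested against a live cluster; the placeholder values and the array keys assume the ES 2.x script-sort DSL):

```php
<?php
// Placeholders standing in for $row['v_label'] and self::VENDOR_PROPERTY_ID.
$row = ['v_label' => 'Some Vendor'];
$vendorPropertyId = 42;

// The Groovy source never changes, so it compiles once and stays cached.
// Groovy exposes params as plain variables (label, vendor_id).
$sort = ['_script' => [
    'script' => "if (_source.containsKey('other_properties')) {
        for (item in _source.other_properties) {
            if (item.v_label == label && item.k == vendor_id) {
                return 10;
            }
        }
    }
    return 1;",
    'params' => [
        'label'     => $row['v_label'],
        'vendor_id' => $vendorPropertyId,
    ],
    'type'  => 'number',
    'order' => 'desc',
]];

// $query->addSort($sort);  // same addSort() call as before
```

Note that no `addslashes()` is needed anymore: the value travels as a parameter, not as part of the script source.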

Thanks, we will try to fix it.

I have one instance in the cluster where cache_evictions for scripts is not growing.

"script": {
    "compilations": 3945,
    "cache_evictions": 3845
  }

But memory still leaks.
What do I need to check in that case?

First I would check the other stats; if they don't grow, analyze a heap dump.

Thanks, it helped! cache_evictions stopped growing.
But unfortunately memory is still growing.

Can you please help me?
All the stats are growing :slight_smile:
Which one should I look at first?

Nodes stats after restart:

Nodes stats after 16 hours work:

I don't see anything particularly wrong in the stats that you sent me. It is normal for heap to grow to a certain degree. The problem is when it grows above 80% and stays there. I see only 28% used on one of the nodes, which doesn't indicate an issue.

I think that really helped.

This is the Elasticsearch heap for the last 7 days.

Thanks a lot!

But I still don't know why one node looks different from the others.

This is the Elasticsearch heap for the last 7 days.

It's possible that nodes have different load. For example if you are running a lot of update operations then the update script will be only executed on the primary shard. If you are only connecting to one node with your client and retrieving a lot of data the load on that node might be higher as well. There could be many reasons for the difference in the behavior. We need to see how shards are allocated between nodes and what roles the nodes play to say for sure.
