OOM for ES: fielddata.cache.size and breaker.fielddata.limit doesn't work

asaraseka · June 15, 2018, 9:25am

Hello!
We have a cluster with 3 data, 3 master and 2 client nodes. We use it as a reporting cluster, running big queries with aggregations.
The ES heap uses 50% of the VM's RAM.
We get OOM errors from time to time (they seem to coincide with the reporting queries). To prevent this we set:
indices.fielddata.cache.size: 40%
indices.breaker.fielddata.limit: 45%
on all nodes in the cluster. But it doesn't seem to work: we still get OOMs and see nothing related to CircuitBreakingException in the logs. Please advise how to prevent OOMs in the cluster (can the cluster refuse to execute killer queries?). We know that adding more memory is an option.
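
One way to check whether the breakers are doing anything at all is to look at the per-node breaker stats (`GET _nodes/stats/breaker`) and compare each breaker's estimated size and `tripped` counter against its limit. Below is a minimal Python sketch of that check. The response shape is what I would expect from that endpoint, but I have not verified it against 2.4.4 specifically, and the sample data (node name, byte counts) is made up for illustration. `summarize_breakers` is a hypothetical helper, not part of any Elasticsearch client.

```python
import json

# Hypothetical sample shaped like a `GET _nodes/stats/breaker` response.
# The node id, node name and all numbers are invented for illustration.
SAMPLE = json.loads("""
{
  "nodes": {
    "node-id-1": {
      "name": "data-1",
      "breakers": {
        "fielddata": {
          "limit_size_in_bytes": 4000000000,
          "estimated_size_in_bytes": 1000000000,
          "tripped": 0
        },
        "request": {
          "limit_size_in_bytes": 3000000000,
          "estimated_size_in_bytes": 600000000,
          "tripped": 0
        }
      }
    }
  }
}
""")


def summarize_breakers(stats):
    """Return one row per (node, breaker) with its usage ratio and trip count."""
    rows = []
    for node in stats.get("nodes", {}).values():
        # Sort breaker names so the output order is stable.
        for name, breaker in sorted(node.get("breakers", {}).items()):
            limit = breaker.get("limit_size_in_bytes", 0)
            used = breaker.get("estimated_size_in_bytes", 0)
            ratio = used / limit if limit > 0 else 0.0
            rows.append({
                "node": node.get("name", "?"),
                "breaker": name,
                "used_ratio": round(ratio, 3),
                "tripped": breaker.get("tripped", 0),
            })
    return rows


if __name__ == "__main__":
    for row in summarize_breakers(SAMPLE):
        print("{node} {breaker}: {used_ratio:.1%} of limit, tripped {tripped} times".format(**row))
```

If `tripped` stays at 0 on every node while the nodes still run out of memory, that would be consistent with the missing CircuitBreakingException in the logs and would suggest the heap is being consumed by something these breakers do not account for.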

Christian_Dahlqvist · June 15, 2018, 9:27am

Which version of Elasticsearch are you using?

asaraseka · June 15, 2018, 9:41am

  "version" : {
    "number" : "2.4.4",
    "build_hash" : "fcbb46dfd45562a9cf00c604b30849a6dec6b017",
    "build_timestamp" : "2017-01-03T11:33:16Z",
    "build_snapshot" : false,
    "lucene_version" : "5.5.2"
  }

Christian_Dahlqvist · June 15, 2018, 9:51am

That is a very old version, and I do not remember what the limitations were back then. I recall there have been a number of improvements to circuit breakers across more recent versions (e.g. this one) so I would recommend upgrading.

asaraseka · June 15, 2018, 10:14am

Thank you for your reply, Christian.
We have plans for upgrading our ES production and reporting clusters, but this will not happen soon ...

Christian_Dahlqvist · June 15, 2018, 10:22am

Then I suspect adding more memory and/or nodes will be the best way to go. Note that coordinating-only nodes can have their heap set higher than 50% of total RAM, as they do not rely on the file system cache the same way data nodes do.

