We started using aggregations on Elasticsearch 2.3.3 and we are seeing a spike in fielddata usage. I have a few basic questions regarding fielddata.
We are only doing a cardinality aggregation (with the default precision settings) on a single field, which is not analyzed. The field's mapping looks like this:
....
"name" : {
  "type" : "string",
  "index" : "not_analyzed",
  "fields" : {
    "lowercase" : {
      "type" : "string",
      "analyzer" : "lower_keyword"
    }
  }
}
....
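For reference, the aggregation itself is essentially the following (the index and aggregation names here are illustrative, not our real ones):

    POST /our_index/_search
    {
      "size" : 0,
      "aggs" : {
        "distinct_names" : {
          "cardinality" : {
            "field" : "name"
          }
        }
      }
    }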
According to the documentation, since the field is not_analyzed, doc values should be used instead of fielddata. However, looking at our metrics, fielddata for this field is consuming a whopping ~17 GB of heap on each of our hosts (see below). Is there any reason why this can happen?
"indices" : {
"fielddata" : {
"memory_size_in_bytes" : 18575882472,
"evictions" : 0,
"fields" : {
"name" : {
"memory_size_in_bytes" : 18575882472
}
}
}
}
}
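For completeness, we pulled the numbers above from the node stats API with the per-field fielddata breakdown, roughly like this (the fields parameter is standard; the field names are ours):

    GET /_nodes/stats/indices/fielddata?fields=name,name.lowercase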