We are reviewing our ElasticSearch setup & settings, here's a question on
fielddata cache and usage of doc values.
Currently we set fielddata cache to be 30% of the heap size and we've
enabled doc_values for all fields that we want to sort or aggregate (except
2 boolean fields).
As I understand, doc values are a form of on disk cache of field data. So
my questions is, by using doc values, can we reduce the memory fielddata
cache size? Say to 10%, is it a good idea?
Yes , definitely doc values would be a better idea.
As it is not 100% memory resident , it will give a lot better stability and
memory optimization to the system.
On the flip side, performance might go down to , say 10 to 15%.
We are reviewing our Elasticsearch setup & settings, here's a question on
fielddata cache and usage of doc values.
Currently we set fielddata cache to be 30% of the heap size and we've
enabled doc_values for all fields that we want to sort or aggregate (except
2 boolean fields).
As I understand, doc values are a form of on disk cache of field data. So
my questions is, by using doc values, can we reduce the memory fielddata
cache size? Say to 10%, is it a good idea?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.