Frequent GC and OOM due to many fields


(Murali Krishna P) #1

We have an Elasticsearch installation on a tiny box with 256MB of heap space. There are many shards (400+) on a single node and a total of 8k+ fields. We see GC running every minute and frequent OutOfMemoryErrors. This may be expected with such low memory and such a high number of fields, but I wanted to know whether there are knobs to control memory usage.

Heap dump analysis shows that most of the heap is taken up by byte arrays:

Class Count
org.apache.lucene.util.BytesRef 61009
org.apache.lucene.util.fst.FST 13753

Mainly Referenced by:
Class Count
org.apache.lucene.codecs.blocktree.FieldReader 41259
org.apache.lucene.util.fst.FST 13753

Is there a way to control these data structures in Lucene/ES?


(Isabel Drost-Fromm) #2

Can you explain your use case? Having so many shards, and so many fields on one single node with such low memory seems odd.

For general hints on sizing see https://www.elastic.co/blog/found-sizing-elasticsearch and https://www.elastic.co/guide/en/elasticsearch/guide/current/hardware.html

Hope this helps,
Isabel


(Murali Krishna P) #3

Thanks for the response.
You can think of these as time series indices, one index per day. We wanted to see how far we can push it before we start deleting the old indices. Please note that performance is not a consideration. So, I tried changing term_index_divisor to see whether we could reduce what is loaded into memory, but that support has been removed: https://github.com/elastic/elasticsearch/pull/4379/commits/6c189310b9b299defc0746576c7d91d4c5c3d576

Any other way to tune the memory usage in ES or Lucene would be really helpful.


(Jörg Prante) #4

It's not a question of tuning. You squeeze 400+ shards into a single node with small heap space. Just use 1 shard and you will be happy.
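
To put the numbers from this thread in perspective, here is a rough back-of-the-envelope sketch. It assumes exactly 400 shards (the original post only says "400+") and simply divides the stated heap size and the heap dump counts by that number; it is an illustration, not a measurement.

```python
# Numbers taken from the posts above.
HEAP_MB = 256            # heap size stated in post #1
SHARDS = 400             # assumption: the thread only says "400+"
FIELD_READERS = 41259    # FieldReader count listed in the heap dump
FSTS = 13753             # FST count listed in the heap dump

# Average share of each resource per shard.
heap_per_shard_mb = HEAP_MB / SHARDS
field_readers_per_shard = FIELD_READERS / SHARDS
fsts_per_shard = FSTS / SHARDS

print(f"heap per shard:          {heap_per_shard_mb:.2f} MB")
print(f"FieldReaders per shard:  {field_readers_per_shard:.1f}")
print(f"FSTs per shard:          {fsts_per_shard:.1f}")
```

Under that assumption each shard gets well under 1 MB of heap on average, while carrying on the order of a hundred FieldReader instances.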


(Christian Dahlqvist) #5

As already suggested, you have far too many shards, which results in a fair amount of overhead. Time-based indices are useful for managing retention, but you probably want to reduce the number of shards per index to 1 and also switch to monthly or weekly rather than daily indices.
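
As a minimal sketch of what that could look like: the snippet below builds a monthly index name from a date and an index template body that sets one primary shard. The `logs` prefix, the `logs-*` pattern and `number_of_replicas: 0` (chosen here because there is only a single node) are hypothetical and not from this thread. The body uses the `template` key as accepted by `PUT /_template/<name>` in Elasticsearch 2.x; it only prints the JSON and does not contact a cluster.

```python
import json
from datetime import date


def monthly_index_name(day: date, prefix: str = "logs") -> str:
    """Return a monthly index name such as 'logs-2016.05'.

    The 'logs' prefix is a hypothetical example, not from the thread.
    """
    return f"{prefix}-{day:%Y.%m}"


# Index template body for Elasticsearch 2.x (uses the "template" key).
# Every new index matching the pattern gets a single primary shard.
template_body = {
    "template": "logs-*",           # hypothetical index name pattern
    "settings": {
        "number_of_shards": 1,
        "number_of_replicas": 0,    # assumption: single-node setup
    },
}

# All days in the same month map to the same index.
print(monthly_index_name(date(2016, 5, 1)))
print(monthly_index_name(date(2016, 5, 31)))

# JSON that could be sent as the body of PUT /_template/<name>.
print(json.dumps(template_body, indent=2))
```

With one shard per index and one index per month, a year of data would be held in 12 shards instead of hundreds.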


(Murali Krishna P) #6

Thanks for the suggestions. Given the heap dump, won't this still happen with one shard that has too many fields, or too many terms for a field? This looks like some kind of data structure that Lucene keeps in memory for the term dictionary, which grows as the number of terms or fields grows. More shards and indices are probably making it worse.

So I am looking for options to reduce what goes into memory. For example, can we increase the number of terms per block in the Lucene postings format?


(Christian Dahlqvist) #7

Which version of Elasticsearch are you using?


(Murali Krishna P) #8

2.3.2


(Christian Dahlqvist) #9

Since you are on Elasticsearch 2.x, doc_values will be enabled by default, which reduces heap pressure. Reducing the number of shards and ensuring the average shard size is in the GB range is therefore the way to go. Please note that a heap size of at least 1 or 2GB is recommended for any kind of production system.

