I've got a daily index with a couple of high-cardinality fields. Even after reading some threads and the available documentation, I'm not sure how I should proceed. The index gets ~600M docs per day, and the most requested high-cardinality field has ~20M unique values. The index has 6 shards across 8 nodes. I'm trying to lower overall memory usage, and I'm not sure whether adding eager global ordinals to the affected fields' mappings would help in any way, or whether changing the execution hint to map would do any harm. Unfortunately, the queries aggregate those unique values into different term buckets almost every time.
Any suggestions?
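For reference, here is roughly what I'm considering (index and field names below are placeholders for my actual ones):

```json
PUT /daily-index-2024.01.01/_mapping
{
  "properties": {
    "user_id": {
      "type": "keyword",
      "eager_global_ordinals": true
    }
  }
}

GET /daily-index-2024.01.01/_search
{
  "size": 0,
  "aggs": {
    "by_user": {
      "terms": {
        "field": "user_id",
        "execution_hint": "map"
      }
    }
  }
}
```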
If the daily indices are no longer being written to, you can reduce memory usage by force-merging them down to a single segment, as described in this webinar. It can take a while and generate a lot of disk I/O, though.
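As a rough sketch, the force merge call would look something like this (the index name is a placeholder; run it only on indices that are no longer receiving writes):

```json
POST /daily-index-2024.01.01/_forcemerge?max_num_segments=1
```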