My terms aggregation (on a high cardinality field) with an inner cardinality aggregation takes about 23 seconds to complete (one node, one shard, 300,000 documents, keyword fields).
The equivalent SQL query on SQL Server takes a few seconds at most.
Is it reasonable of me to make that comparison? Maybe SQL Server is just better suited for this kind of query?
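For context, the request looks roughly like this (index and field names are illustrative, not my exact mapping):

```json
POST /orders/_search
{
  "size": 0,
  "aggs": {
    "by_order": {
      "terms": { "field": "order_id", "size": 10000 },
      "aggs": {
        "unique_items": {
          "cardinality": { "field": "item_id" }
        }
      }
    }
  }
}
```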
Try lowering the precision_threshold [1] setting to reduce the memory used to calculate these values for each of your many order_id buckets. The default value is 3,000, and I imagine the average number of items per order falls well below that.
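Something along these lines (field names assumed from your description; the value 100 is just an example):

```json
"aggs": {
  "by_order": {
    "terms": { "field": "order_id" },
    "aggs": {
      "unique_items": {
        "cardinality": {
          "field": "item_id",
          "precision_threshold": 100
        }
      }
    }
  }
}
```

Lower values use less memory per bucket, at the cost of counting accuracy once the true number of distinct values exceeds the threshold.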
Thanks! Didn't know it had this kind of effect on memory consumption!
This does indeed bring my memory consumption below the circuit breaker limit.
Performance wasn't affected that much by this change, but it does help.
Side note - my comparison with SQL Server was flawed because of caching.