Cardinality agg off by one even after precision increase

Mark_Harwood · September 2, 2021, 7:50am

I've set the precision threshold to 6000 or higher, without any change.

The precision threshold doesn't mark the boundary between accurate and inaccurate - just use of counting technique 1 versus counting technique 2. Counting technique 1 is less susceptible to inaccuracies but is still not guaranteed to be fully accurate. That said, it is based on collecting hashes of values which can occasionally collide so I'd have expected an under-count rather than an over-count. One possible explanation is that a value may be held in different field types across indices, in which case string 1234 != integer 1234 when merging results from the different indices.

Topic		Replies	Views
Cardinality Aggregation gives wrong number? Elasticsearch	33	7429	March 7, 2019
Cardinality is more than Count. How to achieve the exact uniq count? Elasticsearch	7	2203	July 5, 2017
Is the precision of cardinality aggregation decided by total unique value count or filtered unique value count? Elasticsearch	5	192	January 10, 2024
Problem cardinality and date_histogram Elasticsearch	3	730	July 5, 2017
Count distinct values lower than doc_count Elasticsearch	9	1517	September 21, 2018

Cardinality agg off by one even after precision increase

Related topics