Cardinality Aggregation gives wrong number?

Mark_Harwood · February 7, 2019, 12:10pm

Everything Elasticsearch does is designed for scale. We don't want to build data analysis functions that blow up when users provide us with a lot of data.

The world of big data is, by necessity, built on fuzzier constructs [1]. We're using the same algorithms used by the other big data platforms, for exactly the same reasons. As I outlined here, it's a necessary trade-off in the age of big data. You can't expect to beat physics.
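
To make the trade-off concrete, here is a minimal sketch of a HyperLogLog-style counter in Python. It is a toy for illustration only and not Elasticsearch's implementation (the cardinality aggregation is documented as being based on HyperLogLog++, which adds further corrections). The class name, the choice of SHA-1 as the hash, and the default of 4096 registers are all assumptions made for this example. The point is that memory stays fixed at `m` registers however many values arrive, and the price is a small relative error in the result.

```python
import hashlib
import math


class HyperLogLog:
    """Toy HyperLogLog counter: fixed memory, approximate distinct count."""

    def __init__(self, p=12):
        self.p = p                      # number of bits used to pick a register
        self.m = 1 << p                 # number of registers (4096 when p=12)
        self.registers = [0] * self.m   # memory use never grows beyond this

    def add(self, value):
        # Hash the value to 64 bits (SHA-1 is an arbitrary choice for this sketch).
        digest = hashlib.sha1(str(value).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        idx = h >> (64 - self.p)                      # top p bits select a register
        rest = h & ((1 << (64 - self.p)) - 1)         # remaining 64-p bits
        # Rank = position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        if rank > self.registers[idx]:
            self.registers[idx] = rank

    def count(self):
        m = self.m
        alpha = 0.7213 / (1 + 1.079 / m)              # bias constant, valid for m >= 128
        raw = alpha * m * m / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * m and zeros:
            # Small-range correction: fall back to linear counting.
            return m * math.log(m / zeros)
        return raw


if __name__ == "__main__":
    exact = set()
    approx = HyperLogLog()
    for i in range(100_000):
        key = f"user-{i}"
        exact.add(key)        # exact: memory grows with every new distinct value
        approx.add(key)       # approximate: memory stays at 4096 small registers
    print("exact:", len(exact), "estimate:", round(approx.count()))
```

With 4096 registers the expected relative error is on the order of 1.04 / sqrt(4096), roughly 1.6%. The exact `set` has no error but has to remember every distinct value it has seen, which is the cost that does not scale.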

Arguably we could offer a function to guarantee cardinality accuracy at small scale, but small scale is not our mission.

[1] Introduction to Probabilistic Data Structures - DZone
