Different counts of unique values in Kibana and CSV-export of the raw data

Mario_Lie · March 14, 2023, 12:24pm

Hello Community!

I am using Kibana v 7.17.9

I count unique values for various log variables (Visualize->Metric->unique count). But the number of unique values differs slightly, depending on whether I use Kibana or export the same database as .csv-file and edit it with another statistics program (R, Python).

Example:

Total hits in Kibana 3567 -> unique count of user_id: 3267

Total count in .csv-file/Python 3567 -> unique count of user_id: 3275

Does anyone know reasons why this is happening?

Thanks a lot!

Mario

system · April 11, 2023, 12:24pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

drewdaemon · April 12, 2023, 5:17pm

Hi @Mario_Lie ! Sorry you didn't get a reply here sooner.

Under the hood, Kibana uses Elasticsearch's cardinality aggregation to generate that unique count number. Since Elasticsearch is a distributed data store, computing a true cardinality is difficult and precision requirements need to be balanced with cluster load.

See this technical explanation for specific details, including the actual algorithm ( HyperLogLog++) that is being used.

Does this help?

Mario_Lie · April 13, 2023, 10:05am

Hi @drewdaemon! Thanks a lot for your response and the links. That was exactly the information I needed.

Topic		Replies	Views
Unique value counts in kibana Kibana	5	84909	December 18, 2017
Difference between using (in the Field Metric) Count and Unique Count Kibana	4	1317	August 17, 2021
Unique count > Count in kibana Elasticsearch	3	1905	February 15, 2019
Problem with unique count and cardinality Elasticsearch	4	208	May 1, 2024
How to perform calculation for the unique count document in kibana? Kibana	4	1802	August 14, 2019

Different counts of unique values in Kibana and CSV-export of the raw data

Related topics