Field statistics - sampling

secsubs · August 13, 2015, 6:39pm

Hi,

As I understand it, in Kibana 4 the field statistics are computed on the n records/documents that are returned to the browser (set under Advanced settings -> sampleSize). If this is correct, how does Kibana select n records/docs out of the total query result population? Is any attempt made to ensure it is a truly representative sample, for example a random sample?

Thanks

tbragin · August 14, 2015, 5:23pm

I assume you're referring to top-values by field in the Discover pane? If so, those statistics are based on the 500 documents that appear in Discover. By default, for time-series data, those are the latest documents, but that also changes depending on what searches and filters you've applied in Discover.
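
To make the behavior described above concrete, here is a minimal Python sketch of "top values over the newest N documents". It is not Kibana's code: the `top_values` helper, the `@timestamp` and `status` field names, and the toy data are all assumptions for illustration. Only the 500-document default comes from the reply above.

```python
from collections import Counter

def top_values(docs, field, sample_size=500, time_field="@timestamp", top=5):
    """Top values of `field`, counted over the newest `sample_size` docs only."""
    # Keep only the most recent documents, mimicking a newest-first listing.
    newest = sorted(docs, key=lambda d: d[time_field], reverse=True)[:sample_size]
    counts = Counter(d[field] for d in newest if field in d)
    total = len(newest) or 1  # avoid division by zero on an empty result
    return [(value, count, count / total) for value, count in counts.most_common(top)]

# Hypothetical data: 1,000 older docs with status 200, then 500 newer docs with status 500.
docs = [{"@timestamp": t, "status": 200} for t in range(1000)]
docs += [{"@timestamp": t, "status": 500} for t in range(1000, 1500)]

result = top_values(docs, "status")
print(result)
```

In this toy data set the older `status` 200 documents never enter the statistics, because only the newest 500 documents are counted.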

secsubs · August 14, 2015, 5:56pm

Thanks for that clarification, Tanya. But it sounds like the field statistics can be highly misleading, since they aren't based on a representative sample of the query results.
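
The concern can be illustrated with a small Python sketch that compares a "latest n" sample against a uniform random sample of the same size. Everything here is hypothetical (the `share` helper, the field names, the time-skewed toy data). It only shows why the two sampling strategies can disagree and says nothing about how Kibana is implemented.

```python
import random

def share(docs, field, value):
    """Fraction of docs whose `field` equals `value`."""
    return sum(1 for d in docs if d.get(field) == value) / len(docs)

# Hypothetical, time-skewed population: status 200 early on, status 500 recently.
population = [{"@timestamp": t, "status": 200} for t in range(1000)]
population += [{"@timestamp": t, "status": 500} for t in range(1000, 1500)]

n = 500  # sample size, matching the default mentioned in the thread

# Strategy 1: the n most recent documents.
latest = sorted(population, key=lambda d: d["@timestamp"], reverse=True)[:n]

# Strategy 2: a uniform random sample without replacement (seeded for repeatability).
rng = random.Random(0)
uniform = rng.sample(population, n)

true_share = share(population, "status", 200)
latest_share = share(latest, "status", 200)
uniform_share = share(uniform, "status", 200)

print(f"all docs: {true_share:.2f}  latest {n}: {latest_share:.2f}  random {n}: {uniform_share:.2f}")
```

With data that changes over time, the latest-n estimate reflects only the most recent window, while the random sample should land close to the share across all matching documents.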
