Field stats search in ES 6.X?

fassisrosa · March 2, 2018, 9:21pm

Hi there,

In previous versions of ES, fields stats were used internally by ES to determine which indexes to use when searching. A typical example for timeseries indexes would be to search on a time range, internal field stats would be used to not even attempt searching on indexes for which the time range did not match. The field stats API is gone in more recent versions of ES. Two questions:

*) Does ES still do a smart selection of indexes to avoid touching ES on time range queries based on field statistics for each index?

*) If I want to have access to those per index field stats, is there any way to do it in 5.6.X and 6.X?

Thanks,

Francisco.

polyfractal · March 3, 2018, 6:50pm

Yep! As part of the change, we introduced a new pre-filter phase that executes to find "matching" shards. Each shard can evaluate the query from a high-level and see if it potentially has matching documents (e.g. has documents in the correct time range). The shards that don't have any potentially matching docs will be skipped for the main search phase.

More details here: https://github.com/elastic/elasticsearch/pull/25658

If you need the field-stats style data, the best way to do it now is just via an aggregation for most of the stats (doc counts, min/max time range, etc). You can also use TermVectors if you need stats about the terms themselves.

fassisrosa · March 3, 2018, 7:07pm

Thanks so much for your answer! Exactly what I was looking for!

polyfractal · March 3, 2018, 7:20pm

Np, happy to help!

system · March 31, 2018, 7:20pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
FieldStats support Elasticsearch	11	1483	January 3, 2018
Field_stats_api deprecated? Elasticsearch	4	1102	October 30, 2017
How to retrieve field statistics now that _field_stats is deprecated Elasticsearch	3	883	November 23, 2017
Efficient time-based multi index searches in ES6 Elasticsearch	5	435	August 22, 2018
Performance querying time-based indices in a date range Elasticsearch	3	2374	August 3, 2020

Field stats search in ES 6.X?

Related topics