Terms Agg

aniaks · March 18, 2019, 3:44pm

I set size and shard_size to 4200, but I was still not getting the intended results. So I set both size and shard_size to a smaller number, 500, to see if that made any difference. It did not.

Sorry for the confusion.

Mark_Harwood · March 18, 2019, 3:47pm

What did you get? Were there any failures reported in the response status or the JSON body?

aniaks · March 18, 2019, 3:48pm

I was not getting empty buckets for the terms that have no data. There was no error, though.

Mark_Harwood · March 18, 2019, 3:55pm

So you should have had exactly 4,200 term buckets under each date bucket. How many buckets did you get?
Was this perhaps because some of the date buckets had exactly zero hits, in which case it didn't descend into the child terms aggregation?
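
For reference, a request of the kind being discussed might look roughly like the sketch below. The original request is not shown in this thread, so the field names (`@timestamp`, `my_term`), the aggregation names (`per_day`, `per_term`), the daily interval, and the use of `min_doc_count: 0` to ask for empty term buckets are all assumptions.

```python
def build_request(date_field="@timestamp", term_field="my_term", n_terms=4200):
    """Build a search body: a date histogram with a child terms aggregation.

    All names here are hypothetical placeholders, not taken from the thread.
    """
    return {
        "size": 0,  # only aggregations are wanted, no hits
        "aggs": {
            "per_day": {
                "date_histogram": {
                    "field": date_field,
                    # Assumption: daily buckets. Releases before 7.2 use the
                    # key "interval" instead of "calendar_interval".
                    "calendar_interval": "day",
                },
                "aggs": {
                    "per_term": {
                        "terms": {
                            "field": term_field,
                            "size": n_terms,
                            "shard_size": n_terms,
                            # Ask for terms with zero matching docs as well.
                            "min_doc_count": 0,
                        }
                    }
                },
            }
        },
    }
```

If a date bucket has zero hits, the child terms aggregation under it may simply come back empty, which is the situation the question above is probing for.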

aniaks · March 18, 2019, 4:05pm

That's a good point; I will check from that perspective.

aniaks · March 18, 2019, 4:06pm

One more point to note: for the time frame I am searching, only 6 terms would be returned. In my response, some dates have 4 buckets, some have 5, and some have 6.

Mark_Harwood · March 18, 2019, 4:16pm

One possible factor is that this optimisation avoids even running the query on some shards if they lie outside of the time range for the query.
This would mean that terms that only exist on excluded shards would not appear in results.

If this is the reason behind the absence, I don't suggest you trawl indices outside of your date range just to come up with the missing values. That would be very inefficient compared to an approach where your client deduces the missing terms with some custom logic.
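
One way the client-side logic could look is sketched below: collect every term seen in any date bucket of the response, then give each date bucket a zero count for the terms it lacks. This is only an illustration. The aggregation names `per_day` and `per_term` and the helper `fill_missing_terms` are hypothetical, and it assumes the usual response layout of a date histogram with a child terms aggregation.

```python
def fill_missing_terms(response, date_agg="per_day", terms_agg="per_term"):
    """Return {date: {term: doc_count}} with zero counts for absent terms.

    `date_agg` and `terms_agg` are assumed aggregation names; adjust them
    to match the names used in the actual request.
    """
    date_buckets = response["aggregations"][date_agg]["buckets"]

    def term_buckets(date_bucket):
        # A date bucket may carry an empty (or missing) child aggregation.
        return date_bucket.get(terms_agg, {}).get("buckets", [])

    # Union of every term that appears under any date bucket.
    all_terms = sorted({t["key"] for d in date_buckets for t in term_buckets(d)})

    filled = {}
    for d in date_buckets:
        counts = {t["key"]: t["doc_count"] for t in term_buckets(d)}
        label = d.get("key_as_string", d["key"])
        filled[label] = {term: counts.get(term, 0) for term in all_terms}
    return filled
```

Calling `fill_missing_terms(response)` on the parsed JSON body gives every date the same set of terms. Terms that never appear anywhere in the queried time range still cannot be recovered this way; the client would need its own list of expected terms for that.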

aniaks · March 18, 2019, 6:50pm

If the terms outside the time range are not returned, that would be fine. But in my case, for a given time range, one timestamp has 5 buckets, another has 6, and another has 4. I would expect every timestamp in the time range to have the same number of buckets. Is my understanding right?

Mark_Harwood · March 18, 2019, 8:11pm

Not if the buckets are taken from different (time-based) indices. They each could have a different set of terms.

