Terms Aggregation Partitioning + filter buckets

Nam_Nguyen · January 19, 2021, 1:40pm

Hey,

Context

I am currently struggling to create a proper query for the following use case.
Let's say I want a group-by aggregation over the terms matching a regex, e.g. app:.*. I would do the following:

          "terms": {
            "field": "tags",
            "include": "app:.*",
            "size": 300000
          }

This gives me all buckets whose key matches the regex.
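For reference, here is a minimal sketch of the full search body that fragment would sit in, written as a Python dict. The aggregation name `by_tag` and the helper `regex_terms_body` are placeholders I made up for illustration; the field, pattern, and size are the ones from the fragment above.

```python
import json


def regex_terms_body(field="tags", pattern="app:.*", size=300000):
    """Build a search body with a regex-filtered terms aggregation.

    "by_tag" is a hypothetical aggregation name, not from the original post.
    """
    return {
        "size": 0,  # return no hits, only aggregation results
        "aggs": {
            "by_tag": {
                "terms": {
                    "field": field,
                    "include": pattern,  # regex applied to each term
                    "size": size,
                }
            }
        },
    }


if __name__ == "__main__":
    # Print the JSON that would be sent as the body of a _search request.
    print(json.dumps(regex_terms_body(), indent=2))
```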

Why

I have some high-cardinality fields plus nested terms aggregations, which can trip the circuit breaker.

Questions

I have the following questions:

How can I combine the above with partitioning? AFAICT, with partitioning it is not possible to filter which buckets get created, so a bucket is created for every tag. Is there a way to filter them as above?
Something like this?

          "terms": {
            "field": "tags",
            "include_regexp": "app:.*",
            "include": { "partition": 0, "num_partitions": 100 },
            "size": 300000
          }

Does ES recompute the partitions each time we query them (e.g. partition: 1 in one query and partition: 3 in the next), or does it somehow cache the values?
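To make the first question concrete, here is a rough sketch of one possible fallback: request each partition in turn and apply the regex to the bucket keys on the client. It assumes, as far as I can tell from the docs, that `include` takes either a regex or a partition object but not both. The names `by_tag`, `partition_body`, and `matching_buckets` are hypothetical, the response is a hand-written stand-in for a real one, and Python's `re` syntax is not identical to the Lucene regex syntax ES uses (the simple pattern here behaves the same in both). Note that this only bounds the number of buckets per request; ES would still build buckets for non-matching tags inside each partition.

```python
import re


def partition_body(partition, num_partitions, field="tags", size=10000):
    """Build a search body requesting one partition of the terms."""
    return {
        "size": 0,  # return no hits, only aggregation results
        "aggs": {
            "by_tag": {  # hypothetical aggregation name
                "terms": {
                    "field": field,
                    "include": {
                        "partition": partition,
                        "num_partitions": num_partitions,
                    },
                    "size": size,
                }
            }
        },
    }


def matching_buckets(response, pattern="app:.*"):
    """Keep only buckets whose key fully matches the regex (client-side)."""
    regex = re.compile(pattern)
    buckets = response["aggregations"]["by_tag"]["buckets"]
    return [b for b in buckets if regex.fullmatch(str(b["key"]))]


# One request body per partition; each would be sent as its own _search call.
bodies = [partition_body(p, 100) for p in range(100)]

# Hand-written stand-in for the response to a single partition request.
fake_response = {
    "aggregations": {
        "by_tag": {
            "buckets": [
                {"key": "app:web", "doc_count": 3},
                {"key": "env:prod", "doc_count": 5},
            ]
        }
    }
}

if __name__ == "__main__":
    print(matching_buckets(fake_response))
```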

Thanks

system · February 16, 2021, 1:40pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

