Bucket-key=custom, value=array(_ids) aggregation?

ddorian43 · April 28, 2016, 1:46pm

Hi,

I have a mapping with the fields "a string, b string, c string, t timestamp". Can I make a bucket aggregation where I specify the key myself, for example:

key=t(yymmdd):a:b (generate the key from script)

Each bucket should have as its value an array of documents, with the ability to also include _source. If there are a lot of documents in a bucket, it should be able to return the top(x) plus the doc-count.

The buckets should be sortable by a field value (e.g. t timestamp).

Ability to limit the number of buckets.

I also need to get back the min(timestamp) across the whole aggregation (in case the last bucket has too many documents to return the _source of them all).

Is this possible? If not, can I do anything (custom Java?) to make it possible?

I think this can be done using a terms aggregation with a script to generate the initial buckets (https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations-bucket-terms-aggregation.html#search-aggregations-bucket-terms-aggregation-script), plus top-hits as a sub-aggregation to return the documents for each bucket (https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations-metrics-top-hits-aggregation.html).
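A minimal sketch of what that combination could look like as a request body, built as a plain Python dict so it can be inspected without a cluster. The aggregation names `by_key`, `first_t` and `docs`, the helper `build_grouping_request`, and the Painless key script are all assumptions for illustration and are not taken from the thread. The script syntax assumes a recent Elasticsearch version with Painless and `a`/`b` mapped as keyword fields; it has not been run against a cluster, and versions from the time of this thread used a different scripting language.

```python
import json


def build_grouping_request(key_script, max_buckets=10, docs_per_bucket=3):
    """Build a search body: script-keyed terms buckets, each with its top documents.

    All names here are hypothetical; only the aggregation types
    (terms, min, top_hits) come from the Elasticsearch query DSL.
    """
    return {
        "size": 0,  # return aggregation results only, no top-level hits
        "aggs": {
            "by_key": {
                "terms": {
                    # the bucket key is computed per document by the script
                    "script": {"lang": "painless", "source": key_script},
                    # limit the number of buckets returned
                    "size": max_buckets,
                    # sort buckets by their earliest timestamp
                    "order": {"first_t": "asc"},
                },
                "aggs": {
                    # per-bucket minimum of t, used for the bucket ordering
                    "first_t": {"min": {"field": "t"}},
                    # the documents of each bucket, oldest first
                    "docs": {
                        "top_hits": {
                            "size": docs_per_bucket,
                            "sort": [{"t": {"order": "asc"}}],
                            "_source": True,
                        }
                    },
                },
            }
        },
    }


# Assumed Painless script producing keys shaped like t(yymmdd):a:b.
KEY_SCRIPT = (
    "doc['t'].value.format(DateTimeFormatter.ofPattern('yyMMdd'))"
    " + ':' + doc['a'].value + ':' + doc['b'].value"
)

body = build_grouping_request(KEY_SCRIPT)
print(json.dumps(body, indent=2))
```

Each bucket in the response carries its own `doc_count`, so the "top(x) plus doc-count" requirement would not need an extra aggregation.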

But I don't know how to get the "minimum timestamp" of the last bucket. Maybe by sorting by timestamp ascending in the top-hits sub-aggregation (so I get the top documents)?

Does that make sense?
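One possible reading of this requirement is that the client needs the smallest timestamp among the documents actually returned, plus a way to tell which buckets were cut off. The sketch below does that on the client side, assuming the top-hits sub-aggregation is sorted by `t` so that each hit carries its timestamp in its `sort` array. The function name, the aggregation names `by_key` and `docs`, and the sample response values are made up for illustration.

```python
def summarize_buckets(response, agg_name="by_key", hits_name="docs"):
    """Return (earliest returned sort value, keys of truncated buckets)."""
    earliest = None
    truncated = []
    for bucket in response["aggregations"][agg_name]["buckets"]:
        hits = bucket[hits_name]["hits"]["hits"]
        for hit in hits:
            value = hit["sort"][0]  # the timestamp, when sorted by t
            if earliest is None or value < earliest:
                earliest = value
        # more documents in the bucket than top_hits returned
        if bucket["doc_count"] > len(hits):
            truncated.append(bucket["key"])
    return earliest, truncated


# Hand-written sample shaped like a terms + top_hits response (values invented).
sample_response = {
    "aggregations": {
        "by_key": {
            "buckets": [
                {
                    "key": "160428:x:y",
                    "doc_count": 2,
                    "docs": {"hits": {"hits": [
                        {"_id": "1", "sort": [1461851160000]},
                        {"_id": "2", "sort": [1461854760000]},
                    ]}},
                },
                {
                    "key": "160429:x:z",
                    "doc_count": 5,
                    "docs": {"hits": {"hits": [
                        {"_id": "3", "sort": [1461937560000]},
                    ]}},
                },
            ]
        }
    }
}

earliest, truncated = summarize_buckets(sample_response)
print(earliest, truncated)
```

If the minimum over all matching documents is wanted instead, including those not returned by top-hits, a top-level `min` aggregation on `t` next to the terms aggregation would report it directly.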

Thanks

ddorian43 · April 30, 2016, 11:03am

shameless bumping

warkolm · May 4, 2016, 11:18am

Can you reformat your OP? It's hard to see what is happening. Wrap it in code tags.

ddorian43 · May 4, 2016, 10:38pm

Hope it's clearer now.

warkolm · May 4, 2016, 11:21pm

It sounds like you are on the right track regarding scripting, but I can't help there as I don't know much about it.

However, a better solution might be to look at crafting fields with these sorts of values during ingestion; that way it'll be much simpler (and easier on your resources).
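As a rough illustration of that suggestion, the key could be computed once per document before indexing and stored in its own field, so that a plain terms aggregation on that field replaces the script. The field name `group_key` and the helper `add_group_key` are hypothetical, and the sketch assumes `t` arrives as an ISO 8601 string.

```python
from datetime import datetime


def add_group_key(doc):
    """Return a copy of doc with a precomputed t(yymmdd):a:b key."""
    t = datetime.fromisoformat(doc["t"])
    enriched = dict(doc)  # leave the caller's document untouched
    enriched["group_key"] = f"{t:%y%m%d}:{doc['a']}:{doc['b']}"
    return enriched


doc = {"a": "x", "b": "y", "c": "z", "t": "2016-04-28T13:46:00+00:00"}
print(add_group_key(doc)["group_key"])
```

This only helps when the grouping is known ahead of time; every new key shape would need another field and a reindex.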

ddorian43 · May 4, 2016, 11:25pm

The "what/how to group on" is dynamic (it comes from the client side), so I can't do that. I just wanted to know if that's the right way, and it looks like it is.

Thanks
