Getting aggregations data / paginations on aggregation results

ugolas · June 19, 2016, 10:22am

Hi,
We have a lot of records and a lot of duplicates in our storage.

I need to search and return results without duplicates, so I wrote a query that aggregates on a specific term and returns one record per bucket. That way the bucket results contain no duplicates.

Is there a way to add an aggregation that returns facet values for additional terms, but computed only over the bucket results?

My example:
Let's say I have these docs:
{
"id": 123,
"type": "Good",
"timestamp": 12345
}, {
"id": 123,
"type": "Bad",
"timestamp": 12346
}, {
"id": 125,
"type": "Bad",
"timestamp": 12347
}, {
"id": 126,
"type": "Good",
"timestamp": 12348
}

After aggregating on 'id' and sorting by 'timestamp' desc to eliminate duplicates, I get 3 buckets with these records:
'id': 123, 'type': "Bad"
'id': 125, 'type': "Bad"
'id': 126, 'type': "Good"

And I need to get this data:
'type': "Good", 'doc_count': 1
'type': "Bad", 'doc_count': 2
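To make the desired result concrete, here is a minimal Python sketch that computes it in memory from the four example docs. It is not an Elasticsearch query, only an illustration of the semantics being asked for. The helper names `latest_per_id` and `facet_counts` are hypothetical.

```python
from collections import Counter

# The example docs from the post above.
docs = [
    {"id": 123, "type": "Good", "timestamp": 12345},
    {"id": 123, "type": "Bad", "timestamp": 12346},
    {"id": 125, "type": "Bad", "timestamp": 12347},
    {"id": 126, "type": "Good", "timestamp": 12348},
]


def latest_per_id(records):
    """Keep one record per 'id': the one with the highest 'timestamp'.

    Hypothetical helper; mimics "one record per bucket, sorted by
    timestamp desc".
    """
    latest = {}
    for rec in records:
        current = latest.get(rec["id"])
        if current is None or rec["timestamp"] > current["timestamp"]:
            latest[rec["id"]] = rec
    return list(latest.values())


def facet_counts(records, field):
    """Count the values of `field` over the given records (hypothetical helper)."""
    return Counter(rec[field] for rec in records)


deduped = latest_per_id(docs)               # 3 records: ids 123, 125, 126
type_facets = facet_counts(deduped, "type")
print(dict(type_facets))                    # {'Bad': 2, 'Good': 1}
```

The point is that the 'type' counts are taken after deduplication. Counting over all four docs would give Good: 2, Bad: 2 instead.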

Also, is it possible to apply pagination to the aggregation buckets?
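For illustration, this is a rough Python sketch of what paging over buckets would mean if it were done on the client side, after all buckets have been fetched. The function `paginate_buckets` and its 1-based `page` parameter are assumptions made for this example and are not part of any Elasticsearch API.

```python
def paginate_buckets(buckets, page, page_size):
    """Return (items_on_page, total_pages) for a 1-based page number.

    Hypothetical client-side helper: it slices a list of buckets that
    has already been fetched in full.
    """
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be >= 1")
    total_pages = (len(buckets) + page_size - 1) // page_size  # ceiling division
    start = (page - 1) * page_size
    return buckets[start:start + page_size], total_pages


# The three deduplicated buckets from the example, newest first.
buckets = [
    {"id": 126, "type": "Good", "timestamp": 12348},
    {"id": 125, "type": "Bad", "timestamp": 12347},
    {"id": 123, "type": "Bad", "timestamp": 12346},
]

first_page, total_pages = paginate_buckets(buckets, page=1, page_size=2)
second_page, _ = paginate_buckets(buckets, page=2, page_size=2)
print([b["id"] for b in first_page], [b["id"] for b in second_page], total_pages)
```

This only works when the full bucket list fits in memory on the client, which may not hold with a lot of records.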

Thanks!

mainec · June 20, 2016, 8:13am

To my knowledge, not yet. See https://github.com/elastic/elasticsearch/issues/4915 for details.

Hope this helps,
Isabel
