Get top hits aggregation without aggregating on all values

tpraizler · October 10, 2018, 8:23am

Hey,

Based on the documentation, the top_hits aggregation can be used as a sub-aggregation.

Maybe I can achieve this in a different way, but here is my use case.
My documents have the following structure:

{
    "name": "x"
}

I basically need to get the top 10 documents based on the name field.
The problem is that I have 100M documents and possibly 100K distinct values for the name field.
From what I understand, this is not a good use case for ES, because running an aggregation with more than tens of thousands of buckets will not perform well and will consume a lot of cluster resources.
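
For reference, the approach described above (a terms aggregation on name with top_hits as a sub-aggregation) might look roughly like the sketch below. It only builds the request body and does not contact a cluster. It assumes "top 10 documents" means the top 10 hits per distinct name value, and that name is mapped as a keyword field; the helper name and the aggregation names by_name and top_docs are made up for illustration.

```python
import json


def build_top_hits_per_name(bucket_count=10, hits_per_bucket=10, field="name"):
    """Build a search body: terms aggregation on `field`, top_hits inside each bucket.

    Assumption: `field` is a keyword (non-analyzed) field, so it can be used
    in a terms aggregation.
    """
    return {
        # No top-level hits are needed; only the aggregation results matter.
        "size": 0,
        "aggs": {
            "by_name": {  # hypothetical aggregation name
                # `size` limits how many buckets are returned, not how many
                # distinct values exist in the index.
                "terms": {"field": field, "size": bucket_count},
                "aggs": {
                    "top_docs": {  # hypothetical aggregation name
                        "top_hits": {"size": hits_per_bucket}
                    }
                },
            }
        },
    }


body = build_top_hits_per_name()
print(json.dumps(body, indent=2))
```

The size inside terms caps the number of buckets in the response, so whether this performs acceptably with around 100K distinct names is exactly the open question of this post.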

To sum up: I just need the top X documents based on a field with many possible values.

Is there a "good" way to achieve this?
Is my assumption about the number of buckets wrong?

Thanks!!

system · November 7, 2018, 8:23am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
