Aggregation on top N results

polya · January 7, 2016, 11:44pm

This comes from http://stackoverflow.com/questions/25429903/limiting-aggreation-to-the-top-x-hits-in-elasticsearch which I have been trying to solve as well.

Question: Is there any way to build a query whose hit count has an upper limit N, so that an aggregation can be limited to those top N results? And if so, how?
Just to clarify, the aggregation needs to be done on the top hits of the scope query, rather than accessing the top hits of each bucket, which (if I am right) is what the top_hits aggregation provides. In other words, is it possible to have a sub-aggregation of the top_hits aggregation? If so, how?

softwaredoug · January 8, 2016, 12:09am

Maybe you want the experimental sampler aggregation? I have an example
using US high school data that uses a sampler aggregation to answer
questions about students most similar to one under analysis. It uses a
sampler aggregation to learn the predominant characteristics of the N most
similar students.

github.com

o19s/student-dropout-predictor/blob/master/simStudents.py

from elasticsearch import Elasticsearch
from elasticsearch.exceptions import TransportError

es = Elasticsearch()


from sys import argv

mostSimilar = 250

incomeCategories = [None, "None", "$1000 or less", "$1001-$5000", "$5001-10000", "$10001-$15000", "$15001-$20000", "$20001-25000",
                    "$25001-$35000", "$35001-$50000", "$50001-$75000", "$75001-$100000", "$100001-$200000", "> $200000"]

query = {
    "query": {
        "function_score": {
            "functions": [
                {"gauss": {
                    "BYINCOME": {
                        "origin": int(argv[1]),

This file has been truncated.
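Because the excerpt stops before the aggregation part, here is a minimal sketch of the general shape of a sampler request. It is not the code from the linked file. The helper name `build_top_n_agg_body`, the aggregation names `top_n` and `by_field`, and the choice of a `terms` sub-aggregation are all assumptions made for illustration. Note that `shard_size` limits the top-scoring documents collected on each shard, so on a multi-shard index the sample can exceed N in total.

```python
def build_top_n_agg_body(scope_query, n, field):
    """Build a search body that aggregates only over the best-scoring hits.

    Hypothetical helper for illustration: the sampler aggregation keeps up to
    `n` top-scoring documents per shard, and the nested terms aggregation is
    computed over that sample instead of over every matching document.
    """
    return {
        "size": 0,  # only the aggregation result is wanted, not the hits
        "query": scope_query,
        "aggs": {
            "top_n": {  # assumed name for the sampler bucket
                "sampler": {"shard_size": n},
                "aggs": {
                    "by_field": {  # assumed name for the sub-aggregation
                        "terms": {"field": field}
                    }
                },
            }
        },
    }


# Example: summarise BYINCOME over the 250 most relevant documents per shard.
body = build_top_n_agg_body({"match_all": {}}, 250, "BYINCOME")
```

The resulting dictionary would be passed as the request body of a search call on the client, for example `es.search(index=..., body=body)` on client versions that accept a `body` argument. On a single-shard index the sample corresponds to the top N hits of the scope query.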

(If you're interested, I'll be demoing this at ElasticOn)

Doug
