Top_hits aggregation distinct nested documents

buinauskas_evaldas · December 13, 2016, 6:21pm

Is there a way to remove duplicates from a nested top_hits aggregation? My current query:

GET /analysis_elasticsearch/_search
{
  "size": 0,
  "aggs": {
    "Survey": {
      "nested": {
        "path": "Survey"
      },
      "aggs": {
        "TopHits": {
          "top_hits": {
            "size": 10,
            "_source": ["Survey.SurveyID", "Survey.SurveyName"], 
            "sort": "Survey.SurveyID"
          }
        }
      }
    }
  }
}

It returns the following result:

{
  "took": 13,
  "timed_out": false,
  "_shards": {
    "total": 5,
    "successful": 5,
    "failed": 0
  },
  "hits": {
    "total": 1000,
    "max_score": 0,
    "hits": []
  },
  "aggregations": {
    "Survey": {
      "doc_count": 1000,
      "TopHits": {
        "hits": {
          "total": 1000,
          "max_score": null,
          "hits": [
            {
              "_nested": {
                "field": "Survey",
                "offset": 0
              },
              "_score": null,
              "_source": {
                "Survey": {
                  "SurveyID": "2",
                  "SurveyName": "Eating And Drinking"
                }
              },
              "sort": [
                2
              ]
            },
            {
              "_nested": {
                "field": "Survey",
                "offset": 0
              },
              "_score": null,
              "_source": {
                "Survey": {
                  "SurveyID": "2",
                  "SurveyName": "Eating And Drinking"
                }
              },
              "sort": [
                2
              ]
            },
            {
              "_nested": {
                "field": "Survey",
                "offset": 0
              },
              "_score": null,
              "_source": {
                "Survey": {
                  "SurveyID": "2",
                  "SurveyName": "Eating And Drinking"
                }
              },
              "sort": [
                2
              ]
            },
            {
              "_nested": {
                "field": "Survey",
                "offset": 0
              },
              "_score": null,
              "_source": {
                "Survey": {
                  "SurveyID": "2",
                  "SurveyName": "Eating And Drinking"
                }
              },
              "sort": [
                2
              ]
            },
            {
              "_nested": {
                "field": "Survey",
                "offset": 0
              },
              "_score": null,
              "_source": {
                "Survey": {
                  "SurveyID": "2",
                  "SurveyName": "Eating And Drinking"
                }
              },
              "sort": [
                2
              ]
            },
            {
              "_nested": {
                "field": "Survey",
                "offset": 0
              },
              "_score": null,
              "_source": {
                "Survey": {
                  "SurveyID": "2",
                  "SurveyName": "Eating And Drinking"
                }
              },
              "sort": [
                2
              ]
            },
            {
              "_nested": {
                "field": "Survey",
                "offset": 0
              },
              "_score": null,
              "_source": {
                "Survey": {
                  "SurveyID": "2",
                  "SurveyName": "Eating And Drinking"
                }
              },
              "sort": [
                2
              ]
            },
            {
              "_nested": {
                "field": "Survey",
                "offset": 0
              },
              "_score": null,
              "_source": {
                "Survey": {
                  "SurveyID": "2",
                  "SurveyName": "Eating And Drinking"
                }
              },
              "sort": [
                2
              ]
            },
            {
              "_nested": {
                "field": "Survey",
                "offset": 0
              },
              "_score": null,
              "_source": {
                "Survey": {
                  "SurveyID": "2",
                  "SurveyName": "Eating And Drinking"
                }
              },
              "sort": [
                2
              ]
            }
          ]
        }
      }
    }
  }
}

I'd like to keep only the unique nested records. I know I could use a terms aggregation to get the unique SurveyID values and then a second terms aggregation to get their names, but that doesn't feel right.
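
For reference, the terms-based workaround would look roughly like the sketch below. It assumes that both Survey.SurveyID and Survey.SurveyName are mapped as aggregatable fields (numeric or keyword); the aggregation names SurveyIDs and SurveyNames and the size values are placeholders chosen for illustration. The body goes to the same /analysis_elasticsearch/_search endpoint as the query above.

```json
{
  "size": 0,
  "aggs": {
    "Survey": {
      "nested": {
        "path": "Survey"
      },
      "aggs": {
        "SurveyIDs": {
          "terms": {
            "field": "Survey.SurveyID",
            "size": 10
          },
          "aggs": {
            "SurveyNames": {
              "terms": {
                "field": "Survey.SurveyName",
                "size": 1
              }
            }
          }
        }
      }
    }
  }
}
```

Each SurveyIDs bucket would then stand for one distinct survey, with the single SurveyNames bucket under it carrying the name, so the records have to be reassembled from bucket keys instead of being read from _source.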

Is there a way to get this done using the top_hits aggregation?

Elasticsearch version: 5.1.1
I'm fine with using Painless to get the desired result.

