Aggregation: Custom score for ordering via sub aggregation

Martin_Berlin · April 3, 2024, 10:16am

I have an Array of Objects with keywords:

"keywords":[
   {
      "name":"Testing Equipment",
      "score":0.999
   },
   {
      "name":"Film Shrinkage Tester",
      "score":0.666
   },
   {
      "name":"Universal Tensile Tester",
      "score":0.332
   },
   {
      "name":"Hydraulic Pressure Testing Machine",
      "score":0.122
   }
]

with the following mapping:

{
   "keywords":{
      "properties":{
         "score":{
            "type":"float"
         },
         "name":{
            "type":"text",
            "fields":{
               "keyword":{
                  "ignore_above":256,
                  "type":"keyword"
               }
            }
         }
      }
   }
}

I want the aggregation of the keywords to be sorted by the sum of its score.

I'm trying like this:

{
   "aggs":{
      "keywords":{
         "terms":{
            "size":10,
            "field":"keywords.name.keyword",
            "order":{
               "keywords_score.value":"desc"
            }
         },
         "aggs":{
            "keywords_score":{
               "sum":{
                  "field":"keywords.score"
               }
            }
         }
      }
   }
}

But it seems like the sub-aggregation keywords_score takes all of the scores related to all keywords into account

For testing I changed the sum sub aggregation to terms:

{
   "aggs":{
      "keywords":{
         "terms":{
            "size":10,
            "field":"keywords.name.keyword"
         },
         "aggs":{
            "keywords_score":{
               "terms":{
                  "field":"keywords.name.keyword"
               }
            }
         }
      }
   }
}

with this output (just to see how what the sum function would take into account) - I would expect that there is only one keyword (the one found in the upper aggregation)

{
   "buckets":[
      {
         "key":"Testing Equipment",
         "doc_count":18707,
         "keywords_score":{
            "doc_count_error_upper_bound":1754,
            "sum_other_doc_count":1701567,
            "buckets":[
               {
                  "key":"Film Shrinkage Tester",
                  "doc_count":18707
               },
               {
                  "key":"Universal Tensile Tester",
                  "doc_count":4305
               },
               {
                  "key":"Hydraulic Pressure Testing Machine",
                  "doc_count":3647
               },
               .....
            ]
         }
      }
   }

That's about the same way the sum get's aggregated - the sum of all other keyword score properties and not only the one associated with the parent aggregation.
Is there a way to do a custom ranking via sub aggregation which takes only the sum of the scores associated with the parent keyword into account?

Thanks for helping!

Martin_Berlin · April 17, 2024, 1:21pm

ping please

system · May 15, 2024, 1:22pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Sort aggregation by score Elasticsearch	1	353	March 23, 2018
Aggregating, picking top value using ordering then ordering buckets by score Elasticsearch	2	363	October 8, 2019
Sort aggregation by other query Elasticsearch	1	361	July 17, 2018
Top hit aggregation with _score sorting Elasticsearch	3	1688	July 5, 2017
How to sort the composite aggregation results by score? Elasticsearch	1	907	October 16, 2018

Aggregation: Custom score for ordering via sub aggregation

Related topics