Ranking short vs. long documents vs. nested docs

randel_2 · June 3, 2018, 4:01pm

With default settings, a search ranks matches in shorter documents above matches in longer ones. The length used for this seems to include all nested documents, not just the one that matched.

Let's say I have:

PUT /books/_doc/1
{
  "editions": [
    {
      "title": "apes"
    }
  ]
}

PUT /books/_doc/2
{
  "editions": [
    {
      "title": "a group of apes"
    }
  ]
}

PUT /books/_doc/3
{
  "editions": [
    {
      "title": "chimpansee kingdom"
    },
    {
      "title": "apes"
    }
  ]
}

Search:

POST /books/_search
{
  "query": {
    "bool": {
      "should": [
        {
          "nested": {
            "path": "editions",
            "query": {
              "match": {
                "editions.title": {
                  "query": "apes"
                }
              }
            }
          }
        }
      ]
    }
  }
}

Right now book 1 ranks highest, followed by 2, with 3 last.
Can I restrict the length effect to the nested document that contains the hit, rather than taking all nested documents into account, so that books 1 and 3 rank highest (equally), followed by 2?
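
To make the desired ranking concrete, here is a minimal Python sketch of what I mean. It is not Elasticsearch's actual scoring: it assumes a plain BM25 formula with the usual k1 = 1.2 and b = 0.75, whitespace tokenization, and statistics computed over all editions together. The names `bm25`, `BOOKS` and `score_books_per_edition` are made up for this illustration. Each edition is scored using only its own length, and a book takes the score of its best matching edition.

```python
import math

K1, B = 1.2, 0.75  # usual BM25 defaults; assumed for this sketch


def bm25(tf, field_len, avg_len, n_docs, doc_freq):
    """Plain BM25 score of a single term in a single field."""
    idf = math.log(1 + (n_docs - doc_freq + 0.5) / (doc_freq + 0.5))
    # Longer fields get a larger length_norm and therefore a lower score.
    length_norm = K1 * (1 - B + B * field_len / avg_len)
    return idf * tf * (K1 + 1) / (tf + length_norm)


# The three example books from this post, as lists of edition titles.
BOOKS = {
    1: ["apes"],
    2: ["a group of apes"],
    3: ["chimpansee kingdom", "apes"],
}


def score_books_per_edition(books, term):
    """Score every edition on its own length; keep the best one per book."""
    # Flatten to (book_id, tokens) pairs; naive whitespace tokenization.
    editions = [
        (book_id, title.lower().split())
        for book_id, titles in books.items()
        for title in titles
    ]
    n_docs = len(editions)
    doc_freq = sum(1 for _, tokens in editions if term in tokens)
    avg_len = sum(len(tokens) for _, tokens in editions) / n_docs

    scores = {}
    for book_id, tokens in editions:
        tf = tokens.count(term)
        if tf == 0:
            continue  # editions without the term do not contribute
        score = bm25(tf, len(tokens), avg_len, n_docs, doc_freq)
        scores[book_id] = max(scores.get(book_id, 0.0), score)
    return scores


scores = score_books_per_edition(BOOKS, "apes")
for book_id in sorted(scores, key=scores.get, reverse=True):
    print(book_id, round(scores[book_id], 4))
```

Under these assumptions books 1 and 3 tie, because their matching editions are identical, and book 2 comes after them because its matching title is longer. That is the ordering I would like the real query to produce.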

