Tuning ranking in search result

kentoto · April 24, 2022, 11:42am

doc1 = {"id":1,"text":"tonight"}
doc2 = {"id":2,"text":"tonight tonight"}
doc3 = {"id":3,"text":"tonight tonight tonight"}
doc4 = {"id":4,"text":"tonight and something else"}
doc5 = {"id":5,"text":"tonight and you"}

es.index(index="tonight", document=doc1)
es.index(index="tonight", document=doc2)
es.index(index="tonight", document=doc3)
es.index(index="tonight", document=doc4)
es.index(index="tonight", document=doc5)

Suppose I have the above documents indexed. When I use the below query:

data = json.dumps({
    "query":{ 
        "bool":{
            "should":[
                {
                    "match":{
                        "text": "tonight"
                    }
                }
            ]
        }
    }
})

The hits returned in the order of "tonight tonight tonight", "tonight tonight","tonight", "tonight and you" and "tonight and something else". May I ask if there is a way to make "tonight" as the first hit returned with highest _score? In my real life usecase, I am conducting iterating throught the whole index to find out the most relevant text other than itself, and the document being searched should be returned as the first hit (most matched) if possible. Could someone please give me some ideas on how to query? Thank you!

RabBit_BR · April 24, 2022, 1:12pm

I recommend read articles like this:
https://www.compose.com/articles/how-scoring-works-in-elasticsearch/

In resume:

There are three main factors for a document's score in a given search.

TF (term frequency) – The more a term appears in a single field, the more relevant it is.
IDF (inverse document frequency) – The more documents have the search term, the less relevant it is.
Field Length – Smaller fields are naturally more relevant than larger fields.

system · May 22, 2022, 1:12pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How to give shorter hit a higher score? Elasticsearch	1	229	January 28, 2023
Odd scoring behavior Elasticsearch	7	501	March 22, 2018
Sorting by _score on an aggregation Elasticsearch	4	393	May 5, 2020
A question about custom scoring Elasticsearch	3	1359	March 27, 2017
Help me understand how ES calculate the score to match query Elasticsearch	5	1301	July 6, 2017

Tuning ranking in search result

Related topics