_score not as I'd expect

bilpor · November 3, 2017, 12:15pm

Hi All,

I have created an index in Elasticsearch. Using Kibana, I searched a field for a particular word. Kibana returned the expected results, but the _score for each record isn't what I would have expected. For example, one record had the word only once in its text and another had it twice. The one that had it only once got a score of 6.33, whilst the one with two occurrences got a score of 5.726. Other returned records where the word appeared only once had scores of 5.968, 5.379 etc. I thought it might have been taking letter casing into account, but changing the case of the search term made no difference. Can someone explain to me how the _score is obtained? I'd have thought that all the records with only one occurrence, for instance, would have had the same _score.

Thanks
Bill

dadoonet · November 3, 2017, 1:54pm

If you add "explain": true to your query you will get those details. It's a bit complicated though.
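
As a rough sketch, a search body with explain enabled could be built like this. The field name "body" and the search word are hypothetical placeholders, and the resulting JSON is what you would paste into the Kibana console or send to the _search endpoint.

```python
import json


def build_explain_query(field, word):
    """Build a match query body that asks Elasticsearch to explain each hit's score."""
    return {
        "explain": True,  # adds an _explanation section to every hit
        "query": {"match": {field: word}},
    }


# "body" and "elasticsearch" are placeholder values, not taken from the original index.
body = build_explain_query("body", "elasticsearch")
print(json.dumps(body, indent=2))
```

Each hit in the response then carries an _explanation tree that breaks the score down into its parts.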

In short, this is what is taken into account:

frequency of the term you are searching for within the document field: the more, the better. But be aware that text is analyzed before being indexed
frequency of the term you are searching for within the full index: the rarer, the better
length of the field the term appears in: the shorter, the better
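
To see how these factors can combine, here is a minimal sketch of a BM25-style score for a single term. It assumes the BM25 similarity (the default since Elasticsearch 5.0) with its default k1 and b parameters, and all the document counts and field lengths are made up for illustration. Lucene also stores field lengths with reduced precision, so the numbers will not match the output of explain exactly.

```python
import math


def bm25_term_score(freq, field_len, avg_field_len, doc_count, doc_freq, k1=1.2, b=0.75):
    """Score one term against one document field, BM25 style (illustrative only)."""
    # Rarer terms (low doc_freq relative to doc_count) get a higher IDF.
    idf = math.log(1 + (doc_count - doc_freq + 0.5) / (doc_freq + 0.5))
    # Fields longer than average are penalised, shorter ones are boosted.
    length_norm = 1 - b + b * field_len / avg_field_len
    # Term frequency saturates instead of growing linearly.
    tf = freq * (k1 + 1) / (freq + k1 * length_norm)
    return idf * tf


# Made-up numbers: 1000 documents, 10 of them contain the term, average field length 50 terms.
short_once = bm25_term_score(freq=1, field_len=10, avg_field_len=50, doc_count=1000, doc_freq=10)
long_twice = bm25_term_score(freq=2, field_len=200, avg_field_len=50, doc_count=1000, doc_freq=10)
print(short_once, long_twice)
```

With these numbers the short field containing the word once scores higher than the long field containing it twice, which is the kind of result described in the question.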

If you want to leverage the casing, you can index the same text using 2 different analyzers: the standard analyzer and a custom one which uses just a standard tokenizer, so it does not lowercase the text. Then search using a bool query with 2 should clauses: one on the lowercased field (standard analyzer) and one on the case-preserved field (custom analyzer).

This puts the documents that match with the exact case at the top of the list.
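
One possible way to set this up is a multi-field, sketched below. The names "content", "cased" and "case_preserving" are hypothetical, and the mapping uses the typeless format of Elasticsearch 7 and later; older versions nest "properties" under a type name.

```python
import json

# Index settings: a custom analyzer with only the standard tokenizer (no lowercase filter).
index_body = {
    "settings": {
        "analysis": {
            "analyzer": {
                "case_preserving": {"type": "custom", "tokenizer": "standard"}
            }
        }
    },
    "mappings": {
        "properties": {
            "content": {
                "type": "text",
                "analyzer": "standard",  # lowercased terms
                "fields": {
                    # Same text indexed a second time, keeping the original case.
                    "cased": {"type": "text", "analyzer": "case_preserving"}
                },
            }
        }
    },
}


def build_case_boost_query(word):
    """Bool query with two should clauses: any-case match plus exact-case match."""
    return {
        "query": {
            "bool": {
                "should": [
                    {"match": {"content": word}},
                    {"match": {"content.cased": word}},
                ]
            }
        }
    }


search_body = build_case_boost_query("Kibana")
print(json.dumps(search_body, indent=2))
```

A document containing "Kibana" matches both clauses and so adds up two scores, while one containing only "kibana" matches just the first clause.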

Ivan · November 3, 2017, 6:35pm

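Don't forget about the number of shards and IDF values: by default IDF is computed per shard, so the same term can score differently depending on which shard a document is stored in.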
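
A small illustration of the shard effect, with made-up shard statistics and the BM25-style IDF formula (an assumption about the similarity in use):

```python
import math


def idf(doc_count, doc_freq):
    """BM25-style inverse document frequency for one shard."""
    return math.log(1 + (doc_count - doc_freq + 0.5) / (doc_freq + 0.5))


# Hypothetical shards of equal size where the term is unevenly distributed.
idf_shard_1 = idf(doc_count=500, doc_freq=5)   # term is rare on this shard
idf_shard_2 = idf(doc_count=500, doc_freq=20)  # term is more common on this shard
print(idf_shard_1, idf_shard_2)
```

Two otherwise identical documents would get different scores on these two shards. With small indices, using a single shard or searching with search_type=dfs_query_then_fetch (which gathers term statistics across shards first) evens this out.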

system · December 1, 2017, 6:36pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
