Help to understand fuzzy score

yeikel · November 21, 2024, 3:22am

Given the following search:

{
  "size": 100,
  "query": {
    "bool": {
      "must": [
        {
          "match": {
            "business_names": {
              "query": "my company",
              "operator": "and",
              "fuzziness": "auto"
            }
          }
        }
      ]
    }
  }
}

I see results like MY LITTLE COMPANY with a higher score than documents that match the input exactly.

How can I formulate the query so that results that match the input exactly are at the top of the results?

One idea is to add a second query that matches exactly and carries a boost, but why is that needed?

dadoonet · November 21, 2024, 7:43am

You can create a bool query with two should clauses: one with fuzziness and one without.

Because the second clause only matches when the terms are identical, documents that match without fuzziness pick up the score from both clauses and rank higher.

An example of this here:

https://gist.github.com/dadoonet/5179ee72ecbf08f12f53d4bda1b76bab

search_kibana_console.txt

### REINIT
DELETE user
PUT user
{
  "mappings": {
    "properties": {
      "name": {
        "type": "text"
      },
      "comments": {

(The preview is truncated here; the full file is in the gist linked above.)
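Applied to the business_names field from the original question, the two-clause approach might look like the sketch below. The boost value of 2 is an arbitrary assumption rather than a recommended setting, and the request body otherwise mirrors the query in the first post.

```json
{
  "size": 100,
  "query": {
    "bool": {
      "should": [
        {
          "match": {
            "business_names": {
              "query": "my company",
              "operator": "and",
              "fuzziness": "auto"
            }
          }
        },
        {
          "match": {
            "business_names": {
              "query": "my company",
              "operator": "and",
              "boost": 2
            }
          }
        }
      ]
    }
  }
}
```

With no must or filter clause, at least one should clause has to match, so the fuzzy clause still decides which documents are returned and the non-fuzzy clause only adds score. Note that a document such as MY LITTLE COMPANY contains both terms unchanged and would match the second clause too; if the aim is to favor the exact phrase, a match_phrase clause in its place is one possible variation, though that is a different technique from the one described above.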

yeikel · November 21, 2024, 12:33pm

Thanks, that's what I hinted at above.

I am still trying to understand why the fuzzy result is scored higher.

Is this due to the relevance of the tokens relative to the index?

dadoonet · November 21, 2024, 12:57pm

Note that the score can depend on the length of the field, the total number of terms in the index, the number of shards, and so on. Many factors are involved.

You can try to understand using "explain": true.
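As a minimal sketch, this is the query from the original question with the explain flag added at the top level of the request body. Nothing else is changed; the smaller size is only there to keep the output readable and is an assumption, not a requirement.

```json
{
  "size": 10,
  "explain": true,
  "query": {
    "bool": {
      "must": [
        {
          "match": {
            "business_names": {
              "query": "my company",
              "operator": "and",
              "fuzziness": "auto"
            }
          }
        }
      ]
    }
  }
}
```

Each hit in the response then carries an _explanation object that breaks the score down per term, which makes it possible to compare the exact match against MY LITTLE COMPANY side by side.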
