How to make shorter (closer) token match more relevant? (edge_ngram)

rebelfreq · September 15, 2020, 3:21pm

Hello,

I'm getting very weird results with edge_ngram tokenizer I'm using for autocomplete. I'm trying to figure out how to make my results more relevant. I copied the example from the elasticsearch documentation.

I have documents with the following descriptions:

"Apples, raw, without skin"
"Apples, raw, golden delicious, with skin",
"APPLEBEE'S, chili"
"Babyfood, fruit, applesauce, junior"

If i search for apple, "APPLEBEE'S, chili" will get higher score than "Apples, raw, without skin"
If i search for apples, "Babyfood, fruit, applesauce, junior" will get higher score than "Apples, raw, golden delicious, with skin"

In both cases I would like to have higher score for the more relevant closer/shorter match (ie. apples when I search for apple or apples

My settings are:

"settings": {
  "analysis": {
    "analyzer": {
      "autocomplete": {
        "tokenizer": "autocomplete",
        "filter": [
          "lowercase",
          "asciifolding"
        ]
      },
      "autocomplete_search": {
        "tokenizer": "lowercase"
      }
    },
    "tokenizer": {
      "autocomplete": {
        "type": "edge_ngram",
        "min_gram": 2,
        "max_gram": 20,
        "token_chars": [
          "letter"
        ]
      }
    }
  }
},

query:

"query": {
    "match": {
      "description": {
          "query": "apple", 
          "operator": "and"
        }
    }
  }

What do I have to do to get the more relevant results score higher?

Thanks,
Gabor

system · October 13, 2020, 3:21pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch - how to make shorter phrase more relevant in result Elasticsearch	2	644	September 13, 2019
Edge_ngram results Elasticsearch	4	360	July 6, 2017
Issue with elasticsearch edge_ngram query Elasticsearch	1	356	May 29, 2020
Which is the best (right) use of NGrams? Elasticsearch	19	5693	July 6, 2017
Autocomplete search Elasticsearch	2	414	July 5, 2017

How to make shorter (closer) token match more relevant? (edge_ngram)

Related topics