I have added an Edge Ngram Token Filter to my analyzer, ngram_back_fa. Here is my analyzer:

"ngram_back_fa": {
  "tokenizer": "standard",
  "filter": [
    "lowercase",
    "decimal_digit",
    "trim",
    "reverse",
    "edge_ngram_filter",
    "reverse"
  ]
}
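For context, the reverse → edge_ngram → reverse filter chain effectively turns each token into its suffix n-grams. A minimal Python sketch of that behavior (min_gram=2 and max_gram=5 are assumed values, since the edge_ngram_filter definition is not shown above):

```python
def back_edge_ngrams(token, min_gram=2, max_gram=5):
    # Simulate reverse -> edge_ngram -> reverse:
    # edge n-grams of the reversed token, reversed back, are suffixes.
    rev = token[::-1]
    grams = [rev[:n] for n in range(min_gram, min(max_gram, len(rev)) + 1)]
    return [g[::-1] for g in grams]

print(back_edge_ngrams("homecoming"))  # ['ng', 'ing', 'ming', 'oming']
```

So a query term ending in "coming" can match the indexed suffix grams of "homecoming", which is presumably the intent of the back-ngram subfield.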
Here is my field, title_en:

"title_en": {
  "type": "text",
  "analyzer": "title_en_analyzer",
  "boost": 40,
  "fields": {
    "with_back_ngram": {
      "type": "text",
      "analyzer": "ngram_back_en",
      "boost": ngram_boost
    }
  }
}
And my search query is like this:

{
  "query": {
    "dis_max": {
      "queries": [
        {
          "multi_match": {
            "query": "HomeComing",
            "type": "best_fields",
            "fields": [
              "title_en.with_back_ngram"
            ]
          }
        }
      ]
    }
  }
}
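To inspect what tokens the subfield's analyzer actually emits for the query term, the _analyze API can be used (the index name my_index is a placeholder):

```json
GET /my_index/_analyze
{
  "analyzer": "ngram_back_en",
  "text": "HomeComing"
}
```

Comparing its output with the tokens shown in the Explain output should make it clear which grams are being scored.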
When I use the Explain API to see how the score is calculated, I see that the tokens generated by the token filter are apparently treated like synonyms of the original tokens, and therefore the scores are really low. You can see the result of the Explain API in the image below:
How can I make Elasticsearch stop this and treat these tokens (generated by the token filter) the same way it treats the tokens generated by the tokenizer?