The search for full words or terms does not work correctly

Rainbow_Sprinkles · July 22, 2024, 7:56am

Dear talents,
I hope everyone is doing well. Could you kindly review the statement below and suggest possible fixes?
The keyword is "alt legal".
Currently, the search engine is expected to display titles that either contain the full phrase "alt legal" or both the words "alt" and "legal". As a result, the entry "bluh bluh Alt legal" should be ranked at the top. However, the current top candidate is "LegalType," which includes "alT" and "Legal," but they are not a complete word.
Does someone have experience like this?

Christian_Dahlqvist · July 22, 2024, 9:27am

What is the mapping of the fields you are querying?

Rainbow_Sprinkles · July 22, 2024, 10:12am

Thanks for your reply @Christian_Dahlqvist

Both of they are string type.
I replaced displayName with title for more comprehension.

Christian_Dahlqvist · July 22, 2024, 10:49am

What is the mapping of the fields in the index as shown using the get mapping API?

Rainbow_Sprinkles · July 22, 2024, 12:54pm

Here is mapping data:

"description" : {
          "type" : "text",
          "analyzer" : "full",
          "search_analyzer" : "full_search"
        },
        "displayName" : {
          "type" : "text",
          "fields" : {
            "raw" : {
              "type" : "keyword"
            }
          },
          "analyzer" : "full",
          "search_analyzer" : "full_search"
        },

Christian_Dahlqvist · July 22, 2024, 12:58pm

What is the definition of the "full_search" and "full" analysers? Why do you have a separate search analyser that is different from the index analyser?

Rainbow_Sprinkles · July 22, 2024, 1:07pm

Well, would this be an interruption of searching for "alt legal"?
Actually, I am a newbie on this search engine

Christian_Dahlqvist · July 22, 2024, 1:33pm

It is key to how the search is executed, so you need to provide this information.

Rainbow_Sprinkles · July 22, 2024, 2:58pm

Here is detail.

"analyzer": {
        "full": {
          "tokenizer": "full",
          "filter": [
            "lowercase",
            "asciifolding",
            "english_stop",
            "synonym"
          ],
          "char_filter": [
            "html_strip"
          ]
        },
        "full_search": {
          "tokenizer": "search",
          "filter": [
            "lowercase",
            "asciifolding",
            "english_stop",
            "synonym"
          ],
          "char_filter": [
            "html_strip"
          ]
        },
}

Rainbow_Sprinkles · July 22, 2024, 4:20pm

Could you please take a look?

Christian_Dahlqvist · July 22, 2024, 5:40pm

What is the definition of your synonym filter?

Rainbow_Sprinkles · July 22, 2024, 6:00pm

I think there is no synonym of it in this list

Rainbow_Sprinkles · July 22, 2024, 7:06pm

Actually, there is no data that is related to "legaltrek"
Why does this issue happen?

Christian_Dahlqvist · July 23, 2024, 8:39am

I have not had time to recreate this so would recommend that you use the analyze API to analyze and commpare how the search string and indexed matching data are analyzed.

michaelcizmar · July 27, 2024, 11:59am

A couple of notes here. First, you should be aware that if you use a synonym filter last then the synonyms you place in your list will not be filtered. So those synonyms need to be in the final form.

Second, what are search and full tokenizers? There's probably no need for all of these customizations. You should start with out of the box and then make adjustments as necessary to achieve a specific result. My assumption is that y

LegalType is likely getting ngramed. So it is being tokenized ["Leg", "Lega", "Legal", "LegalT", "LegalTy","LegalTyp","LegalType","egalType","galType","alType", "lType","Type", "ype"...."alT"] (note this is not the complete list)

after tokenization you have a token "alT" and "Legal" which are then lowercased. See this: N-gram tokenizer | Elasticsearch Guide [8.14] | Elastic

But as Christian mentioned, use the analyze api to see how your analyzers are actually performing and you'll see the issue.

Topic		Replies	Views
Wrestling with analyzer Elasticsearch	5	433	July 6, 2017
Synonym configuration Elasticsearch	7	1994	July 6, 2017
How to search with synonym analyzer Elasticsearch	4	2525	December 29, 2016
Why doesn't this Synonym work? Elasticsearch	13	3059	July 5, 2017
Synonyms in a query Elasticsearch	7	1393	July 6, 2017

The search for full words or terms does not work correctly

Related topics