First 2 characters have no typo tolerance in App Search

Chenko · March 21, 2024, 5:25pm

Like the title said, we have an issue, on multiple environments and in multiple engines.
Where when we mistype the first or second character App Search does not return any relevant results.

Example we have a document with the title "simple" I mistype it in my query with "sample" this will return NO results, same with other use cases.
we have a document with the title "smile" I mistype it in my query with "stile" it returns no results despite only one character being different.

If I do this with other letters than the first 2 I do get relevant results.

No matter how low I set my precision tuning It does not work.

We have thought this might be because the prefix_length is set to 2 even on the lowest precision tuning.

We would like to be able to continue using App Search's search however also have typo tolerance for the first two characters.

Any help would be appreciated.

Kind regards,
Chenko Mortier

Kathleen_DeRusso · March 21, 2024, 7:20pm

Hi @Chenko thanks for the feedback. In this case, App Search is working as designed.

Here's an example snippet of what this generated query is, of course your fields will be different:

 {
                                "multi_match": {
                                  "query": "park",
                                  "minimum_should_match": "1<-1 3<49%",
                                  "type": "best_fields",
                                  "fuzziness": "AUTO",
                                  "prefix_length": 2,
                                  "fields": [
                                    "world_heritage_site.stem^0.1",
                                    "description.stem^0.24",
                                    "title.stem^0.5",
                                    "nps_link.stem^0.07",
                                    "states.stem^0.28"
                                  ]
                                }

The important point to note is that in the multi_match query that App Search generates, the prefix_length is set to 2. This is described in the parameters for the fuzzy query.

Here's a simple example that illustrates this:

PUT discuss-test/_doc/1
{
  "title": "A tale of two cities",
  "text": "It was the best of times, it was the worst of times"
}

POST discuss-test/_search
{
  "query": {
    "multi_match": {
      "query": "cties",
      "fields": [ "title", "text" ], 
      "fuzziness": "AUTO", 
      "prefix_length": 2
    }
  }
}

If you set the prefix_length to a smaller number then cties will return the document.

App Search is a very opinionated piece of software, and part of what makes it so easy to use out of the box is that opinionatedness. Unfortunately that comes with somewhat less flexibility than using the Elasticsearch search platform directly.

If there are specific queries that you are concerned with you may be able to put in synonyms or curations as a workaround. And of course you always have the option of modifying this query and sending it via the Elasticsearch Search API. But otherwise, it is a technical limitation of App Search.

Kathleen_DeRusso · March 21, 2024, 9:02pm

FYI - this has now been clarified in our documentation

system · April 18, 2024, 9:02pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Precision tuning does not affect prefixing Elastic Search elastic-app-search	2	379	March 15, 2022
Multi-word search Elastic Search elastic-app-search	3	448	January 3, 2021
Reasonable values for fuzziness, prefix_length, and max_expansions Elasticsearch	2	586	July 6, 2017
Elasticsearch fuzziness with multi_match and bool_prefix type Elasticsearch	1	644	December 30, 2021
Quotation Marks + Prioritizing Exact Matches Elastic Search elastic-app-search	5	1503	November 19, 2021

First 2 characters have no typo tolerance in App Search

Related topics