Avoid Partial Text Match using n-gram analyser

kashi_mn · July 3, 2017, 6:36am

I have a scenario where in I am using n-gram analyser as Default Analyser and Standard analyser as search analyser in my settings for the field - "title".

The word "NOS" is very specific to our application and I do have few documents with the text starting with "NOS". When I do a search I am getting results of all the documents including the ones that has NOS as a substring. But my expectation is to get only documents which has only NOS text in it.

For example : When I search for "NOS" i get the results containing "diagnosis", "noseband" etc.

n-gram analyser is really working great for rest of the application in terms of partial matching. But I would like to avoid partial matching for such keywords- NOS, BNN, etc.

Any help please?

Ivan · July 4, 2017, 5:28am

Slightly tricky scenario since you want to go against the default matching
algorithm. There is the keyword marker filter, but it will only protect
tokens against further filters that recognize the marker. Stemmers and
synonyms will recognize it, but I doubt n-gram filters will.

One hack would be to apply a pattern replace character filter [1] (which
works pre-tokenization) to convert those words to a nonsensical work
(unicode characters work best), which should be shorter than your minimum
n-gram and then reconvert them with a synonym filter. These special tokens
need to be unique enough to be found in the text (all caps is great).

[1]
https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-pattern-replace-charfilter.html

kashi_mn · July 4, 2017, 6:07am

Thank you @Ivan for the help. I will try pattern replace approach and see how it goes.

system · August 1, 2017, 6:08am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Want to do partial match without using wildcards Elasticsearch	5	429	July 11, 2024
Analyser Search Help Elasticsearch	2	333	July 6, 2017
Partial word search does not work with Ngram Analyzer! Elasticsearch	2	1415	October 11, 2017
Phrase matching using query_string on nGram analyzed data Elasticsearch	4	1625	July 6, 2017
Exact Sub-String Match \| ElasticSearch Elasticsearch	4	4081	December 9, 2019

Avoid Partial Text Match using n-gram analyser

Related topics