Standard set of Analyzers & Tokens


(arjun) #1

Hi All,
I just started learning elastic search and trying to choose the appropriate analyzers and tokens for my search. I have the customer reviews & comments data where I need to perform the search. After analyzing the analyzers, I got confused and concerned because I may choose the wrong set of analyzer which could cause performance issues. Are they any boilerplate settings which is kind of a standard settings please?
My search should be :

  • case insensitive
  • match synonyms eg: tv - television, mobile - cellphone
  • Ignore Present, Past, Future tenses , eg : Sing should match Sang, Catch should match Caught

Thanks in advance


(Mayya Sharipova) #2

Hi there,

elasticsearch has a great endpoint _analyze that can show you how your sample text will be analyzed. You can use it to try different analyzers.

  • Ignore Present, Past, Future tenses , eg : Sing should match Sang, Catch should match Caught

For this you can try English analyzer.


(system) #3

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.