Standard set of Analyzers & Tokens

Hi All,
I just started learning elastic search and trying to choose the appropriate analyzers and tokens for my search. I have the customer reviews & comments data where I need to perform the search. After analyzing the analyzers, I got confused and concerned because I may choose the wrong set of analyzer which could cause performance issues. Are they any boilerplate settings which is kind of a standard settings please?
My search should be :

  • case insensitive
  • match synonyms eg: tv - television, mobile - cellphone
  • Ignore Present, Past, Future tenses , eg : Sing should match Sang, Catch should match Caught

Thanks in advance

Hi there,

elasticsearch has a great endpoint _analyze that can show you how your sample text will be analyzed. You can use it to try different analyzers.

  • Ignore Present, Past, Future tenses , eg : Sing should match Sang, Catch should match Caught

For this you can try English analyzer.

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.