Standard set of Analyzers & Tokens

arjunp · October 4, 2018, 8:17pm

Hi All,
I just started learning elastic search and trying to choose the appropriate analyzers and tokens for my search. I have the customer reviews & comments data where I need to perform the search. After analyzing the analyzers, I got confused and concerned because I may choose the wrong set of analyzer which could cause performance issues. Are they any boilerplate settings which is kind of a standard settings please?
My search should be :

case insensitive
match synonyms eg: tv - television, mobile - cellphone
Ignore Present, Past, Future tenses , eg : Sing should match Sang, Catch should match Caught

Thanks in advance

mayya · October 12, 2018, 5:49pm

Hi there,

elasticsearch has a great endpoint _analyze that can show you how your sample text will be analyzed. You can use it to try different analyzers.

Ignore Present, Past, Future tenses , eg : Sing should match Sang, Catch should match Caught

For this you can try English analyzer.

system · November 9, 2018, 5:54pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Analyzer tokenizer in search mode Elasticsearch	5	779	July 5, 2017
Analyzers at Index time and search time are not matching Elasticsearch	1	336	December 28, 2021
Correctly set up index analyzer and search analyzer Elasticsearch	3	760	May 29, 2021
Analyzer and search_analyzer for common tokens Elasticsearch	1	465	November 14, 2017
Deciding correct analyzer for the field mapping & searching Elasticsearch	1	336	July 6, 2017

Standard set of Analyzers & Tokens

Related topics