Non-standard analyzer/tokenizer

enp · February 7, 2016, 1:28pm

Hi,

What is the best way to configure a non-standard analyzer/tokenizer that splits not only on whitespace but also on underscores, dashes, and slashes, and excludes numbers, so that only lowercase words of more than 3 letters, containing no digits, remain?

nik9000 · February 7, 2016, 8:46pm

If any of the built-in tokenizers will do, then you can just configure one of them. The Pattern tokenizer lets you define a regex, so it's super flexible. It might be the best fit in your case.
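
For illustration, here is a rough sketch of what such a configuration might look like, written as a Python dict that could be serialized to JSON and sent as index settings. It is one possible approach, not a confirmed recipe from this thread. It assumes a `pattern` tokenizer that splits on whitespace, underscores, dashes, and slashes, followed by the `lowercase`, `pattern_replace`, and `length` token filters. The names `word_splitter`, `drop_numeric`, `min_4_chars`, and `words_only` are made up for the example. The `simulate` function only approximates the intended behaviour locally with Python's `re` module and does not call Elasticsearch.

```python
import json
import re

# Hypothetical index settings; all custom names below are illustrative only.
SETTINGS = {
    "settings": {
        "analysis": {
            "tokenizer": {
                # Split on runs of whitespace, underscores, slashes, and dashes.
                "word_splitter": {
                    "type": "pattern",
                    "pattern": r"[\s_/-]+",
                }
            },
            "filter": {
                # Blank out any token containing a digit; the length filter
                # that follows then drops the resulting empty token.
                "drop_numeric": {
                    "type": "pattern_replace",
                    "pattern": r".*\d.*",
                    "replacement": "",
                },
                # "More than 3 letters" is read here as at least 4 characters.
                "min_4_chars": {"type": "length", "min": 4},
            },
            "analyzer": {
                "words_only": {
                    "type": "custom",
                    "tokenizer": "word_splitter",
                    "filter": ["lowercase", "drop_numeric", "min_4_chars"],
                }
            },
        }
    }
}


def simulate(text):
    """Local approximation of the analyzer above (not Elasticsearch itself)."""
    tokens = [t for t in re.split(r"[\s_/-]+", text) if t]
    tokens = [t.lower() for t in tokens]
    tokens = [t for t in tokens if not re.search(r"\d", t)]
    return [t for t in tokens if len(t) >= 4]


if __name__ == "__main__":
    print(json.dumps(SETTINGS, indent=2))
    print(simulate("Foo_Bar-baz/quux 1234 abc123def Hello"))
```

With this sketch, tokens such as `Foo`, `Bar`, and `baz` are dropped for being too short, and `1234` and `abc123def` are dropped for containing digits. Punctuation attached to a word is not stripped, so the split pattern may need extending for real data. It is worth checking the actual output with the `_analyze` API before relying on it.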
