How to define a custom analyzer that emits one term per letter

NgocPham · January 15, 2016, 3:17am

Hi. I'm a newbie with Elasticsearch and I have a question. Please help me.

I'm trying to load data from a Postgres database into Elasticsearch. I want a space inserted between each letter of a value before it goes into the Elasticsearch server.

Example: I have three fields like this:
id    code    name
1     10      John
2     19      Lina

I want spaces inserted into the data of some fields, such as name. It should look like this: John --> J o h n, Lina --> L i n a.
Does anybody have an idea how to do this with an analyzer in Elasticsearch?
I'm using Elasticsearch 1.7.3.
Thanks for your help :x :x

dadoonet · January 15, 2016, 7:44am

I'm not sure I understand what you want to do.

If you want it done before indexing, so that _source reflects those changes, you have to do it before the data reaches Elasticsearch, which means in your client, or in Logstash if you are using Logstash.
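
As a rough illustration of the client-side option, here is a minimal Python sketch. It assumes the rows have already been read from Postgres into dictionaries shaped like the example table; the helper name `space_letters` and the `rows` list are made up for illustration and are not part of any Elasticsearch client.

```python
def space_letters(value: str) -> str:
    """Insert a single space between every character of value."""
    return " ".join(value)


# Hypothetical rows shaped like the example table in the question.
rows = [
    {"id": 1, "code": 10, "name": "John"},
    {"id": 2, "code": 19, "name": "Lina"},
]

# Build the documents to index; only the name field is rewritten.
docs = [dict(row, name=space_letters(row["name"])) for row in rows]

for doc in docs:
    print(doc)
```

Because the change is made before indexing, the spaced value is what ends up stored in _source.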

If you want a field value such as john to become j, o, h, n at index time, then you can look at the ngram tokenizer and set min_gram and max_gram to 1.
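
A sketch of what such index settings might look like, written as a Python dict that can be serialized to JSON and sent as the body of an index creation request. The names `single_letter_tokenizer`, `single_letter_analyzer` and the mapping type `person` are placeholders, and the `token_chars` and `lowercase` choices are assumptions on my side (they drop whitespace and fold case). The `nGram` type name and the `string` field type follow the 1.x conventions, so check them against the docs for your version.

```python
import json


def build_index_body() -> dict:
    """Return index settings and mappings for a one-term-per-letter analyzer."""
    return {
        "settings": {
            "analysis": {
                "tokenizer": {
                    # Placeholder name; emits n-grams of exactly one character.
                    "single_letter_tokenizer": {
                        "type": "nGram",
                        "min_gram": 1,
                        "max_gram": 1,
                        # Assumption: keep letters and digits only, so
                        # whitespace and punctuation produce no terms.
                        "token_chars": ["letter", "digit"],
                    }
                },
                "analyzer": {
                    # Placeholder name; the lowercase filter is optional.
                    "single_letter_analyzer": {
                        "type": "custom",
                        "tokenizer": "single_letter_tokenizer",
                        "filter": ["lowercase"],
                    }
                },
            }
        },
        "mappings": {
            # "person" is a hypothetical mapping type name.
            "person": {
                "properties": {
                    "name": {
                        "type": "string",
                        "analyzer": "single_letter_analyzer",
                    }
                }
            }
        },
    }


print(json.dumps(build_index_body(), indent=2))
```

With this approach _source still contains the original value (John); only the indexed terms are split per letter.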

I'm not sure I understood the use case, though.

NgocPham · January 15, 2016, 9:44am

Thanks for your help. I will try the ngram tokenizer.
