Index analyzer settings: is there a way to

Michal_Wegorek · February 22, 2012, 11:49am

Is there an index analyzer setting to:

Treat diacritic letters (in my case the Polish diacritic letters ą, ć,
ę, ł, ń, ó, ś, ź, ż) as their US alphabet equivalents during search:
ą -> a
ć -> c
..

What I mean is: when I search with the pattern 'abc', I want to see
'abc' as well as 'ąbc' in the results, but when I search for 'ąbc', I want ES
to find only 'ąbc'.

These settings do not work; I expected asciifolding might do the
trick:

index.analysis.analyzer.default.type: standard
index.analysis.analyzer.default.stopwords: none
index.analysis.analyzer.default.tokenizer: standard
index.analysis.analyzer.default.filter: [standard, lowercase, stop,
asciifolding, porter_stem]
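
To make the desired asymmetric matching concrete, here is a minimal toy model in Python. It is not Elasticsearch code and says nothing about which analyzer settings achieve this in a given ES version; the helper names (fold, index_tokens, build_index, search) are hypothetical. The assumption is that the index keeps both the original token and its folded form, while the search text is only lowercased. Folding on both sides would instead make 'ąbc' match 'abc' as well.

```python
# Toy inverted index illustrating the behaviour asked for in the question.
# Assumption: index time stores BOTH the original and the folded token,
# while search time does no folding at all. All names are hypothetical.

# Explicit table for the Polish letters listed in the question. A table is
# used because "ł" has no Unicode decomposition, so stripping combining
# marks after NFKD normalization would leave it unchanged.
POLISH_FOLD = str.maketrans("ąćęłńóśźż", "acelnoszz")


def fold(token):
    """Replace Polish diacritic letters with their plain equivalents."""
    return token.translate(POLISH_FOLD)


def index_tokens(token):
    """Index-time analysis: lowercase, then emit original and folded forms."""
    token = token.lower()
    return {token, fold(token)}


def build_index(docs):
    """Map each emitted token to the set of document ids containing it."""
    index = {}
    for doc_id, text in docs.items():
        for word in text.split():
            for tok in index_tokens(word):
                index.setdefault(tok, set()).add(doc_id)
    return index


def search(index, query):
    """Search-time analysis: lowercase only, no folding."""
    return index.get(query.lower(), set())


docs = {1: "abc", 2: "ąbc"}
index = build_index(docs)

print(search(index, "abc"))  # both documents
print(search(index, "ąbc"))  # only the document containing 'ąbc'
```

In this model 'abc' finds both documents because 'ąbc' was also indexed under its folded form, while 'ąbc' finds only the document that actually contains the diacritic.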

ES 0.18.7

Cheers!
Michal

kimchy · February 26, 2012, 9:41am

Are you sure you are using a query that also analyzes the search text (query_string, field, text)? If so, can you gist a recreation?
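
Why the query type matters can be sketched with a toy model in Python. This is not Elasticsearch code; the function names (analyze, analyzed_query, raw_term_query) are hypothetical. It assumes an index built with lowercasing and diacritic folding, and compares a query that runs the same analysis on its input with a lookup that uses the search text verbatim.

```python
# Toy model: analyzed vs. non-analyzed lookups against a folded index.
# Assumption: index-time analysis lowercases and folds Polish diacritics.
# All names are hypothetical and only illustrate the idea.

POLISH_FOLD = str.maketrans("ąćęłńóśźż", "acelnoszz")


def analyze(text):
    """Lowercase, split on whitespace, and fold Polish diacritics."""
    return [word.translate(POLISH_FOLD) for word in text.lower().split()]


def build_index(docs):
    """Map each analyzed token to the set of document ids containing it."""
    index = {}
    for doc_id, text in docs.items():
        for tok in analyze(text):
            index.setdefault(tok, set()).add(doc_id)
    return index


def analyzed_query(index, text):
    """Analyze the search text first, then look up each resulting token."""
    hits = set()
    for tok in analyze(text):
        hits |= index.get(tok, set())
    return hits


def raw_term_query(index, term):
    """Look up the search text verbatim, with no analysis applied."""
    return index.get(term, set())


index = build_index({1: "abc", 2: "ąbc"})

print(analyzed_query(index, "ąbc"))  # folded to 'abc', so both documents match
print(raw_term_query(index, "ąbc"))  # 'ąbc' was never indexed, so nothing matches
```

If the search text is not analyzed, a term containing a diacritic never matches a folded index, which can look as if the folding filter itself were not working.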

