Hunspell analyzer

nicocsgamer · August 23, 2015, 5:06pm

Hey guys,

I would like to configure analyzers for languages which are not supported by ES out of the box: ET, HR, LT, MT, PL, SK, and SL.

So I took a look at the Hunspell Token Filter. As far as I know, the Hunspell Token Filter is only for steeming, but when I take a look at already configured languages, there is not only stemmer, but also stop words, lowercase, other even keywords.
So how analyzers based on hunspell should be correctly configured?
What about this github project https://github.com/elastic/hunspell/tree/master/dicts ?

Thanks.

warkolm · August 23, 2015, 10:11pm

If you want to look at non-english languages then also check out the ICU analyser.

nicocsgamer · August 24, 2015, 7:19am

It's seems the ICU plugin is not used for steeming not for normalizing, no?

Topic		Replies	Views
Use case of multiple Language Analyzer, Hunspell along with Elasticsearch Langdetect Plugin Elasticsearch	13	1054	July 6, 2017
Hunspell filter problem Elasticsearch	5	821	July 5, 2017
Need Hunspell Analysis in Single Token Keyword Elasticsearch	1	328	January 16, 2020
Hunspell russian language. Problem with some grammar cases Elasticsearch docker	2	343	July 18, 2022
Hunspell fails to stem correctly Elasticsearch	1	206	May 30, 2022

Hunspell analyzer

Related topics