I'm trying to use Swedish stemming in elasticsearch and I keep getting
problem with it. I could need some advice about how to deal with this stuff.
The main problem is that the stemmers stems some words in a weird way which
makes my hits either go through the roof or not match at all.
At first I used the "Swedish" snowball filter but it seemed to stem a bit
"too hard".
None of these words should really be stemmed to "led". So, I changed to the
"light_swedish" filter instead. It seems to be a bit more conservative with
its stemming which I like:
Am Dienstag, 11. November 2014 14:55:47 UTC+1 schrieb Linus Pettersson:
Hello
I'm trying to use Swedish stemming in elasticsearch and I keep getting
problem with it. I could need some advice about how to deal with this stuff.
The main problem is that the stemmers stems some words in a weird way
which makes my hits either go through the roof or not match at all.
At first I used the "Swedish" snowball filter but it seemed to stem a bit
"too hard".
None of these words should really be stemmed to "led". So, I changed to
the "light_swedish" filter instead. It seems to be a bit more conservative
with its stemming which I like:
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.