I'm using the snowball analyzer to stem words but there are a couple of words during testing that are not working well.
If put in Mountain Biking or Mountain Bikes then I end up with Mountain Bike but if put Mountain Biker it gets stored and Mountain Biker not Mountain Bike.
Is there another tokenizer or analyzer I should be using in conjunction with snowball?
I'm using the snowball analyzer to stem words but there are a couple of
words
during testing that are not working well.
If put in Mountain Biking or Mountain Bikes then I end up with Mountain
Bike
but if put Mountain Biker it gets stored and Mountain Biker not Mountain
Bike.
Is there another tokenizer or analyzer I should be using in conjunction
with
snowball?
Snowball and porter stemmers will not stem 'er' words. If you have a
handful of cases, you can use the stemming override filter to manually
manage the list of words.
I'm using the snowball analyzer to stem words but there are a couple of words
during testing that are not working well.
If put in Mountain Biking or Mountain Bikes then I end up with Mountain Bike
but if put Mountain Biker it gets stored and Mountain Biker not Mountain
Bike.
Is there another tokenizer or analyzer I should be using in conjunction with
snowball?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.