Difference between snowball stemmer and regular stemmer


Elasticsearch (and Lucene ?) has two stemmer token filters: the snowball one and the regular one.

It seems in my tests that the regular stemmer is under the hood the snowball stemmer.

It this really the case ?

For instance, these two settings give the same stemming in my tests:

    { "type": "stemmer", "language": "italian" }
    { "type": "snowball", "language": "italian" }

For example, as per the doc, the type:stemmer with language italian seems to be a snowball stemmer as the link goes to snowballstem.org.

So question: are the snowball and the regular stemmer the same for some language config ?
I have the case with italian and french (as in the screenshot).

1 Like

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.