I'm trying to see what stemming does specifically for different types of stemmers but when I do the following:
curl -X POST \
http://i:9200/_analyze \
-H 'Cache-Control: no-cache' \
-H 'Content-Type: application/json' \
-d '{
"text": "consigned",
"filter": [
"lowercase",
{
"type": "stemmer",
"language": "porter2"
}
]
}'
The results are
"tokens": [
{
"token": "consigned",
"start_offset": 0,
"end_offset": 9,
"type": "<ALPHANUM>",
"position": 0
}
]
}
Whereas in the documentation of the Porter2 stemmer, it states that it should be stemmed to consign
. What am I doing wrong?
The lowercase is working btw (if I use uppercase in the text)