Polish stemming plugin (Stempel 2.4.4) not working on ES 2.4

kuba · February 9, 2017, 10:12pm

Hi, I installed the Stempel plugin and configured an analyzer and mappings. However, the generated tokens seem incorrect.

Elasticsearch version: 2.4
Stempel version: 2.4.4

This is how I set up the analyzer:

{
  "body": {
    "settings": {
      "analysis": {
        "analyzer": {
          "text_analyzer": {
            "tokenizer": "standard",
            "filter": [
              "lowercase",
              "polish_stem"
            ]
          }
        }
      }
    }
  }
}

This is how I test it:

curl -XGET 'localhost:9200/my_index/_analyze?pretty=1' -d '
{
  "analyzer" : "text_analyzer",
  "text" : "malediwy"
}'

Response:

{
  "tokens" : [ {
    "token" : "y",
    "start_offset" : 0,
    "end_offset" : 8,
    "type" : "<ALPHANUM>",
    "position" : 0
  } ]
}

Oh, and this is how I use it in my mapping:

"someTextField": {
  "type": "string",
  "analyzer": "text_analyzer"
}

Indeed, a query for 'y' returns 'Malediwy'! (I'd expect it to match 'Malediwach', 'Malediwami', etc., not 'y'.)
Other words are mangled as well.

Is this a bug or am I doing something wrong?
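
One way to narrow this down might be to run the same word through the plugin's built-in `polish` analyzer, which the Stempel plugin registers alongside the `polish_stem` filter, and compare its tokens with those from `text_analyzer`. The following is only a sketch: the `ES_HOST` variable and the `--max-time` limit are my own additions, and the host is assumed to be the same `localhost:9200` used above.

```shell
# Sanity check (a sketch, not a fix): analyze the same word with the
# Stempel plugin's built-in "polish" analyzer instead of text_analyzer.
# ES_HOST is a hypothetical helper variable; it defaults to the host used above.
ES_HOST="${ES_HOST:-localhost:9200}"

# Same request shape as the test above, but without an index in the URL,
# so the custom index settings are not involved at all.
BODY='{
  "analyzer" : "polish",
  "text" : "malediwy"
}'

# --max-time keeps the call from hanging if the node is unreachable.
curl -s --max-time 5 -XGET "$ES_HOST/_analyze?pretty=1" -d "$BODY" \
  || echo "request failed (is Elasticsearch running on $ES_HOST?)"
```

If the built-in analyzer returns a sensible stem while `text_analyzer` still returns "y", the difference would point at the custom filter chain rather than at the plugin installation itself.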

system · March 9, 2017, 10:13pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
