Custom ukrainian analyzer


#1

Hi.

I am trying to use the analyzer from the Ukrainian Analysis plugin with Elasticsearch 6.0.1.

I cannot figure out how to integrate it with my html_strip and word_delimiter filters.

{
  "settings": {
    "analysis": {
      "filter": {
        "ukrainian_stop": {
          "type":     "stop",
          "stopwords": "_ukrainian_"
          }
      },
      "analyzer": {
        "default": {
          "tokenizer": "standard",
          "char_filter": ["html_filter"],
          "filter": [
              "icu_folding",
              "ukrainian_stop",
              " --- ukrainian stemming --  ???",
              "word_delimiter"
          ]
        }
      }
    }
  }
}

The ukrainian plugin seems not to declare any stemmer, AFAICS. I was using the hunspell based ukrainian stemming but wanted to switch to the morfologik one.


#2

Is it perhaps possible with later version of elasticsearch 6.3.0?

Or is this a limitation of the underlying Lucene module?!

Would be grateful for any pointers, thanks!


(system) #3

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.