New language - Custom analyzer plugin or token filter

Hello all,

I am trying to analyze text in Slovenian language:

  • tokenizer
  • stop token filter
  • stemmer

I would like to use my own stemmer.

  1. Can I use an existing analyzer and implement just the stemmer (basically another token filter)?
  2. If this can be done - how?
  3. If I have to implement a new analyzer plugin, is there a bare bones clean version of plugin that builds available?

The example that can be found here is a complete elasticsearch dependent project that requires a lot of manual gradle digging.

Any help much appreciated.

Best regards,

Aleš Gorišek

http://twitter.com/bob_si

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.