Its a bit tricky. What you need to do is extend one of elasticsearch built in providers for analyzers/tokenizers and then, within it, create your tokenizer. Then, within the configuration, you can reference that class. Check out the org.elasticsearch.index.analyiss package source code.
On Tuesday, January 18, 2011 at 10:42 PM, PaullyB wrote:
I have my own custom Tokenizer and would like to use it in Elastic
Search. How do I configure it in elasticsearch.yml? Do I need to
reference the fully qualified class name? Also, is there a way to set
parameters of the class in the configuration?