Have any way to customize icu_tokenizer?

I need to add some more keywords to icu_tokanizer, which don't need to tokanize? i had tried kuromoji_tokanizer it has userdictionary, we can add new keyword tokens to userdictionary. can i have any way to do this to icu_tokanizer?

Hi @Ajith_Lal

The ICU tokenizer doesn't appear to have a user dictionary, but uou can use rules files to customize the ICU tokenizer: ICU tokenizer | Elasticsearch Plugins and Integrations [8.12] | Elastic

You can also specify transform rules: ICU transform token filter | Elasticsearch Plugins and Integrations [8.12] | Elastic

I'm not sure if that'll get you where you want to go, though.

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.