Custom language analyzers on Elastic cloud

Hi,
I use Elasticsearch enterprise version 8.12.2 on EC cloud.
I need to support search over 120 locales for our application. The problem is that I'm unable to find packages on cloud that can enable stemming and stop filters for about 20+ locales.
For example, Slovakia (Slovensko), sq (Albania), az (Azerbaijan) locales doesn't seem to have EC plugin.
Any suggestions on enabling stemming and stop filters for these 20+ locales that don't have inbuilt analyzers in EC cloud?
I know that we can create plugins and upload them to the cloud, but not able to find any easier way of doing it.

The markets (locales) that need custom language analyzers are:
Slovakia, Albania, Azerbaijan, Bosnia and Herzegovina, Belarus, Georgia, Croatia, Israel, Iceland, Montenegro, North Macedonia, Moldova - Moldavian, Serbia, Slovenia, Ukraine, Vietnam - Vietnamese and Kosovo

Your best bet is to create a custom analyzer for each language you need to support. This page on language analyzers has some examples and may help get you started.

Hi Kathleen,
The link you shared points to the built-in analyzers. When I checked some more documentation, I noticed that I need to create .dic and .aff files for the custom analyzers. And then probably create a custom elastic cloud plugin for the language analyzer. Isn't that whole process needed?

There are some plugins available such as this Ukrainkian analysis plugin and then follow the cloud plugin instructions. You're right that for ones that don't have a plugin you may need to make your own.

It's also worth noting that as a cloud customer you should be entitled to support services, so please feel free to contact your support representative if you need additional help installing it!

Thank you Kathleen.

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.