I'm building a multi-node elk stack which is going to use the Hyphenation decompounder token filter for multiple languages. The official documentation states, that to use the filter the hyphenation_patterns_path
needs to be provided. The path should point to an Apache FOP (Formatting Objects Processor) XML hyphenation pattern file. In my case, I downloaded the zip archive that was linked to by the documentation offo-hyphenation_v1.2.zip. The archive is filled with a bunch of hyphenation XML files which need to be stored on the cluster. For this to happen I have the following questions:
- Do the files need to be provided on each node of the cluster? In my case, I'm using two master- and one data node.
- What is the best way or working way, to put the files onto the nodes? I've tried to use ConfigMaps to mount the files to the nodes. But since I want to use more than a dozen hyphenation files I would have to create a ConfigMap per XML file which would create a lot of extra configuration even more so if the files are required on every node of the cluster.
My cluster is running version 8.14 of Elastic Search.
I found this thread on the forum that had the same goal as me, but unfortunately never received a reply: