I'm setting up the mapping for an index and I have a field containing long German texts.
I would like to use the Compound Filter to split compound words, so that searches also match the "sub-words".
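For context, this is roughly the analysis setup I have in mind, written as a small Python sketch that only builds and prints the settings body. The filter name `german_decompounder`, the analyzer name `german_compound`, and the file path are placeholders I made up. I am assuming the `dictionary_decompounder` filter type with its `word_list_path` parameter is the right choice here.

```python
import json


def build_settings(word_list_path):
    """Build an index settings body that uses a dictionary-based decompounder.

    The filter and analyzer names are made-up placeholders; word_list_path is
    expected to point to the dictionary file relative to the config directory.
    """
    return {
        "settings": {
            "analysis": {
                "filter": {
                    "german_decompounder": {
                        "type": "dictionary_decompounder",
                        "word_list_path": word_list_path,
                    }
                },
                "analyzer": {
                    "german_compound": {
                        "type": "custom",
                        "tokenizer": "standard",
                        "filter": ["lowercase", "german_decompounder"],
                    }
                },
            }
        }
    }


if __name__ == "__main__":
    # Hypothetical path; the real file name depends on the dictionary I end up using.
    print(json.dumps(build_settings("analysis/german_words.txt"), indent=2))
```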
My questions are:
- Where can I find a good dictionary to use for German?
- (But mainly) What format should this file have? A specific extension? One word per line? A comma-separated list without line breaks?
I found a dictionary, but it has a format like this:
I don't really understand the meaning of the suffix after the slash, and I wonder whether Elasticsearch will understand it.
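If the suffixes turn out not to be supported, I suppose I could strip them myself before handing the file to Elasticsearch. Below is a minimal sketch of that idea. It assumes each entry looks like `word/FLAGS` (as in Hunspell-style dictionaries, where the first line may be a plain word count) and that the target format is one word per line in UTF-8. The function names and the sample entries in the usage note are made up for illustration.

```python
def strip_affix_flags(lines):
    """Return the bare words from entries of the assumed form 'word/FLAGS'.

    Blank lines and digit-only lines (such as a leading word count) are
    skipped, and duplicates are dropped while keeping the original order.
    """
    words = []
    seen = set()
    for line in lines:
        entry = line.strip()
        if not entry or entry.isdigit():
            continue
        # Keep only the part before the first slash.
        word = entry.split("/", 1)[0].strip()
        if word and word not in seen:
            seen.add(word)
            words.append(word)
    return words


def convert(src_path, dst_path):
    """Read a slash-suffixed dictionary and write one bare word per line."""
    with open(src_path, encoding="utf-8") as src:
        words = strip_affix_flags(src)
    with open(dst_path, "w", encoding="utf-8") as dst:
        dst.write("\n".join(words) + "\n")
```

With made-up entries, `strip_affix_flags(["3", "Haus/EPS", "Tür", "Haus/N"])` gives `["Haus", "Tür"]`. This throws away whatever information the suffix carries, so it only makes sense if plain sub-words are all the filter needs.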
Where can I read a more detailed specification of the format the dictionary needs to have?