Hi guys,
i experiment with search phrases of german compound words. The testword is "kennenlernen" which consists of the two words "kennen" and "lernen".
I configured the following settings to achive that i can find the word "kennenlernen" with the searchphrase "kennen".
Any ideas why this wont work or solutions for debugging the indexed terms?
Thanks a lot.
Settings:
"analysis": {
"filter": {
"german_stop": {
"type": "stop",
"stopwords": "german"
},
"german_stemmer": {
"type": "stemmer",
"language": "light_german"
},
"german_compound_filter": {
"type": "dictionary_decompounder",
"word_list": [
"kennenlernen",
"kennen",
"lernen"
],
"min_subword_size": 2
}
},
"analyzer": {
"german_mod": {
"type": "custom",
"tokenizer": "standard",
"filter": [
"lowercase",
"german_stop",
"german_normalization",
"german_stemmer",
"german_compound_filter"
]
}
}
}
Mapping:
"_all": {
"analyzer": "german_mod"
}
Search with no hits:
"query": {
"match": {
"_all": {
"query": "kennen"
}
}
}
Search with one hit:
"query": {
"match": {
"_all": {
"query": "kennenlernen"
}
}
}