When analyzing alpha+beta delta, I want the outcome of tokens to be [ALPHA+BETA DELTA, ALPHABETADELTA, ALPHA, BETA, DELTA, ALPHA+, ALPHA+BETA]
. My anlyzer gives me the results that I am looking for, except for [ALPHA+, ALPHA+BETA]
. How can I include them?
{
"index": {
"number_of_shards": 1,
"analysis": {
"filter": {
"word_joiner": {
"type": "word_delimiter",
"catenate_all": true,
"preserve_original": "true"
}
},
"analyzer": {
"word_join_analyzer": {
"type": "custom",
"filter": [
"word_joiner",
"uppercase"
],
"tokenizer": "keyword"
}
}
}
}
}