I'm fighting with the strangest bug ever in ES. As I haven't clearly identified the root cause, I'm trying the forum first.
My analysis does two things:
- a char filter to replace <3 with ❤
- a synonym token filter to extend ❤ with "love"
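To make the setup concrete, here is a minimal sketch of index settings along those lines. The names heart_char_filter, love_synonyms and heart_analyzer are placeholders, and the whitespace tokenizer is an assumption on my part; the exact configuration is the one in the play linked below.

```json
{
  "settings": {
    "analysis": {
      "char_filter": {
        "heart_char_filter": {
          "type": "mapping",
          "mappings": ["<3 => ❤"]
        }
      },
      "filter": {
        "love_synonyms": {
          "type": "synonym",
          "synonyms": ["❤, love"]
        }
      },
      "analyzer": {
        "heart_analyzer": {
          "type": "custom",
          "char_filter": ["heart_char_filter"],
          "tokenizer": "whitespace",
          "filter": ["love_synonyms"]
        }
      }
    }
  }
}
```

With settings like these, I would expect the char filter to run on the raw text before the tokenizer, and the synonym filter to see the resulting tokens afterwards.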
If my content is "I ❤ ES", my tokens are OK and I have a "love" token.
But if my content is "I <3 ES", I only find ❤ in the tokens, not "love"!
Isn't the char_filter supposed to be executed first?
Also note that it does not happen with other replacements. If I replace all "ES" with "Elasticsearch" in the char_filter, and then have synonyms for "Elasticsearch", the synonyms are indexed.
Here is a "play" where I managed to reproduce the error: https://found.no/play/gist/2493ccf2f307d46359a0#analysis
You can see that where there is a <3, no "love" synonym appears, even though the replacement did occur.
Thanks for your help!