Example of a UTF8 text which does not generate tokens

I have a text ಪೋರ್ಟ್‌ಲ್ಯಾಂಡ್‌ which passes the utf-8 encoding checks and causes no issues when I use it inside javascript or print to the console. However, elasticsearch cannot build a token from it and throws no alerts or errors.
When I try to paste it in Kibana UI console it highlights the problematic spot in this text string:
kibana%20screenshot
Yet, no errors or warnings are fired, the search simply yields no matches.

Are there any special utf8 characters present in this string that have to be escaped? It does not seem to have usual suspects as . {} / \

I am most concerned that no warnings are given, caught this issue by a pure accident

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.