The nitty gritty of Unicode is a bit out of my wheelhouse but I did a little searching. While some regex engines support "Unicode scripts" which allow matching on characters from particular languages, it seems Elasticsearch's regex implementation does not support these. You may want to create an enhancement request on Elasticsearch's github repo. I can't think of any other good way to match on certain character sets at query time.
It might be easier to flag documents containing certain characters at index time.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.