The field temp_path is "text" type and "not_analyzed".
According to the document, "In order to prevent extremely slow wildcard queries, a wildcard term should not start with one of the wildcards * or ?. ", I used "迷惑メール*", instead of "*迷惑メール*", then it succeeded to hit the Japanese text.
So my question is: for the wildcard term like "*xxx*", why it can work on English text, but can not hit non-English text?
2nd: it’s probably happening because your text has been analyzed with the default analyzer at index time but when using wildcards the text is not analyzed. So analyzed terms and non analyzed terms don’t match.
Try to see what the _analyze API is telling you about the way your text is analyzed and you’ll get a better idea.
It's my fault to misuse "type:text" and "type:string". You are right, the text is analyzed. Sorry for your time.
And anyone else who viewing this topic, please be careful about this.
~~
The string field datatype has been replaced by the text field for full text analyzed content, and the keyword field for not-analyzed exact string values.
~~
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.