By the way, when I first tried to create a working example using my local
test/dev index, my Chinese characters were missing and queries against them
did not work. I don't recall exactly when I last deleted and reloaded that
index, nor which ES versions I upgraded through in the meantime. But I am
currently running ES 0.90.3, and I believe the index was deleted and
recreated (with successful regression tests including Chinese characters)
no earlier than 0.90.0. So I don't have any logs to show, just results.

The query runs against a synonym "table". (Yeah, I know. But I do find
that keeping synonyms in a separate query means that changing them does not
require a reload or reindex of the data. And performance is very good.)
{
  "bool" : {
    "must" : [ {
      "match" : {
        "field" : {
          "query" : "gn",
          "type" : "boolean"
        }
      }
    }, {
      "prefix" : {
        "words" : "醫"
      }
    } ]
  }
}
- When I first used my current laptop set-up to get a working example,
nothing was found. When I queried one of the English terms, the following
result came back. Note that the last value is expected to be a Chinese
phrase but comes out null instead:
{ "field" : [ "gn" , "o" , "cnam" ] , "words" : [ "Dr" , "Doctor" , "MD" ,
"Phd" , null ] }
- After deleting and reloading the index, the query now returns all words
including the Chinese:
{ "field" : [ "gn" , "o" , "cnam" ] , "words" : [ "Dr" , "Doctor" , "MD" ,
"Phd" , "醫生" ] }
I'm not sure why, since this has always worked, starting with my initial ES
version 0.19.4, and had never failed until today.
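
For anyone who wants to reproduce the check, here is a minimal Python sketch of the lookup query and of a test for the null entry I saw before the reload. The function names (build_synonym_lookup, has_missing_words) are made up for illustration and are not from my actual code. It only builds and inspects plain dicts, so sending the query to ES is left out:

```python
def build_synonym_lookup(field, prefix):
    """Build the bool query from this post: match on 'field', prefix on 'words'."""
    return {
        "bool": {
            "must": [
                {"match": {"field": {"query": field, "type": "boolean"}}},
                {"prefix": {"words": prefix}},
            ]
        }
    }


def has_missing_words(source):
    """True if any entry in the document's 'words' array came back as null."""
    return any(word is None for word in source.get("words", []))


def usable_words(source):
    """Return the synonym words with null entries dropped."""
    return [word for word in source.get("words", []) if word is not None]


# The two results quoted above: before and after the index reload.
before_reload = {"field": ["gn", "o", "cnam"],
                 "words": ["Dr", "Doctor", "MD", "Phd", None]}
after_reload = {"field": ["gn", "o", "cnam"],
                "words": ["Dr", "Doctor", "MD", "Phd", "醫生"]}

query = build_synonym_lookup("gn", "醫")
```

Running has_missing_words over every synonym document after a reload or an upgrade would be one way to catch this kind of silent loss earlier than I did.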
Brian