Search unicode


(Tule) #1

I have an Unicode string like this: "chủ nợ đã kéo đến cửa hàng đồ
gỗ".

As I follow the tutorial on the ElasticSearch website, I can search,
for example:
"hàng" and the result with highlight is beautiful.

The problem now is that I want to search for "hang" and still receive
the same result with highlight in unicode.

Can anyone help me with this? If you do, please explain in clearl
steps because I'm very new to search.

Thank you in advance!


(Shay Banon) #2

Check the asciifolding filter in the analysis section:
http://www.elasticsearch.org/guide/reference/index-modules/analysis/asciifolding-tokenfilter.html
.

You can build a custom analyzer that does ascii folding as well as one of
its filters.

On Wed, Dec 14, 2011 at 6:34 AM, Tule tule193@gmail.com wrote:

I have an Unicode string like this: "chủ nợ đã kéo đến cửa hàng đồ
gỗ".

As I follow the tutorial on the ElasticSearch website, I can search,
for example:
"hàng" and the result with highlight is beautiful.

The problem now is that I want to search for "hang" and still receive
the same result with highlight in unicode.

Can anyone help me with this? If you do, please explain in clearl
steps because I'm very new to search.

Thank you in advance!


(system) #3