How to normalize Japanese?


(Min Cha) #1

Hello all.

I would like to search by hiragana or katakana without any converting.
For example, It will be matched by same katakana letters even if original
text is stored as hiragana.
I expected to be normalization filter for that but could`t find.

How I resolve this problem? Please give me some advice.
Thanks for reading.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(Jörg Prante) #2

The best japanese analyzer I know is kuromoji

Does it work?

Jörg

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(Min Cha) #3

Thanks for answer.
The kuromoji seems like to not support a feature that I hope.

2013년 11월 11일 월요일 오후 6시 37분 30초 UTC+9, Jörg Prante 님의 말:

The best japanese analyzer I know is kuromoji
https://github.com/elasticsearch/elasticsearch-analysis-kuromoji

Does it work?

Jörg

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(Jun Ohtani) #4

Hi,

You try to use MappingCharFilter.
You define mappings all hiragana to katakana.

http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/analysis-mapping-charfilter.html

But kanji does not convert hiragana or katakana.

Make sense?


Jun Ohtani
johtani@gmail.com
blog : http://blog.johtani.info
twitter : http://twitter.com/johtani

2013/11/11 18:40、Min Cha minslovey@gmail.com のメール:

Thanks for answer.
The kuromoji seems like to not support a feature that I hope.

2013년 11월 11일 월요일 오후 6시 37분 30초 UTC+9, Jörg Prante 님의 말:
The best japanese analyzer I know is kuromoji https://github.com/elasticsearch/elasticsearch-analysis-kuromoji

Does it work?

Jörg

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


(system) #5