I try to use kuromoji user dictionary to make Elasticsearch can analazed japanese name which with
Half-width space between family name & first name.
Settings example looks like the below:
{
"settings": {
"analysis": {
"tokenizer": {
"kuromoji_user_dict": {
"type": "kuromoji_tokenizer",
"mode": "normal",
"user_dictionary_rules": [
"渡辺 健,渡辺 健,ワタナベ ヒカリ,カスタム名詞",
]
}
},
"analyzer": {
"my_ja_analyzer": {
"type": "custom",
"tokenizer": "kuromoji_user_dict",
"char_filter": [
"icu_normalizer"
]
}
}
}
}
}
But I fund it dosen't worked well.A part of name was not analazed .
{
"analyzer": "my_ja_analyzer",
"text": "渡辺 健"
}
{
"tokens": [
{
"token": "渡辺",
"start_offset": 0,
"end_offset": 2,
"type": "word",
"position": 0
},
{
"token": " ",
"start_offset": 2,
"end_offset": 3,
"type": "word",
"position": 1
}
]
}
Can somebody tells me what's wrong with it?