Fuzzy

kalpana_pinnaka · September 21, 2011, 5:29am

Hi, I want a command to get the "nearest word" of a miss-spell word
using fuzzy query.

for example: i have put a command like this

curl -XPUT http://localhost:9200/lowang/ets/11 -d '
{
"_id":11,
"title":"smartfon nokia 5678"
}

and i use a search command like below

curl -XGET http://localhost:9200/lowang/ets/_search -d '{
"query": {
"bool": {
"must":[
{ "fuzzy": { "title": { "value" : "noia" } } }
]
}
}}'

and now i got the results like below

{"took":35,"timed_out":false,"_shards":{"total":5,"successful":
5,"failed":0},"hits":{"total":1,"max_score":0.15342641,"hits":
[{"_index":"lowang","_type":"ets","_id":"11","_score":0.15342641,
"_source" :
{
"_id":11,
"title":"smartfon nokia 5678"
}}]}}

internally...using fuzzy query logic ..it is calculating nearest word
for "noia" as "nokia"..and it giving results for the word "nokia".

Instead of results i want to display the "nearest word" of a miss-
spelled word.

How to i can get?

AEvar_Arnfjord_Bjarm · September 21, 2011, 5:40pm

On Wed, Sep 21, 2011 at 07:29, kalpana pinnaka saikalpana18@gmail.com wrote:

internally...using fuzzy query logic ..it is calculating nearest word
for "noia" as "nokia"..and it giving results for the word "nokia".

Instead of results i want to display the "nearest word" of a miss-
spelled word.

I dealt with this problem today and just wrote code that for a given
resultset for a fuzzy result:

Got the first result in the set
Tokenized the words in it
Tokenized the words I'd given Elasticsearch
Compared the Levenshtein distance between all words and took into
account their length
Got words like "nokia" back for "noia"

I wish there was an easier API for this, but I haven't found it yet.

Topic		Replies	Views
How to get results for missspelled query not using fuzzy based query? Elasticsearch	8	457	July 6, 2017
Elastic Search for misspelled words Elasticsearch	15	11880	July 6, 2017
Fuzzy query Elasticsearch	2	286	July 6, 2017
Tolerance spelling Elasticsearch	4	1667	June 1, 2017
Return Levenshtein distance in fuzzy query Elasticsearch	1	461	July 6, 2017

Fuzzy

Related topics