Is fuzzy query in Elasticsearch related to fuzzy logic?

full_vlad · January 18, 2016, 3:17pm

As the title states, what exactly in Elasticsearch's fuzzy-query is related to fuzzy logic?

For example, given a string, a fuzzy query with a fuzziness of 2 will return all indexed strings within a Levenshtein distance of 2 of it. How does the system decide which answers to return if there are multiple matches?
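To make the edit-distance part of the question concrete, here is a minimal sketch of filtering terms by Levenshtein distance. It is only an illustration and not how Elasticsearch or Lucene implement it: the function names `levenshtein` and `fuzzy_candidates` and the sample terms are hypothetical, and the sketch uses plain Levenshtein distance (insert, delete, substitute) without treating a transposition as a single edit.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance via dynamic programming (row by row)."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or keep) the character
            ))
        prev = curr
    return prev[-1]


def fuzzy_candidates(query: str, terms: list[str], max_edits: int = 2):
    """Hypothetical helper: keep terms within max_edits of the query."""
    result = []
    for term in terms:
        distance = levenshtein(query, term)
        if distance <= max_edits:
            result.append((term, distance))
    return result


if __name__ == "__main__":
    # Illustrative terms only; a real index would supply its own dictionary.
    for term, distance in fuzzy_candidates("quick", ["quick", "quack", "quirky", "slow"]):
        print(term, distance)
```

Note that the filter is a hard cut-off: a term is either within the allowed number of edits or it is not. There is no graded membership value between 0 and 1 in this step.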

Is there a fuzzy system behind it, one that has triangular functions (for instance) and can be expressed as something like this:

1|   A    B
 |   /\  /\      A = fuzzy set 1
 |  /  \/  \     B = fuzzy set 2
 | /   /\   \
0|/   /  \   \
 ------------
  a   b  c   d

I would like a more theoretical answer that addresses what exactly is fuzzy about fuzzy queries.

Mark_Harwood · January 18, 2016, 3:26pm

Edit distance was one factor but TF-IDF was also part of the mix - IDF being handled badly until recently. See https://issues.apache.org/jira/browse/LUCENE-329 for the recent IDF fixes.

full_vlad · January 18, 2016, 3:56pm

Thanks very much for the quick answer, but I was looking for more of a beginner's answer that also shows the mathematical model behind fuzzy queries with regard to fuzzy logic.
