Hamming Distance on Binary strings

anahap · August 26, 2011, 10:50am

Hi there,

Does anyone know the best way to store fixedlength binary data and query it,
while scoring with hamming distance?
A hamming distance filter with a threshold would also be ok.

Thanks a lot, this is useful for all kinds of similarity searches based on
fingerprinting algorithms.

kimchy · August 26, 2011, 3:08pm

You can use fuzzy queries for Levenshtein distance, but note that they are
slow(er) in Lucene 3.3, will be much faster in Lucene 4.0 (when it comes
out).

On Fri, Aug 26, 2011 at 1:50 PM, anahap andy@nahapetian.com wrote:

Hi there,

Does anyone know the best way to store fixedlength binary data and query
it, while scoring with hamming distance?
A hamming distance filter with a threshold would also be ok.

Thanks a lot, this is useful for all kinds of similarity searches based on
fingerprinting algorithms.

Catalin_Banu · November 13, 2013, 10:08am

Hi,

Did you find a solution?

On Friday, August 26, 2011 1:50:51 PM UTC+3, anahap wrote:

Hi there,

Does anyone know the best way to store fixedlength binary data and query
it, while scoring with hamming distance?
A hamming distance filter with a threshold would also be ok.

Thanks a lot, this is useful for all kinds of similarity searches based on
fingerprinting algorithms.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Topic		Replies	Views
Hamming Distance on Binary Strings - Latest Elasticsearch	6	963	May 19, 2023
What is the best scheme for similarity search based on binary codes with Elasticsearch ? (Context is image search with deep nets) Elasticsearch	5	6160	July 5, 2017
ElasticSearch 5.5 two phase search Elasticsearch	5	1223	September 15, 2017
How to store a byte array? How to reference a stored field in a Filter implementation? Elasticsearch	5	2641	July 6, 2017
Can exact match get higher score than modified by character filter? Elasticsearch	3	776	March 12, 2022

Hamming Distance on Binary strings

Related topics