Hybrid Retrieval with approximate KNN

jinmingteo · April 3, 2023, 9:50am

Hi,

I have read through the following documentation: k-nearest neighbor (kNN) search | Elasticsearch Guide [master] | Elastic

I have also experimented with a small database and it seems that the results are skewed towards the matching function (which is BM25 (?)).

Matching score seems to be in a much higher magnitude as compared to KNN score. This can be further validated using the Explain API. Thus, is there any normalisation done to the matching values before adding to KNN score? Or we have to tune the values through boosting?

BenTrent · April 3, 2023, 12:50pm

Hey @jinmingteo ,

You must boost the KNN results to make them competitive with BM25.

Or you can inversely boost BM25 (boost by 0.6 or something).

We want to make this experience better. But for now, that is what you have to do.

system · May 1, 2023, 12:51pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Implement my own Hybrid Search Elasticsearch vector-search	2	485	January 9, 2024
Implementing Hybrid Search with k-NN and BM25 in Elasticsearch Open Source Elasticsearch	2	45	February 18, 2025
Normalizing knn and multi_match clauses Elasticsearch vector-search	1	42	January 2, 2025
Aggregate Score for Hybrid Search Elasticsearch vector-search	22	2208	March 17, 2023
Hybrid Search high score on irrelevant documents Elasticsearch vector-search	9	305	September 23, 2024

Hybrid Retrieval with approximate KNN

Related topics