Pre-filter KNN vector search

veesahni · August 24, 2023, 3:21pm

I'm trying to understand how to best implement a pre-filtered KNN vector search. To be clear: I'd like to pre-filter the total number of docs down to a smaller set, and then run a vector search that leverages a vector index over the results.

Documentation describes a Filtered KNN Search with the following note: "The filter is applied during the approximate kNN search to ensure that k matching documents are returned."

The during phrase is unclear.

Does this mean that it will apply a per-shared pre-filter?
Or does this mean that it will do a KNN search on the all the data in the shard and then filter before the share returns results.. possibly getting more results if too many got filtered out?
If a filter is applied to a KNN search, Is the underlying vector index still being used or it is reverting to brute force?

BenTrent · August 24, 2023, 7:01pm

Hey @veesahni !

Apologies it was unclear. We can update the verbiage.

Does this mean that it will apply a per-shared pre-filter?

It means that it is a pre-filter applied while searching the individual HNSW graphs. So, it is a true pre-filter while still only searching for the kNN within the HNSW graph.

Or does this mean that it will do a KNN search on the all the data in the shard and then filter before the share returns results..

No, it does not do that, it would be very expensive to do that.

The only time we dynamically switch to brute force is when the filter set is very restrictive (less than num_candidates).

system · September 21, 2023, 7:02pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Approximate KNN search with filtering vs. exact search after the filtering Elasticsearch vector-search	1	334	November 22, 2023
KNN search returns an empty result set when num_candidates is less than the filtered doc count Elasticsearch vector-search	10	223	October 18, 2024
Does pre-filtered approximate kNN still need entire data in memory? Elasticsearch vector-search	1	224	September 22, 2023
Elastic KNN search questions Elasticsearch vector-search	3	1060	November 6, 2023
Semantic search on more than 10k documents Elasticsearch vector-search	4	681	October 18, 2023

Pre-filter KNN vector search

Related topics