Does pre-filtered approximate kNN still need entire data in memory?

veesahni · August 25, 2023, 2:35am

This article on tuning approximate kNN states "You should ensure that data nodes have at least enough RAM to hold the vector data and index structures".

Does this still apply when doing filtered kNN search?

To be clear: if filtering is being used to significantly reduce the amount of data being queried at any one time, would we still need enough RAM to hold the entirety of the data in memory?

My guess - yes, still applies. Since shards have segments of HSNW graphs. And those graphs need to be in memory in their entirety for efficient querying. Even if we're running filtered queries.

system · September 22, 2023, 2:35am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Approximate KNN, Preloading & Performance Elasticsearch vector-search	5	210	February 28, 2024
Approximate KNN search with filtering vs. exact search after the filtering Elasticsearch vector-search	1	334	November 22, 2023
Managed Elastic search for billion scale dense vector index and performance Elasticsearch vector-search	3	1371	December 23, 2022
Manage heavy indexing in kNN indexes Elasticsearch vector-search	3	668	November 9, 2023
Elastic KNN search questions Elasticsearch vector-search	3	1060	November 6, 2023

Does pre-filtered approximate kNN still need entire data in memory?

Related topics