Profiling kNN search

BenTrent · March 7, 2023, 1:33pm

KNN spends most of its time in the rewrite. So, that is indeed the hot spot. Its where the KNN search occurs.

We do segment searches serially. So, comparing your two open KNN search tickets this is what I think is happening.

You are on a single node with a single shard. That single shard has 49 segments, each seems to be an OK size (at least a GB or so).

But, this then means, on a single node, you are exploring 49 different HNSW graphs.

In the future, we want to make KNN work in parallel on the same shard but with different segments, but right now, that doesn't happen.

I think you should try force-merging your test node to fewer segments. It doesn't have to be 1. 1 would be best, but it could take a while to complete.

So, I think:

Run it asynchronously (it will take a while), and set max_num_segments to something 10 or below. Again, 1 is best, but you would see significant improvements with 10 I think.

Topic		Replies	Views
KNN search speed Elasticsearch vector-search	12	2009	April 20, 2023
High Latency on KNN Search Elasticsearch vector-search	11	199	January 15, 2025
Help optimizing slow knn query Elasticsearch vector-search	7	106	November 4, 2024
Kibana search profiler accuracy Elasticsearch	4	287	November 17, 2021
A question about the Profile API Elasticsearch	1	204	August 11, 2022

Profiling kNN search

Related topics