Is my vector search fast enough?

I have 1536-dimensional vectors.
During benchmarking I run a vector similarity search using a script (not kNN), matching one vector against 30k stored documents with vectors, and I get a response in about 4s.
The time increases to 10s for 100k documents with vectors.

Is this slow? I have 32 cores and 128 GB of RAM.

My query script is given below:

"must": [
                    {
                        "script_score": {
                            "query": { "match_all": {} },
                            "script": {
                                "source": """
                                    if (doc['vector'].size() > 0) {
                                        return cosineSimilarity(params.query_vector, 'vector') + 1.0;
                                    } else {
                                        return 0.0;
                                    }
                                """,
                                "params": { "query_vector": vector }
                            }
                        }
                    }
                ]

That's expected, as it needs to compute the cosine similarity for every single vector (brute force). Going from 30k to 100k means many more vectors to scan.
Using kNN should be much faster.
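For kNN search to be available, the dense_vector field has to be indexed. A minimal mapping sketch for a PUT my-index request body, assuming the field name vector and dims 1536 from this thread (the similarity choice is an assumption, picked to match the cosineSimilarity script above):

```json
{
  "mappings": {
    "properties": {
      "vector": {
        "type": "dense_vector",
        "dims": 1536,
        "index": true,
        "similarity": "cosine"
      }
    }
  }
}
```

With "index": true, Elasticsearch builds an HNSW graph for the field, which is what makes approximate kNN fast compared with a linear script scan.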

Also look at Script score query | Elasticsearch Guide [8.17] | Elastic

During vector functions' calculation, all matched documents are linearly scanned. Thus, expect the query time to grow linearly with the number of matched documents. For this reason, we recommend limiting the number of matched documents with a query parameter.
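Following that recommendation, one way to shrink the scanned set is to replace match_all with a restrictive query inside script_score. A sketch of a _search request body, where the category field and its value are hypothetical and the query vector is a shortened placeholder:

```json
{
  "query": {
    "script_score": {
      "query": {
        "term": { "category": "news" }
      },
      "script": {
        "source": "cosineSimilarity(params.query_vector, 'vector') + 1.0",
        "params": { "query_vector": [0.1, 0.2] }
      }
    }
  }
}
```

Only documents matching the inner term query are scored by the script, so the linear scan covers that subset instead of the whole index.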

If your index is small, with a limited number of shards and segments, I suspect Elasticsearch will only be able to utilise a few cores at most, as there are, as far as I know, limits to parallelism.

Thanks @dadoonet for responding.
So is 4s the expected time for 30k documents?
I think that's a lot. I am going to have 5 million documents, so will script-based vector search not remain an option for me?

Thanks @Christian_Dahlqvist for replying.
I have a big index of about 5 million vectors with 5 shards, and I have preloaded vex and veq, i.e. the HNSW graph and the quantized vectors.
Given below is my index setting:
"store": {
    "preload": [
        "vex",
        "veq"
    ]
},
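For context, that fragment lives under the index settings. Since index.store.preload is a static setting, it has to be set at index creation (or on a closed index); a sketch of a PUT my-index request body, with the index name as an assumption:

```json
{
  "settings": {
    "index": {
      "store": {
        "preload": ["vex", "veq"]
      }
    }
  }
}
```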

Anything else I need to do?

If you always want to search across the 5M documents using a brute-force method, without any filter as recommended in the document I linked, then I suspect it will be super slow. Maybe after some queries it will become faster, once all the files are loaded in RAM, but it will still take time, as specified in the first doc I linked.

My advice is to switch to a kNN query. That's what I'm using myself, and it's extremely powerful.
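For reference, a sketch of the top-level knn search syntax in a POST my-index/_search request body, assuming a dense_vector field named vector that was indexed for kNN (the k and num_candidates values are assumptions, and the query vector is a shortened placeholder):

```json
{
  "knn": {
    "field": "vector",
    "query_vector": [0.1, 0.2],
    "k": 10,
    "num_candidates": 100
  }
}
```

num_candidates controls how many nearest-neighbour candidates are examined per shard before the top k are returned, so it trades recall against latency.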
