Hybrid search on managed elasticsearch instance

aiexplorations · December 11, 2024, 5:28am

Hi all,

I have a specific question on whether we need all the vectors to be persisted in memory on the Elasticsearch ML node (or in the other ES nodes) when running ML search.

We're building an application at my end which needs to implement hybrid search on ES for millions of records of data. We compute embeddings for some of the data fields in each record and use this as part of the hybrid search feature (which uses BM25 and vector similarity via RRF).

Want to understand whether these embeddings need to be persisted in memory, or whether they can be persisted to indexes, like with a vector database.

dadoonet · December 11, 2024, 6:29am

Welcome.

Elasticsearch is a vector database. You need to index (store) your vectors in ES if you want to be able to search for them.

Topic		Replies	Views
Elasticsearch Hybrid Query - No Results Elasticsearch	2	785	February 18, 2021
Vector Search in ElasticSearch + Filtering using other fields Elasticsearch vector-search	4	408	May 7, 2024
Implementing Hybrid Seach with AppSearch Elastic Search elastic-app-search	1	256	December 11, 2022
Question about ES w/ Couch(or any other db) Elasticsearch	7	638	July 6, 2017
Implement my own Hybrid Search Elasticsearch vector-search	2	380	January 9, 2024

Hybrid search on managed elasticsearch instance

Related topics