Index design for product with multi image and similarity search

rastin_rastini · September 3, 2024, 3:53pm

Hi
I'm scraping product pages where each product has a description and multiple images, and I generate a vector for each image. In the end I want to search for images similar to a given image and return the products those images belong to.

What is the best design for this?

Create a complete document for each image, including the product data? (For example, a product with 5 images would produce 5 documents with the same description.)
Or create one document per product, with each image name and its vector nested inside the main document?
Or create two kinds of documents and join them?
Or some other design?

RabBit_BR · September 4, 2024, 10:49am

Hi @rastin_rastini.

I don't know much about search performance for vectors in nested fields. Given that, and given that a product can have several images, each of which adds to the size of the document, I would go with two indexes.

There would be a product index and a second index holding the image vectors, one document per image (a product with 5 images would have 5 documents). This lets you manage the two indexes separately, both for data ingestion and for search.
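As a rough sketch of that two-index layout, the snippet below builds the index bodies and the per-image documents as plain Python dicts. The field names (`description`, `product_id`, `image_name`, `image_vector`) and the vector size of 512 are assumptions made for illustration, not something stated in this thread; `dims` has to match whatever model produces the image vectors.

```python
# Assumed vector size; it must match the image embedding model actually used.
IMAGE_VECTOR_DIMS = 512


def product_index_body():
    """Index body for the product index: one document per product."""
    return {
        "mappings": {
            "properties": {
                "description": {"type": "text"},
            }
        }
    }


def image_index_body(dims=IMAGE_VECTOR_DIMS):
    """Index body for the image index: one document per image."""
    return {
        "mappings": {
            "properties": {
                # Hypothetical field linking an image back to its product.
                "product_id": {"type": "keyword"},
                "image_name": {"type": "keyword"},
                "image_vector": {
                    "type": "dense_vector",
                    "dims": dims,
                    "index": True,
                    "similarity": "cosine",
                },
            }
        }
    }


def image_documents(product_id, images):
    """Turn {image name: vector} into one image-index document per image."""
    return [
        {"product_id": product_id, "image_name": name, "image_vector": list(vector)}
        for name, vector in images.items()
    ]
```

With this shape the description is stored once per product, and only the small linking field `product_id` is repeated on each image document.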

For vector search, you would query the image index and then enrich the results with data from the product index.
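A minimal sketch of that search-then-enrich flow, written as plain Python helpers that build the request bodies and merge the responses. The field names (`product_id`, `image_name`, `image_vector`) are hypothetical, and it assumes each product document's `_id` equals the `product_id` stored on its image documents. Sending the requests is left out so the snippet does not depend on a particular client version.

```python
def build_knn_request(query_vector, k=10, num_candidates=100):
    """Body for a kNN search against the image index."""
    return {
        "knn": {
            "field": "image_vector",  # hypothetical dense_vector field
            "query_vector": list(query_vector),
            "k": k,
            "num_candidates": num_candidates,
        },
        "_source": ["product_id", "image_name"],
    }


def best_hit_per_product(hits):
    """Keep only the highest-scoring image hit for each product."""
    best = {}
    for hit in hits:
        pid = hit["_source"]["product_id"]
        if pid not in best or hit["_score"] > best[pid]["_score"]:
            best[pid] = hit
    return sorted(best.values(), key=lambda h: h["_score"], reverse=True)


def build_mget_request(product_ids):
    """Body for a multi-get against the product index."""
    return {"ids": list(product_ids)}


def enrich(image_hits, product_docs):
    """Attach product data to image hits; skip products that were not found."""
    products = {d["_id"]: d["_source"] for d in product_docs if d.get("found")}
    results = []
    for hit in image_hits:
        pid = hit["_source"]["product_id"]
        if pid in products:
            results.append({
                "product_id": pid,
                "score": hit["_score"],
                "matched_image": hit["_source"]["image_name"],
                "product": products[pid],
            })
    return results
```

The first body would go to the `_search` endpoint of the image index and the second to the `_mget` endpoint of the product index. `hits` is the `hits.hits` array of the search response and `product_docs` is the `docs` array of the multi-get response. Each search then costs two round trips, but the second is a lookup by ID rather than another search.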

rastin_rastini · September 5, 2024, 11:49am

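There are two concerns to weigh. The first is data volume: denormalization means repeating the product data in every image document. The second is performance: with two indexes, every search has to query one index first and then join the result with the other index.
So which is better?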
