Is it possible to get a vector embedding for an image from an inference endpoint using the sentence-transformers__clip-vit-b-32-multilingual-v1 model? And if so, what format should the input be?
The documentation for the inference API says the input should be a string, so I suspect this is not possible. But perhaps a base64-encoded image might work?
I'm not sure how I would test this myself: if I send it a base64-encoded string it will return a vector, but I have no way of knowing whether that vector represents the image or just the text of the string.
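One way to check, assuming you can run the CLIP models locally with the sentence-transformers library, is to compare the vector the endpoint returns for the base64 string against a locally computed image embedding and a locally computed text embedding of that same string; whichever one it matches closely tells you how the endpoint interpreted the input. This is only a sketch; the image path and the endpoint vector are placeholders.

```python
import base64
from PIL import Image
from sentence_transformers import SentenceTransformer, util

# clip-ViT-B-32 encodes images; the multilingual model encodes text into the same space.
image_model = SentenceTransformer("clip-ViT-B-32")
text_model = SentenceTransformer("sentence-transformers/clip-ViT-B-32-multilingual-v1")

image_path = "example.jpg"  # hypothetical test image
with open(image_path, "rb") as f:
    b64_string = base64.b64encode(f.read()).decode("utf-8")

image_vec = image_model.encode(Image.open(image_path))  # true image embedding
text_vec = text_model.encode(b64_string)                # base64 treated as plain text

# endpoint_vec = ...  # vector returned by the inference endpoint for b64_string
# If the endpoint actually decoded the image, its similarity to image_vec should be
# near 1.0; if it just embedded the string as text, it will match text_vec instead.
# print(util.cos_sim(endpoint_vec, image_vec), util.cos_sim(endpoint_vec, text_vec))
print(util.cos_sim(text_vec, image_vec))  # text-vs-image similarity, for reference
```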
No, it's not possible yet. I hope to see that happen at some point, as I'd love to run inference on my audio files directly within Elastic instead of writing my own Python code.
I would also be interested in using the inference API to get a vector embedding for an image file. I tried using the sentence-transformers__clip-vit-b-32-multilingual-v1 model locally in a Python service, but I am struggling to wrap that service in a Docker container. The resulting Docker image is very big and takes forever to deploy to the Kubernetes cluster.
Do you know if this is on the roadmap, and if so, any idea when it will be available?
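On the image-size problem: a common cause is that pip pulls in the default CUDA build of PyTorch, which adds several gigabytes. A rough sketch of a slimmer build, assuming a CPU-only service and a hypothetical app.py that wraps the model:

```dockerfile
# Sketch of a slimmer image for a CLIP embedding service (CPU-only assumption).
FROM python:3.11-slim

# Install the CPU-only PyTorch wheel first so sentence-transformers does not
# pull the much larger CUDA build as a dependency.
RUN pip install --no-cache-dir torch --index-url https://download.pytorch.org/whl/cpu \
 && pip install --no-cache-dir sentence-transformers fastapi uvicorn

# Pre-download the model at build time so the container starts without network access;
# alternatively, mount a model cache volume at runtime to keep the image smaller.
RUN python -c "from sentence_transformers import SentenceTransformer; SentenceTransformer('sentence-transformers/clip-ViT-B-32-multilingual-v1')"

WORKDIR /app
COPY app.py /app/app.py  # hypothetical service wrapping the model

CMD ["uvicorn", "app:app", "--host", "0.0.0.0", "--port", "8000"]
```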