Vertex AI Gemini Embeddings Task Type Support in Inference API

Jared_Gray · September 2, 2025, 8:32pm

I'm implementing semantic search with the Vertex AI Gemini embedding model (gemini-embedding-001) through the Inference API. The
"Create a Google Vertex AI inference endpoint" page in the Elasticsearch API documentation shows support for a task_type parameter, but there
appears to be a gap between the task types Elasticsearch supports and the ones Vertex AI offers.

The Elasticsearch Vertex AI Inference API supports these task types:

rerank
text_embedding
completion
chat_completion

However, the Gemini embedding API provides its own task types, each optimized for a specific use case:

SEMANTIC_SIMILARITY - Embeddings optimized to assess text similarity
CLASSIFICATION - Embeddings optimized to classify texts
CLUSTERING - Embeddings optimized to cluster texts
RETRIEVAL_DOCUMENT - Embeddings optimized for document search
RETRIEVAL_QUERY - Embeddings optimized for search queries
CODE_RETRIEVAL_QUERY - Embeddings optimized for code block retrieval via natural language
QUESTION_ANSWERING - Embeddings for Q&A systems
FACT_VERIFICATION - Embeddings for fact checking systems

Questions

Will Elasticsearch's Vertex AI inference endpoint support Google's specific task types (like RETRIEVAL_DOCUMENT, RETRIEVAL_QUERY, etc.) in future releases? These task types can impact embedding quality for specific use cases.
How does the current text_embedding task type map to Google's task types? Does it default to a specific Gemini task type, or does it use Google's default behavior?

For now, I’m planning to:

Use Vertex AI's client libraries directly to generate embeddings with the appropriate task types
Index the resulting embeddings via the Bulk API
Update embeddings manually when needed, outside of the inference pipeline setup
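
The plan above can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not a tested integration: it only builds JSON payloads and makes no network calls. The Vertex AI request fields (`content`, `task_type`, `outputDimensionality`) are assumed from the text-embeddings REST request format and should be verified against Google's documentation. The helper names, the index name `docs`, and the field names `text` and `embedding` are hypothetical.

```python
import json


def build_embedding_request(text, task_type, dimensions=3072):
    """Build a JSON body for a Vertex AI embeddings predict call.

    Field names are assumptions; check the current Vertex AI reference.
    """
    return {
        "instances": [{"content": text, "task_type": task_type}],
        "parameters": {"outputDimensionality": dimensions},
    }


def build_bulk_body(index_name, docs):
    """Build an NDJSON body for the Elasticsearch Bulk API.

    `docs` is an iterable of (doc_id, text, vector) tuples. The document
    fields `text` and `embedding` are hypothetical names.
    """
    lines = []
    for doc_id, text, vector in docs:
        # Action line, followed by the document source line.
        lines.append(json.dumps({"index": {"_index": index_name, "_id": doc_id}}))
        lines.append(json.dumps({"text": text, "embedding": vector}))
    # A Bulk API body must be terminated by a newline.
    return "\n".join(lines) + "\n"


def build_knn_search(field, query_vector, k=10, num_candidates=100):
    """Build a kNN search body for a dense_vector field."""
    return {
        "knn": {
            "field": field,
            "query_vector": query_vector,
            "k": k,
            "num_candidates": num_candidates,
        }
    }


# Index time: request a document-side embedding.
doc_request = build_embedding_request("Snapshot and restore guide", "RETRIEVAL_DOCUMENT")

# Search time: request a query-side embedding for the user's query.
query_request = build_embedding_request("how do I restore a snapshot", "RETRIEVAL_QUERY")

# Placeholder vector standing in for the values Vertex AI would return.
placeholder_vector = [0.0] * 3072
bulk_body = build_bulk_body("docs", [("1", "Snapshot and restore guide", placeholder_vector)])
search_body = build_knn_search("embedding", placeholder_vector)
```

With this split, documents are embedded with RETRIEVAL_DOCUMENT before indexing and queries with RETRIEVAL_QUERY before searching, so the task type is chosen outside Elasticsearch and only the resulting vectors are sent to it. The target field would need a dense_vector mapping whose dimension count matches the model output (3072 here).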

Has anyone else encountered this? Are there alternative approaches or upcoming features that might address this?

Thanks for any insights!

Additional Details:

Using Elasticsearch 8.17.3
Gemini model: gemini-embedding-001
Current embedding dimensions: 3072
