While searching how are shards picked for cluster with index.routing_partition_size

ktech007 · April 23, 2024, 3:03am

Let's say my cluster is set up with index.routing_partition_size of 20. For a routing key abc, it can then route documents to max 20 shards. Here is a scenario:

Suppose a document comes with routing key abc and it is routed to shard 10 out of 100 shards the index has. This is the first document that lands on that shard.
While searching, we pass routing abc.

Two questions:

How does it determine what shards to pick? Is the list of shards pre-calculated based on the routing key?
If the routes are sort of dynamically added to some list as the documents are added, is this list cached and then refreshed? Is there a delay? For example: I search for the above-mentioned document. Is there a scenario where it may not be returned because that shard is not in this cached table?

Christian_Dahlqvist · April 23, 2024, 6:26am

Yes, the shard determination is deterministic based on the routing key and the document ID. The calculation is described here.

Topic		Replies	Views
[Help] Understanding routing to an index partition Elasticsearch	3	1553	February 8, 2022
Routing to an index partition Elasticsearch	2	626	May 31, 2018
Determine Shard Id based on routing key Elasticsearch	5	1156	July 6, 2017
Routing to an Index Partition - Custom Routing Elasticsearch	1	262	October 13, 2021
What is the `index.number_of_routing_shards` setting? How can I calculate it based on the number of shards? Elasticsearch	8	2815	March 31, 2022

While searching how are shards picked for cluster with index.routing_partition_size

Related topics