We've indexed 25M documents into a single index of 128 shards with 1
routing parameter is set to a path in the document, which is an int
In our 25M documents, there are 167 distinct values of this "some_id" and in
our expectation, ElasticSearch will route these documents evenly across all
But we've found that, out of 128 shards, 53 are empty
(0 documents inside), so about 41% of the shards are not used at all.
- is this normal? Are we missing something in the routing configuration?
- does this imbalanced shard utilization affect indexing speed?
We can confirm that all documents are correctly indexed and that routing works
(when searching with routing, only 1 shard responds, with the correct answer).
ElasticSearch version is v1.0.1.
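
As a rough sanity check on the "evenly across all shards" expectation, here is a minimal sketch of how many shards would stay empty if each of the 167 distinct routing values landed on one of the 128 shards uniformly at random. This is only an idealized model: it does not reproduce the hash function ElasticSearch v1.0.1 actually uses, and the function names are made up for illustration.

```python
import random


def expected_empty_shards(num_shards, num_keys):
    """Expected count of shards that receive no routing value, assuming
    each value is assigned to a shard uniformly and independently
    (an idealized hash, NOT ElasticSearch's real one)."""
    return num_shards * (1 - 1 / num_shards) ** num_keys


def simulate_empty_shards(num_shards, num_keys, trials=2000, seed=0):
    """Monte Carlo estimate of the same quantity, as a cross-check."""
    rng = random.Random(seed)
    total_empty = 0
    for _ in range(trials):
        # Set of shards hit by at least one routing value in this trial.
        used = {rng.randrange(num_shards) for _ in range(num_keys)}
        total_empty += num_shards - len(used)
    return total_empty / trials


if __name__ == "__main__":
    shards, distinct_ids = 128, 167  # figures from this post
    print("analytic :", round(expected_empty_shards(shards, distinct_ids), 1))
    print("simulated:", round(simulate_empty_shards(shards, distinct_ids), 1))
```

Under this assumption roughly 34 to 35 of the 128 shards would be empty on average, so a sizeable number of empty shards is expected with only 167 distinct routing values. The 53 we observe is higher than that baseline; this sketch cannot tell us whether that comes from the hash function or from something in our configuration.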