How searching over multiple indices using alias work internally, is it faster that having one index having all the documents

code_merchant · June 9, 2021, 5:37am

I have 2 clusters.
Cluster A: have 1 big index storing 500 GB of data
Cluster B: have monthly indices of 30GB for 15 months

There is a search query which search over a single index in cluster A and over read-alias [image-2019-10, image-2019-11, ..., image-2021-06] in Cluster B.

According to search latency metrics of Cluster Health,
Cluster B has 4-6 ms of search latency over a shard.
Cluster A as 30-40 ms of search latency over a shard.

But the overall result of the query in Cluster A takes 10seconds and Cluster B is taking 20seconds.

I think Cluster B uses threads internally to search parallely over multiple indices, so overall time taken by search query with alias should be less. Am I missing something?

DavidTurner · June 9, 2021, 7:56am

Yes. Imagine looking up a word by hand in a 5000-page dictionary, and compare that to trying to find the same word in ten 500-page dictionaries each of which contains a random tenth of the words. On average you'd find it quicker in the one big dictionary, even though your per-dictionary search time would be slower (e.g. 5000-page books are heavy and hard to use).

If there was a few of you working together you might be able to achieve better performance, but trying to coordinate multiple lookups in parallel across multiple people takes significant time and effort too. Parallelising the work only takes you so far, doing less work (using a more efficient data structure) is the way to go.

system · July 7, 2021, 7:57am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Migration from single to multi-index increased searchRate and overall search time Elasticsearch	7	376	July 5, 2021
Search time with multiple aliases Elasticsearch	1	600	November 7, 2017
Multi-index overhead Elasticsearch	2	462	May 23, 2019
Search multiple indices Elasticsearch	4	385	January 2, 2022
Shards vs indexes vs cluster Elasticsearch	4	385	July 6, 2017

How searching over multiple indices using alias work internally, is it faster that having one index having all the documents

Related topics