How much does index size affect search speed?

brandont · February 26, 2017, 2:20am

I set up Elasticsearch to create a new index every day, but when I first started the stack up I "synchronized" a lot of past log data right away, so the first index ended up a LOT larger than the ones following it (50 GB instead of 1 GB). Is this going to make my searches a lot slower? I'm worried that a single worker thread will be assigned to search the whole thing by itself and will take a lot longer than the rest. Would it make a difference if I reindexed the data so that the initial upload was spread over several indices, or is this not even a factor?

Thanks,
Brandon

warkolm · February 26, 2017, 3:39am

It won't matter. A shard is multithreaded.

brandont · February 26, 2017, 3:53am

Alright, thanks. I'll check that one off the list of possible slowdown causes.

Christian_Dahlqvist · February 26, 2017, 6:52am

Each query runs single-threaded against each shard, but multiple queries and shards can be processed in parallel. The size of a shard therefore does affect query latency, which is why we generally recommend benchmarking to find the ideal shard size.
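
To make that concrete, here is a rough sketch of the "one thread per shard" behaviour as a toy scheduling model. It is not how Elasticsearch measures anything: the function name `estimated_query_cost`, the one-cost-unit-per-GB figure, the single shard per index, and the four search threads are all assumptions made up for illustration, and real latency depends heavily on the query, caching, and hardware.

```python
import heapq


def estimated_query_cost(shard_sizes_gb, search_threads, cost_per_gb=1.0):
    """Toy model of one query fanning out over several shards.

    Assumptions (illustrative only, not Elasticsearch internals):
    - one thread works on one shard at a time,
    - the cost of searching a shard grows linearly with its size,
    - the query finishes when the slowest thread finishes.
    """
    # Each entry is the time at which a thread becomes free again.
    threads = [0.0] * search_threads
    heapq.heapify(threads)
    # Hand the largest shards out first to the earliest free thread.
    for size in sorted(shard_sizes_gb, reverse=True):
        free_at = heapq.heappop(threads)
        heapq.heappush(threads, free_at + size * cost_per_gb)
    return max(threads)


# One 50 GB shard: only one of the four threads can help.
one_big_shard = estimated_query_cost([50], search_threads=4)
# The same 50 GB spread over fifty 1 GB shards: all four threads share the work.
many_small_shards = estimated_query_cost([1] * 50, search_threads=4)
print(one_big_shard, many_small_shards)
```

Under these assumptions the single large shard costs 50 units while the fifty small shards cost 13, which is the intuition behind benchmarking shard size rather than letting one shard grow unchecked.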

brandont · February 27, 2017, 12:00am

Interesting. Thanks for following up. I'll take a look at the presentation. Do you think this could be the cause of my search speed problems?

Christian_Dahlqvist · February 27, 2017, 6:12am

If the shard size has changed significantly, that is certainly possible.

brandont · March 13, 2017, 5:08pm

To follow up on this issue, it does seem that removing the one large shard stabilized the Elasticsearch node. After increasing the number of shards, I ran into an additional issue: the search worker threads ran out of queue space. Increasing the search queue size from 1000 to 5000 resolved this, but it did result in slightly longer-running searches.
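
For anyone wondering why more shards can exhaust the queue, here is a back-of-the-envelope sketch. It assumes every search creates one task per shard and that all tasks arrive in a single burst; the function name `rejected_shard_tasks`, the 50 concurrent searches, the 30 shards, and the 7 search threads are hypothetical numbers, not values from my cluster. Only the 1000 and 5000 queue sizes come from the change described above (as far as I know the relevant setting in Elasticsearch 5.x is `thread_pool.search.queue_size`, but check the docs for your version).

```python
def rejected_shard_tasks(concurrent_searches, shards_per_search,
                         search_threads, queue_size):
    """Worst-case count of shard-level tasks that would not fit.

    Assumption: each search fans out into one task per shard, and all
    tasks arrive at once, so capacity is running threads plus queue slots.
    """
    tasks = concurrent_searches * shards_per_search
    capacity = search_threads + queue_size
    return max(0, tasks - capacity)


# Hypothetical load: 50 searches at once, each touching 30 shards.
print(rejected_shard_tasks(50, 30, search_threads=7, queue_size=1000))
print(rejected_shard_tasks(50, 30, search_threads=7, queue_size=5000))
```

With these made-up numbers, 1500 tasks overflow a queue of 1000 but fit comfortably in a queue of 5000. The tasks that fit still have to wait their turn, which is consistent with searches taking a bit longer after the change.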

system · April 10, 2017, 5:09pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
