Benchmarking ES using Rally store size metric increase and decrease

fargolk · December 14, 2021, 12:58pm

I was benchmarking an ES index with different shard counts using esrally, to find the shard count that gives the best throughput for this specific index.
As I read in the documentation, store size is the index size (translog excluded).
I expected the store size in the report summary to increase as the shard count increased, but it increased at first and then suddenly decreased at some point. What is the reason for this behavior?
The track includes the following operations:

Running delete-index [100% done]
Running create-index [100% done]
Running cluster-health [100% done]
Running bulk [100% done]
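
For reference, the four operations above correspond to a Rally track schedule roughly like the one sketched below. This is not the actual track from this post: the helper name `build_schedule`, the bulk size, and the client count are assumptions, and passing `index.number_of_shards` through the `settings` parameter of `create-index` is just one possible way to vary the shard count per race.

```python
import json


def build_schedule(shards, bulk_size=5000, clients=1):
    """Return a Rally-style schedule with the four operations from the log.

    bulk_size and clients are placeholder values, not taken from the thread.
    """
    return [
        {"operation": {"operation-type": "delete-index"}},
        {
            "operation": {
                "operation-type": "create-index",
                # Merged into the index settings from the index body.
                "settings": {"index.number_of_shards": shards},
            }
        },
        {
            "operation": {
                "operation-type": "cluster-health",
                "request-params": {"wait_for_status": "green"},
                "retry-until-success": True,
            }
        },
        {
            "operation": {"operation-type": "bulk", "bulk-size": bulk_size},
            "clients": clients,
        },
    ]


if __name__ == "__main__":
    # Print the schedule for a 3-shard race as JSON, e.g. for a track.json file.
    print(json.dumps(build_schedule(3), indent=2))
```

Note that nothing in this schedule waits for merges to finish after the bulk task, so the store size is read from whatever state the index is in when the race ends.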

Race report summary with 1 shard:
Store size 0.0001169 GB

Race report summary with 2 shards:
Store size 5.12749 GB

Race report summary with 3 shards:
Store size 2.22465 GB

Race report summary with 4 shards:
Store size 1.93715 GB

json · December 14, 2021, 4:03pm

Hi, welcome to the Elastic community, and thank you for your post!

The race report values vary fairly broadly! The described behavior sounds like the normal ES segment merge cycle. Can you share the full race summary reports? I suspect merge throttling is causing it.
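
One way to check the merge hypothesis is to look at the index stats right after the bulk task and see whether merges are still in flight. The sketch below only parses a response shaped like the output of the index stats API (`GET /<index>/_stats`); the function name `summarize_index_stats` and the sample numbers are made up for illustration and are not taken from the races in this thread.

```python
def summarize_index_stats(stats):
    """Pull store size, segment count and merge activity out of an
    index stats response (already parsed from JSON into a dict)."""
    primaries = stats["_all"]["primaries"]
    merges = primaries["merges"]
    return {
        "store_bytes": primaries["store"]["size_in_bytes"],
        "segment_count": primaries["segments"]["count"],
        "merges_in_flight": merges["current"],
        "merging_bytes": merges["current_size_in_bytes"],
        # True if the store size may still change because merges are running.
        "still_merging": merges["current"] > 0,
    }


if __name__ == "__main__":
    # Hypothetical response fragment; the values are invented for the example.
    sample = {
        "_all": {
            "primaries": {
                "store": {"size_in_bytes": 5_000_000_000},
                "segments": {"count": 42},
                "merges": {"current": 2, "current_size_in_bytes": 1_500_000_000},
            }
        }
    }
    print(summarize_index_stats(sample))
```

If `still_merging` is true at the point where the store size is sampled, the reported value may include both the source segments and the partially written merged segment, so it could shrink once the merge completes. Adding a `force-merge` operation, or a wait, before the end of the race is one way to test this.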

Are you using a publicly available Rally track? What are the CPU, memory and storage specs of your target system? I could try reproducing it for a better perspective.

Jason

fargolk · December 14, 2021, 8:25pm

Thanks. Yes, the full report is attached.

shards = 1 vs shards = 2

shards = 2 vs shards = 3

shards = 3 vs shards = 4

I created the track from my own index, for which I want to find the optimal shard count.
Actually, I don't understand how segment merges can cause this fluctuation. It would be very helpful to learn more about these merges and how much storage they may take. Is there any specific documentation I should read to learn more about the segment merge process? Thanks.

system · January 11, 2022, 8:25pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
