Internal rally queue in throughput-throttled mode

jmlucjav · September 9, 2022, 6:48pm

Hi

I have a question regarding latency and ES thread pool queues...I am running rally in throughput-throttled mode, targeting a throughput higher than ES will achieve in benchmarking mode. Some numbers I get are:

| Median Throughput | my-query | 6390.59 | ops/s |
| 50th percentile latency | my-query | 11618.4 | ms |
| 99.99th percentile latency | my-query | 27001.3 | ms |
| 50th percentile service time | my-query | 4.23791 | ms |
| 99.99th percentile service time | my-query | 59.8257 | ms |
| error rate | my-query | 0 | % |

And I can see in monitoring the largest search queue size during the benchmark was just 6.

So far this makes sense, latency being much larger than service time. As per the faq:

"Rally runs in throughput-throttled mode and generates requests according to this schedule regardless of how fast Elasticsearch can respond. In this mode the generated requests are first placed in a queue within Rally and may stay there for some time."

So rally is holding the requests on its own for some time before sending them to ES, thus ES search queue does not grow. My questions are:

why is rally doing this? Would not be better that the queue growth is shown in ES (eventually rejecting search requests)?
how does rally decide how long to wait in the internal queue? Above it says 'for some time', which sounds fuzzy

thanks

json · September 9, 2022, 8:19pm

Hi @jmlucjav, Rally will only attempt to run the specified number of operations per second in throughput-mode. Any value, high or low, for the largest Elasticsearch search queue size, is enough to indicate the cluster received more search requests than it could handle at any one time. A low value means the cluster was saturated, but perhaps it was not saturated enough.

target-throughput sets the number of operations per second for the search task. The length of time operations will sit in Rally's queue is variable based on the target-throughput, so we really do not know how long they will be there. They will sit in Rally's queue until enough time has passed for the requests to be submitted at the throughput specified in target-throughput.

If you want to saturate the search queue beyond 6 requests while measuring service time, you could try increasing target-throughput.

Thank you,
Jason

jmlucjav · September 12, 2022, 5:42pm

Hi Jason,

Understood.

What I now see it is happening too, is that you need to increase the iterations as well, so rally has time to reach the target throughput...

thanks!

system · October 10, 2022, 5:43pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Can't reach Rally Target throughput Elasticsearch rally	8	1124	October 15, 2018
A question for result benchmark Elasticsearch rally	2	791	March 27, 2017
Why the percentile latency is several times more than service time Elasticsearch rally	2	1403	January 18, 2017
About the Max Throughput VS target throughtput Elasticsearch rally	5	734	October 29, 2018
Benchmarking High Volumes Elasticsearch rally	2	507	May 11, 2019

Internal rally queue in throughput-throttled mode

Related topics