I found that Rally uses the Actor model to generate load under the hood. From the documentation, I understand that the target throughput is achieved by all clients together. However, I do not understand how the number of clients set in a race relates to CPU cores. Can someone help me understand how these two are related? Specifically, how is the number of clients set in a Rally schedule mapped to Actors, and in turn to CPU threads, to generate the target throughput?
If there is any existing documentation on this, please point me to it.
These are largely implementation details, and as a user of Rally you shouldn't really need to be aware of them, but let me try to explain.
Rally is written in Python, which has a mechanism called the Global Interpreter Lock (GIL) that prevents multiple threads of the same Python process from running on CPU cores in parallel.
The GIL limits the total throughput we could achieve using a single Python process, so to work around it we allocate one 'Actor' per available core, where an Actor is a separate Python process responsible for running a single asyncio event loop.
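To illustrate the "one process, one event loop" idea: a single actor can interleave many clients on one core because each simulated request yields control back to the loop while it waits. This is a minimal sketch with made-up names, not Rally's actual code:

```python
import asyncio

async def client(client_id: int, results: list) -> None:
    # Each simulated request awaits I/O, yielding to the event loop so
    # one actor can interleave many clients on a single core.
    await asyncio.sleep(0.01)
    results.append(client_id)

async def actor(num_clients: int) -> list:
    # One actor = one Python process = one asyncio event loop.
    results = []
    await asyncio.gather(*(client(i, results) for i in range(num_clients)))
    return results

results = asyncio.run(actor(4))
```

Because the clients spend their time awaiting rather than computing, the GIL is not a bottleneck within a single actor; it only matters for CPU-bound work, which is why the actor processes themselves are spread across cores.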
The clients defined in a specific task's `clients` property are then evenly distributed across the available 'Actors' (i.e. Python processes/asyncio event loops).
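The distribution step can be sketched like this (a hypothetical round-robin allocation, not Rally's actual implementation):

```python
def distribute_clients(num_clients: int, num_actors: int) -> list[list[int]]:
    """Assign client IDs to actors as evenly as possible (round-robin)."""
    allocations = [[] for _ in range(num_actors)]
    for client_id in range(num_clients):
        allocations[client_id % num_actors].append(client_id)
    return allocations

# e.g. 8 clients on an (assumed) 3-core machine, one actor per core:
actors = distribute_clients(8, 3)
# each actor then runs its share of clients on its own asyncio event loop
```

With 8 clients and 3 actors, two actors get 3 clients and one gets 2, so no single process is responsible for much more load than the others.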
Thanks Brad. This helps me picture the flow. One follow-up question: I read that the latency metric reported in the summary includes the time a request spends waiting in a queue after creation but before being sent to the cluster. Where is this queue in the picture? Is it part of each client within an Actor, or of the Actor itself?
This really is an implementation detail, but right now it is per client within an Actor.
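As a rough mental model (a sketch under assumptions, not Rally's implementation): picture each client as a coroutine with its own queue, where latency is measured from the moment a request is created, so it includes any time spent waiting behind earlier requests in that per-client queue:

```python
import asyncio
import time

async def client(queue: asyncio.Queue, latencies: list) -> None:
    while True:
        created_at, request = await queue.get()
        if request is None:  # sentinel: no more requests
            break
        # "Service" the request (stands in for the HTTP round trip).
        await asyncio.sleep(0.01)
        # Latency is measured from creation time, so time spent queued
        # behind earlier requests is included.
        latencies.append(time.perf_counter() - created_at)

async def main() -> list:
    queue = asyncio.Queue()  # one queue per client within an actor
    latencies = []
    # Enqueue two requests at once: the second must wait in the queue
    # while the first is being serviced.
    for i in range(2):
        queue.put_nowait((time.perf_counter(), f"request-{i}"))
    queue.put_nowait((None, None))
    await client(queue, latencies)
    return latencies

latencies = asyncio.run(main())
```

Here the second request's latency is roughly twice the first's, because it includes about one service-time's worth of queue wait, which is the effect the latency metric captures.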
If you're interested in understanding this in more detail, take a look at these files for a start: