Hi @Lasse_Nedergaard ,
Since Rally keeps track of the number of documents it indexes, the throughput measured on the client side and on the server side (the Elasticsearch ingest rate) will be the same.
Your question suggests you suspect a client-side (or network) bottleneck. We typically don't worry about this too much (in real-world scenarios, composing bulk requests also takes some amount of time) unless:
- The data generation code is in rough shape and needs optimization, OR
- The client (Rally) machine is not powerful enough to generate load at the desired rate
If you are using a persistent data store (which is recommended), you can explore the results in `rally-metrics-*`: filter for documents where the `name` field is `latency` and the `task` field is `bulk` (or whatever you have named your bulk task), then compare the `meta.took` field to the `value` field. Both are expressed in milliseconds, so the difference gives you a rough estimate of the latency overhead added by your client and network. That in turn helps you assess whether you need to optimize your track code or upgrade your client machine.
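As a sketch of that comparison in Python: the query body below filters `rally-metrics-*` documents by `name` and `task` (run it with any Elasticsearch client), and the sample hits are hypothetical documents shaped like Rally's metrics records, just to illustrate the arithmetic.

```python
# Query body for rally-metrics-* documents where name is "latency"
# and task is "bulk" (pass this to your Elasticsearch client's search call).
query = {
    "query": {
        "bool": {
            "filter": [
                {"term": {"name": "latency"}},
                {"term": {"task": "bulk"}},
            ]
        }
    }
}

def client_overhead_ms(hit):
    """Client/network overhead: Rally's measured latency minus the
    server-side took. Both `value` and `meta.took` are in milliseconds."""
    src = hit["_source"]
    return src["value"] - src["meta"]["took"]

# Hypothetical sample hits, shaped like rally-metrics-* documents:
sample_hits = [
    {"_source": {"name": "latency", "task": "bulk",
                 "value": 250.0, "meta": {"took": 180}}},
    {"_source": {"name": "latency", "task": "bulk",
                 "value": 300.0, "meta": {"took": 210}}},
]

overheads = [client_overhead_ms(h) for h in sample_hits]
avg_overhead = sum(overheads) / len(overheads)
print(overheads)      # per-request client/network overhead in ms
print(avg_overhead)   # average overhead in ms
```

If the average overhead is a large fraction of the total latency, that points at the client or network rather than Elasticsearch itself.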
Please let us know if this helps!