Benchmarking cluster with rally

dliappis · July 26, 2021, 5:00pm

Assuming that the shard settings got applied correctly, here are a few thoughts.

The workload that you've chosen (metricbeat) is quite small and very compressible. What was the duration of your benchmark? It's very likely it was very short.

For evaluating indexing throughput, you probably want to use something larger that more closely resembles the workload you are after. You can e.g. use the http_logs track for something more logs oriented, pmc if you are after large docs, or nyc_taxis for a rather large corpus. For most tracks there are README pages describing the workload, e.g. rally-tracks/http_logs at master · elastic/rally-tracks · GitHub
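As a rough illustration of switching tracks, here is a minimal sketch that assembles an `esrally race` command line for the http_logs track against an already running cluster. The host `localhost:9200` is a placeholder, the helper name `rally_race_command` is made up for this example, and the `number_of_shards` track parameter is an assumption that should be checked against the track's README before use.

```python
import shlex


def rally_race_command(track, target_hosts, track_params=None):
    """Build an esrally command that benchmarks an existing cluster.

    Hypothetical helper for illustration only; it builds the argument
    list and does not run Rally.
    """
    cmd = [
        "esrally",
        "race",
        f"--track={track}",
        f"--target-hosts={','.join(target_hosts)}",
        # benchmark-only: Rally does not provision the cluster itself
        "--pipeline=benchmark-only",
    ]
    if track_params:
        # Track parameters are passed as comma-separated key:value pairs.
        # Which keys a track accepts is documented in its README.
        params = ",".join(f"{key}:{value}" for key, value in track_params.items())
        cmd.append(f"--track-params={params}")
    return cmd


if __name__ == "__main__":
    # Placeholder host and an assumed track parameter; adjust to your setup.
    command = rally_race_command(
        "http_logs",
        ["localhost:9200"],
        track_params={"number_of_shards": 3},
    )
    print(shlex.join(command))
```

Printing the command instead of executing it makes it easy to review the flags before starting a long-running race.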

You could also have a bottleneck elsewhere, e.g. on your load driver.
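One quick, rough way to check the load driver is to look at its load average per CPU core while the benchmark runs. The sketch below assumes a Unix-like load driver machine; the function names and the 0.8 threshold are arbitrary choices for illustration, not values recommended by Rally.

```python
import os


def load_per_core(load_avg, cores):
    """Return the load average normalised by the number of CPU cores."""
    return load_avg / cores


def looks_saturated(load_avg, cores, threshold=0.8):
    """Heuristic: flag the machine when load per core reaches the threshold.

    The default threshold is an arbitrary illustrative value.
    """
    return load_per_core(load_avg, cores) >= threshold


if __name__ == "__main__":
    # os.getloadavg() is only available on Unix-like systems.
    if hasattr(os, "getloadavg"):
        one_minute, _, _ = os.getloadavg()
        cores = os.cpu_count() or 1
        print(f"1-minute load per core: {load_per_core(one_minute, cores):.2f}")
        if looks_saturated(one_minute, cores):
            print("The load driver may be the bottleneck.")
    else:
        print("Load average is not available on this platform.")
```

Run this on the load driver during the benchmark. A consistently high value suggests the load driver, rather than the cluster, is limiting throughput, though it is only a coarse signal.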

I recommend taking a systematic approach: make sure that you are running valid benchmarks, then work out where the bottleneck is. There is a great tutorial linked in the Rally docs page that you can find here: The Seven Deadly Sins of Elasticsearch Benchmarking | Elastic or here: Benchmarking Elasticsearch with Rally by Daniel Mitterdorfer | Search Meetup Munich - YouTube
