Shard sizing charts

Phandora · April 24, 2020, 3:17pm

Hello!

Recently we have started to benchmark our Elasticsearch cluster using Rally. We have designed a custom workload to be able to find the desired shard size for us.

We are really interested in generating a chart like the one shown in the Elasticon Quantitative Cluster Sizing (24:30 min)

I do not mind if it is generated by Rally or manually from Kibana. Could you help me with this matter?

Best regards.

Christian_Dahlqvist · April 25, 2020, 11:06am

For those charts I used a different benchmarking tool as Rally at that point was not able to handle this, and the dashboards were created in Kibana 4, which was relatively new back then .

Rally has changed and evolved a lot since then but does not provide anything like this out of the box, so you will need to create them yourself. Fortunately Kibana has gotten a lot more advanced and flexible over the years so I am sure you will be able to create something a lot nicer.

As long as you have an Elasticsearch instance set up as a metrics store for Rally, it should be possible to generate these type of dashboards. In the metrics store you have results for every individual request, so as long as you have applied tags and metadata that allow you to identify which iteration results belong to it should be relatively easy to create a saved search with a filter and create these kind of visualizations.

hilo21 · May 13, 2020, 9:53pm

Hello Christian,

I saw your talk, loved it, and I am running some tests on our own data as you guys suggested. I used an All-in-One instance and indexed logging data with one shard and no replicas and started changing some parameters and running some tests based on whether we enabled best compression or not. I now have over 70M documents and I wish i can generate visualizations from my indexes to see when do I hit a limit for my shard sizing.

Can you please provide an example on what fields you based yours on or how to setup this kind of visuals ?

Thank you

Christian_Dahlqvist · May 14, 2020, 5:27am

When I created the rally-eventdata-track I included a challange showing an example of how to run shard sizing. It seems this was removed during a cleanup of the track recently. In this challenge I simply ran X indexing iterations of a fixed number of documents and the followed on by running a set of queries. For each operation I set the iteration number as a metadata field. This allowed me to create a saved query showing service-time for the queries. I then created a simple histogram based on the iteration number showing the average latency per type of query.

system · June 11, 2020, 5:30am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Benchmarking ES using Rally store size metric increase and decrease Elasticsearch rally	3	578	January 11, 2022
How can I know kibana dashboard latency, max indexing tps, data volume queried per time etc Elasticsearch	2	366	December 16, 2022
Change thread pool search queue_size? yes or not? Elasticsearch	13	4175	October 4, 2017
Prebuilt dashboards for esrally Elasticsearch rally	5	1001	July 21, 2021
Visualization in Dashboard Kibana	8	2139	July 6, 2017

Shard sizing charts

Related topics