Hey,
I looking at doing some bench marking for different ElasticSearch cluster configurations
I've seen this: https://github.com/elastic/rally
It says that when you kick it off it downloads the benchmark data, pulls down the latest Elastic version and executes the benchmark.
I want to run the benchmark against an existing ES cluster configured one way, then run it again against a clsuter configured a different way. Is it possible to do this with Rally?
this is kind of doable but not so convenient. The situation is similar to what we had with tracks a few weeks ago. You have basically two options:
Install Rally in development mode, open car.py and configure the "cars" you need (we call an Elasticsearch configuration a "car"). Note that this is currently limited but we'll eventually allow you to configure it much more flexible.
Don't have Rally provision your cluster but instead to it manually and point Rally to your cluster by using the pipeline benchmark-only
One small addition: If you use the second approach with the pipeline benchmark-only (which seems to be more likely to me in your scenario), please ensure that you use Rally 0.3.1. (released yesterday) as I've fixed a related bug there.
Hey,
Thanks for the information. Option 2 looks like it might work well for us.
Will this work with older versions of ElasticSearch? We currently use ES version: 2.2.0
yes, if you use the standard "tracks" (benchmarks). We have created them in a way so they support all versions between ES 1.7 and the latest 5.0 builds.
I'm always using an external cluster (benchmark-only) and in between races I change the deployment parameters (which destroys the old master/data/client nodes and creates new nodes)...
I'm able to list the races, comparing 2 races completes "successfully" - but it doesn't show any metrics... please let me know what more information you need?
sure, I didn't want to flood the forum with similar topics, I thought this was a follow on to testing external clusters ~ running tournaments for external clusters.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.