Was hoping to utilize rally to perform this.
We are looking to benchmark ingestion, keyword vs text types effects on different fields, and generate enough data into our dev cluster to more accurately simulate the data in our prod cluster.
@lance.zukel Oh Welcome to the community... If you specifically want to use Rally I would open a Topic with Rally in the Subject line (or edit this one), there are folks that know about Rally (not me )
Oh as a Rally developer I noticed (the tag is indeed enough) but we don't have any tool to generate data in Rally itself, Rally only works with data that is already generated.
Can you write a script to generate the data you need?
This would be a great feature to have in rally.
I think it would be very helpful for users to be able to specify a document template, some array of random/pseudo random values, and the number of documents to be generated.
This would allow simulation/benchmarking of documents/fields that are specific to a particular deployment, thus better replicating actual workloads.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.