Yeah, I saw the YouTube video about the 7 sins of benchmarking (which is why I was surprised that disabling the OS page cache would be a good idea).
I definitely agree with you about the methodology. I just feel I am stuck on something very basic.
I am rerunning the command you sent:
esrally --pipeline=benchmark-only --track=eventdata --track-repository=eventdata --challenge=index-and-query-logs-fixed-daily-volume --track-params=bulk_size:10000,bulk_indexing_clients:16 --target-hosts=my_ip:9200 --client-options="timeout:240" --on-error=abor
I am not running Rally on the same machine as Elasticsearch (as you mentioned, that is a big antipattern).
Both machines run on Google Cloud:
ES machine: 16 vCores, 64 GB RAM, 1 TB disk.
Rally client machine: 8 vCores, 32 GB RAM, 1 TB disk.
I've got exactly the same statistics:
avg-cpu:  %user   %nice %system %iowait  %steal   %idle
           8.77    0.00    1.18    5.10    0.00   84.95
Device   r/s      w/s  rMB/s  wMB/s  rrqm/s   wrqm/s  %rrqm  %wrqm  r_await  w_await  aqu-sz  rareq-sz  wareq-sz  svctm  %util
sda     0.00     0.00   0.00   0.00    0.00     0.00   0.00   0.00     0.00     0.00    0.00      0.00      0.00   0.00   0.00
sdb     0.00  3803.00   0.00  41.84    0.00  5772.00   0.00  60.28     0.00     0.63    1.55      0.00     11.27   0.19  71.20
The key numbers (for sdb):
%iowait - 5.10
wMB/s - 41.84
w/s - 3803.00
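As a sanity check on the sdb figures, the write throughput and the write rate together imply an average write request size, which should agree with the wareq-sz column of the iostat output. A minimal sketch of that arithmetic (the function name is only illustrative, and it assumes iostat reports MB as 1024 KB):

```python
def avg_write_size_kb(writes_per_s, write_mb_per_s):
    """Average size of a single write request in KB.

    Assumes 1 MB = 1024 KB, which is an assumption about how
    iostat scales its wMB/s column.
    """
    if writes_per_s == 0:
        return 0.0  # no writes, so no meaningful request size
    return write_mb_per_s * 1024 / writes_per_s


# Values for sdb taken from the iostat output above.
print(round(avg_write_size_kb(3803.00, 41.84), 2))  # 11.27
```

The result matches the reported wareq-sz of 11.27, so the disk is handling many small writes (about 11 KB each) rather than a few large ones.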
At roughly 30% of the run I got an error and the job was aborted:
[ERROR] Cannot race. Error in load generator [2]
No matching data found for field '@timestamp' in pattern 'elasticlogs-*'.
Getting further help:
The log file contains many exceptions like this one:
AssertionError: No matching data found for field '@timestamp' in pattern 'elasticlogs-*'.
2020-09-15 19:17:25,798 ActorAddr-(T|:44165)/PID:12740 esrally.driver.driver ERROR Could not execute schedule
Traceback (most recent call last):
  File "/usr/local/lib/python3.6/dist-packages/esrally/driver/driver.py", line 1095, in __call__
    total_ops, total_ops_unit, request_meta_data = execute_single(runner, self.es, params, self.abort_on_error)
  File "/usr/local/lib/python3.6/dist-packages/esrally/driver/driver.py", line 1133, in execute_single
    return_value = runner(es, params)
  File "/usr/local/lib/python3.6/dist-packages/esrally/driver/runner.py", line 188, in __call__
    return self.delegate(*args)
  File "/usr/local/lib/python3.6/dist-packages/esrally/driver/runner.py", line 236, in __call__
    return self.delegate(self.client_extractor(args[0]), *args[1:])
  File "/home/user/.rally/benchmarks/tracks/eventdata/eventdata/runners/fieldstats_runner.py", line 146, in fieldstats
    raise AssertionError("No matching data found for field '{}' in pattern '{}'.".format(field_name, index_pattern))
AssertionError: No matching data found for field '@timestamp' in pattern 'elasticlogs-*'.
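To check independently of Rally whether the elasticlogs-* indices hold any @timestamp values at that point, something like the sketch below could help. It is only an assumption that the fieldstats runner looks at the minimum and maximum of the field; the helper names and the aggregation names (min_ts, max_ts) are made up for illustration. The body is a plain Elasticsearch search with min and max aggregations, which return a null value when no documents match.

```python
import json


def timestamp_range_query(field="@timestamp"):
    """Build a search body asking for the min and max of a field.

    "min_ts" and "max_ts" are arbitrary aggregation names chosen here.
    """
    return {
        "size": 0,  # only the aggregations are needed, not the hits
        "aggs": {
            "min_ts": {"min": {"field": field}},
            "max_ts": {"max": {"field": field}},
        },
    }


def has_timestamp_data(response):
    """Return True if both aggregations in a search response have a value."""
    aggs = response.get("aggregations", {})
    min_value = aggs.get("min_ts", {}).get("value")
    max_value = aggs.get("max_ts", {}).get("value")
    return min_value is not None and max_value is not None


if __name__ == "__main__":
    # Print the body so it can be sent to <host>:9200/elasticlogs-*/_search.
    print(json.dumps(timestamp_range_query(), indent=2))
```

If the response to that search fails has_timestamp_data, the indices matching the pattern are either missing or empty, which would at least be consistent with the assertion above.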
Any ideas what is wrong?