Hi
I am using Rally's existing tracks to perform benchmarking. I noticed that nyc_taxis is the largest track, with 4.5 GB of compressed and 74.3 GB of uncompressed documents. I want to test with a larger data volume.
I have used the following trick: I duplicated the nyc_taxis document corpus ten times (note the index_count variable at the top of the track file):
{% set index_count = 10 %}
{
  "version": 2,
  "description": "Taxi rides in New York in 2015",
  "indices": [
    {% set comma = joiner() %}
    {% for item in range(index_count) %}
    {{ comma() }}
    {
      "name": "nyc_taxis-{{item}}",
      "body": "index.json",
      "types": [ "type" ],
      "auto-managed": false
    }
    {% endfor %}
  ],
  "corpora": [
    {
      "name": "nyc_taxis",
      "base-url": "http://benchmarks.elasticsearch.org.s3.amazonaws.com/corpora/nyc_taxis",
      "documents": [
        {% set comma = joiner() %}
        {% for item in range(index_count) %}
        {{ comma() }}
        {# each generated index ingests the full corpus; document-count is the count from the original nyc_taxis track #}
        {
          "source-file": "documents.json.bz2",
          "document-count": 165346692,
          "target-index": "nyc_taxis-{{item}}"
        }
        {% endfor %}
      ]
    }
  ]
}
I got this trick from the link below.
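For context, I run the modified track with a command along these lines (the track path is just an example from my setup; exact flags may differ by Rally version):

esrally --track-path=~/rally-tracks/nyc_taxis_x10 --pipeline=benchmark-only --target-hosts=127.0.0.1:9200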
My concern is: when I use this trick, Elasticsearch ends up with 10 different indices, but they are indexed sequentially, one after the other.
How can I make the indexing of these indices run in parallel, so that the CPU is utilized optimally?
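From the Rally docs I can see there is a parallel element for the challenge schedule. Is something like the following sketch the right direction? The bulk-size of 5000 and 2 clients per index are values I made up, and I am not sure whether the per-task indices filter is the correct way to split the corpus across tasks:

"challenges": [
  {
    "name": "index-in-parallel",
    "default": true,
    "schedule": [
      {
        "parallel": {
          "tasks": [
            {% set comma = joiner() %}
            {% for item in range(index_count) %}
            {{ comma() }}
            {
              "operation": {
                "operation-type": "bulk",
                "indices": ["nyc_taxis-{{item}}"],
                "bulk-size": 5000
              },
              "clients": 2
            }
            {% endfor %}
          ]
        }
      }
    ]
  }
]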