[ESRally Benchmarks] Search throughput changes every time ES restarts

kakka · March 30, 2019, 8:19am

I am using Rally to benchmark ES with the geonames track. Every time ES starts, I run several match_all tests to warm up and record the best throughput. The throughput varies between 45 and 200 across different test runs. Here is the environment configuration.

Rally

version 1.0.0

operations

{
      "name": "default",
      "operation-type": "search",
      "body": {
        "query": {
          "match_all": {}
        }
      }
},

challenges

      "schedule": [
        {
          "operation": "default",
          "clients": 4,
          "warmup-iterations": 500,
          "iterations": 1000,
          "target-throughput": 210
        }
      ]

Hardware & OS

Intel(R) Xeon(R) Gold 6148 CPU @ 2.40GHz
Memory: 100 GB per NUMA node
SSD
Linux 3.10.0
ES is bound to 4 cores using numactl

JVM

-Xms8g
-Xmx8g
-XX:NewRatio=2
-XX:+UseConcMarkSweepGC

ES

version 6.2.3

$ curl -XGET 'http://localhost:9200/_cluster/stats?pretty'
{
  "_nodes" : {
    "total" : 1,
    "successful" : 1,
    "failed" : 0
  },
  "cluster_name" : "elasticsearch",
  "timestamp" : 1553929322953,
  "status" : "green",
  "indices" : {
    "count" : 1,
    "shards" : {
      "total" : 5,
      "primaries" : 5,
      "replication" : 0.0,
      "index" : {
        "shards" : {
          "min" : 5,
          "max" : 5,
          "avg" : 5.0
        },
        "primaries" : {
          "min" : 5,
          "max" : 5,
          "avg" : 5.0
        },
        "replication" : {
          "min" : 0.0,
          "max" : 0.0,
          "avg" : 0.0
        }
      }
    },
    "docs" : {
      "count" : 10320000,
      "deleted" : 0
    },
    "store" : {
      "size_in_bytes" : 2795283680
    },
...

zqc0512 · April 1, 2019, 5:44am

Change the bulk size and try again.

kakka · April 1, 2019, 7:09am

Does bulk size play any role in this test?

dliappis · April 1, 2019, 7:13am

Hello,

I am not sure what the question is here. If it is why the query throughput varies between 45 and 200 and is not stable, my first guess would be that the benchmark simply overstresses (some aspect of) the cluster, making it unstable.

When performing throughput benchmarks, it is highly recommended to check not only whether the target throughput is achieved (if it is never achieved, that is the first red flag: the benchmark is invalid and a lower target throughput needs to be used) but, equally importantly, the service time and latency. If latency starts growing, it indicates that Elasticsearch is taking longer to service your requests (service_time) than would be required to satisfy your target throughput.
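To make the latency versus service time distinction concrete, here is a minimal simulation sketch, not Rally's actual scheduler. It assumes the target throughput is split evenly across clients, that each client has at most one request in flight, and that service time is constant. The function name `simulate_client` and the 10 ms and 90 ms service times are hypothetical values chosen for illustration.

```python
def simulate_client(target_throughput, clients, service_time_s, iterations):
    """Return per-request latencies (seconds) for one throttled client.

    Assumptions (illustrative only): the target throughput is divided
    evenly among clients, each client sends one request at a time, and
    every request takes exactly service_time_s to complete.
    """
    # Seconds between two scheduled requests of the same client.
    interval = clients / target_throughput
    latencies = []
    free_at = 0.0  # time at which the client can send its next request
    for i in range(iterations):
        scheduled = i * interval          # when the request should have been sent
        start = max(scheduled, free_at)   # it waits if the client is still busy
        end = start + service_time_s
        latencies.append(end - scheduled)  # latency = waiting time + service time
        free_at = end
    return latencies


# 4 clients and a target of 210 ops/s, as in the schedule posted above.
fast = simulate_client(210, 4, 0.010, 1000)  # hypothetical 10 ms service time
slow = simulate_client(210, 4, 0.090, 1000)  # hypothetical 90 ms service time

print(f"fast: first {fast[0] * 1000:.1f} ms, last {fast[-1] * 1000:.1f} ms")
print(f"slow: first {slow[0] * 1000:.1f} ms, last {slow[-1] * 1000:.1f} ms")
```

With the fast service time, latency stays equal to service time because every request finishes before the next one is due. With the slow one, each request starts later than scheduled, so latency keeps growing for the whole run even though service time is unchanged.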

Rgs,
Dimitris

kakka · April 1, 2019, 8:54am

When the throughput is 40, raising the target throughput to 60 does not increase the achieved throughput.
Here are flame graphs for the different throughputs.

throughput: 44

[Image: throughput-44.png]
throughput: 197

[Image: throughput-197.png]
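As a rough back-of-the-envelope check on the two runs above, the sketch below estimates the mean service time implied by each observed throughput. It assumes the 4 clients were saturated (each always had exactly one request in flight), which is only plausible when the target throughput is not reached. The helper name `implied_service_time_ms` is hypothetical.

```python
CLIENTS = 4              # from the posted schedule
TARGET_THROUGHPUT = 210  # ops/s, from the posted schedule


def implied_service_time_ms(throughput_ops_s, clients=CLIENTS):
    """Mean service time in ms if `clients` saturated clients achieve this throughput.

    With one in-flight request per client, throughput = clients / service_time,
    so service_time = clients / throughput.
    """
    return clients / throughput_ops_s * 1000.0


# Time budget per request that each client has at the target throughput.
budget_ms = implied_service_time_ms(TARGET_THROUGHPUT)
print(f"budget at target {TARGET_THROUGHPUT} ops/s: {budget_ms:.1f} ms per request")

for observed in (44, 197):
    print(f"observed {observed} ops/s -> about {implied_service_time_ms(observed):.1f} ms per request")
```

Under these assumptions, the run at 197 ops/s is only slightly over the per-request budget, while the run at 44 ops/s implies requests taking several times longer. That would point at service time rather than the Rally schedule as the thing that changes between restarts.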

kakka · April 26, 2019, 2:35am

Does anybody have an idea about this problem?

system · May 24, 2019, 2:35am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
