High I/O usage on large Elasticsearch Instance

Nisar_Adappadathil1 · September 9, 2021, 8:22pm

Hi

We have a elasticsearch-6.4 cluster with 100 nodes
we were using AWS instances of type (8 core CPU & 15 GB memory), now we are migrating to (48 core 384 GB memory)
The issue here is that now we have half number of instances in old type & half in new type, we can see the load (io usage %) on new instance is a lot higher even though its capacity is higher.
we were checking the iostat from the elastic search instances and comparing old and new instance types.

For now, we restart the high load usage instances, the shards would reassigned for few hour load would be less but again the load would come back to 80-100
what would be the reason behind this, appreciate any help.

Thanks

warkolm · September 9, 2021, 11:32pm

Welcome to our community!
6.4 is EOL, please upgrade ASAP.

What is the output from the _cluster/stats?pretty&human API?

Nisar_Adappadathil1 · September 10, 2021, 9:12pm

We were originally having 500+ nodes of the small size ( 8 core CPU & 15 GB memory) and the goal was to go down to 150 instances by using the larger host ( 48 core 384 GB memory )

we have 650TB data, 9000+ shards
avg write TPS - 1300 per second
avg read TPS - 2500 per second

So here how do I know the high IO usage % is because of an incorrect elastic search configuration or its a limitation of the instance type?

_cluster/stats output

{
  "_nodes" : {
    "total" : 386,
    "successful" : 386,
    "failed" : 0
  },
  "cluster_name" : "mycluster",
  "timestamp" : 1631282601562,
  "status" : "green",
  "indices" : {
    "count" : 78,
    "shards" : {
      "total" : 9700,
      "primaries" : 4850,
      "replication" : 1.0,
      "index" : {
        "shards" : {
          "min" : 2,
          "max" : 700,
          "avg" : 124.35897435897436
        },
        "primaries" : {
          "min" : 1,
          "max" : 350,
          "avg" : 62.17948717948718
        },
        "replication" : {
          "min" : 1.0,
          "max" : 1.0,
          "avg" : 1.0
        }
      }
    },
    "docs" : {
      "count" : 534114907871,
      "deleted" : 131472431078
    },
    "store" : {
      "size" : "648.9tb",
      "size_in_bytes" : 713527348139452
    },
    "fielddata" : {
      "memory_size" : "8.4mb",
      "memory_size_in_bytes" : 8908840,
      "evictions" : 0
    },
    "query_cache" : {
      "memory_size" : "2.7tb",
      "memory_size_in_bytes" : 3012866611322,
      "total_count" : 2570605016989,
      "hit_count" : 228375108353,
      "miss_count" : 2342229908636,
      "cache_size" : 54326426,
      "cache_count" : 3148666271,
      "evictions" : 3094339845
    },
    "completion" : {
      "size" : "0b",
      "size_in_bytes" : 0
    },
    "segments" : {
      "count" : 332846,
      "memory" : "858.4gb",
      "memory_in_bytes" : 921712993891,
      "terms_memory" : "557.5gb",
      "terms_memory_in_bytes" : 598697432174,
      "stored_fields_memory" : "255.9gb",
      "stored_fields_memory_in_bytes" : 274852502984,
      "term_vectors_memory" : "0b",
      "term_vectors_memory_in_bytes" : 0,
      "norms_memory" : "218.3mb",
      "norms_memory_in_bytes" : 228978304,
      "points_memory" : "41.9gb",
      "points_memory_in_bytes" : 45044298917,
      "doc_values_memory" : "2.6gb",
      "doc_values_memory_in_bytes" : 2889781512,
      "index_writer_memory" : "20.1gb",
      "index_writer_memory_in_bytes" : 21619911301,
      "version_map_memory" : "31.6mb",
      "version_map_memory_in_bytes" : 33167020,
      "fixed_bit_set" : "309.7gb",
      "fixed_bit_set_memory_in_bytes" : 332574300512,
      "max_unsafe_auto_id_timestamp" : -1,
      "file_sizes" : { }
    }
  },
  "nodes" : {
    "count" : {
      "total" : 386,
      "data" : 347,
      "coordinating_only" : 36,
      "master" : 3,
      "ingest" : 0
    },
    "versions" : [
      "6.4.1"
    ],
    "os" : {
      "available_processors" : 11016,
      "allocated_processors" : 11016,
      "names" : [
        {
          "name" : "Linux",
          "count" : 386
        }
      ],
      "mem" : {
        "total" : "81.8tb",
        "total_in_bytes" : 89957789696000,
        "free" : "2.7tb",
        "free_in_bytes" : 3068981411840,
        "used" : "79tb",
        "used_in_bytes" : 86888808284160,
        "free_percent" : 3,
        "used_percent" : 97
      }
    },
    "process" : {
      "cpu" : {
        "percent" : 2470
      },
      "open_file_descriptors" : {
        "min" : 8926,
        "max" : 12640,
        "avg" : 10746
      }
    },
    "jvm" : {
      "max_uptime" : "48d",
      "max_uptime_in_millis" : 4153686531,
      "versions" : [
        {
          "version" : "1.8.0_302"
        },
        {
          "version" : "1.8.0_242"
        }
      ],
      "mem" : {
        "heap_used" : "13.8tb",
        "heap_used_in_bytes" : 15240308363208,
        "heap_max" : "28tb",
        "heap_max_in_bytes" : 30870800171008
      },
      "threads" : 169437
    },
    "fs" : {
      "total" : "5.4pb",
      "total_in_bytes" : 6123033514786816,
      "free" : "4.8pb",
      "free_in_bytes" : 5405924939575296,
      "available" : "4.5pb",
      "available_in_bytes" : 5098669333790720
    }
  }
}

Christian_Dahlqvist · September 10, 2021, 10:11pm

It looks like you are running with very large heaps on the large nodes. This is not recommended and can lead to very long GC pauses that may make nodes appear offline and case shard relocation. In general the heap is recommended to be set yo 30GB or lower to benefit from compressed pointers, especially on older versions of Elasticsearch like you are using. Look in the logs on the new nodes for evidence of long GC and see if this correlates with nodes dropping out of the cluster.

warkolm · September 13, 2021, 12:41am

Plus I'd also suggest that that cluster is way too large, you should consider splitting it up and using CCS to query between clusters.

Nisar_Adappadathil1 · September 15, 2021, 8:37am

So the option i have is

Split into smaller clusters and do CCS
Setting heap to less than 30GB

We already checked on GC waits and we dont have long GC waits

What we have is high load on the big instances compared to the small instances, there would be a specific reason for this right.

Christian_Dahlqvist · September 15, 2021, 9:11am

What type and size of storage are you using on the different nodes?

Nisar_Adappadathil1 · September 15, 2021, 9:36am

8 core CPU & 15 GB memory is the old type one & now we are migrating to 48 core 384 GB memory

Currently our cluster 50% of old type instances & 50% new type instances

High load is always associated with the new type instances

Nisar_Adappadathil1 · October 6, 2021, 2:42pm

now we have completely migrated to the new instance type 48 core 384 GB memory
We are seeing sudden spike in loads of the hosts & the latency also increases

after going through the _cat/segments api I could see that 22% of our segments are marked searchable = false also 22% committed = false

we have a total of 400k segments & 5000 shards for our 70 indices

is this normal?

we can see a jump in Write latency every now and then, also on our hosts iostat param - rkb_s is very high.

How could we fix this issue?
Is the problem with larger segments?
whats the issue with larger segments? We have enough memory on our hosts

system · November 3, 2021, 2:42pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Primary balancing Elasticsearch	17	636	July 6, 2017
Elasticsearch on EC2 : load-average problem Elasticsearch	8	986	July 6, 2017
Investigating elasticsearch load issues Elasticsearch	9	1295	July 6, 2017
High cpu usage on large ec2 nodes Elasticsearch	5	1933	July 6, 2017
Quick question on which Amazon instance type to use Elasticsearch	1	378	July 6, 2017

High I/O usage on large Elasticsearch Instance

Related topics