Elasticsearch 2.3.3 Cluster Disk High/Low Watermark Net Effect Is Ambiguous

We are running into some problems with Elasticsearch's disk-based shard allocation. Mainly, the low disk watermark never gets logged, and the high disk watermark seems to be ignored! This only seems to be a problem when we are using a single-node cluster; relocation works just fine once we add another system to the cluster.

We can freely delete extraneous shards (sub-1 MB) using kopf/sense, and they will promptly get recreated (contrary to the documentation, which says new shards cannot be allocated past the watermark).

The strangest part of the whole process is that the high-watermark breach is being logged correctly:

[2017-02-22 15:28:02,129][WARN ][cluster.routing.allocation.decider] [es-1] high disk watermark [2%] exceeded on [cVPjveSMQeS6d0oxq8axZA][es-1][/opt/evertz/insite/parasite/applications/es-1/data/Development-cluster/nodes/0] free: 27.1gb[49.3%], shards will be relocated away from this node

Does anyone have some information regarding the expected behavior of the settings mentioned at https://www.elastic.co/guide/en/elasticsearch/reference/2.3/disk-allocator.html and how they are supposed to interact with a single node cluster?

We have tried this with both absolute watermarks (in MB) and percentage watermarks.
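For reference, percentage watermarks compare against *used* disk space, which is consistent with the log line above: 49.3% free means 50.7% used, well past a 2% high watermark. A minimal sketch of that comparison (`exceeds_watermark` is a hypothetical helper for illustration, not Elasticsearch code):

```python
def exceeds_watermark(free_pct, watermark_pct):
    """Watermark percentages refer to *used* disk space: the mark is
    exceeded once (100 - free%) rises above the configured value."""
    used_pct = 100.0 - free_pct
    return used_pct > watermark_pct

# Numbers from the log line above: 49.3% free vs. a 2% high watermark.
print(exceeds_watermark(49.3, 2))   # True  -> the WARN log is expected
print(exceeds_watermark(49.3, 60))  # False -> a 60% mark would not trip yet
```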

Our _cluster/settings:

{
  "persistent": {
    "cluster": {
      "routing": {
        "allocation": {
          "disk": {
            "include_relocations": "true",
            "threshold_enabled": "true",
            "watermark": {
              "low": "1%",
              "high": "2%"
            }
          }
        }
      },
      "info": {
        "update": {
          "interval": "10s"
        }
      }
    }
  },
  "transient": {}
}

Hey,

just to understand your question fully: which behaviour do you expect from the watermark checks when you only have a single node, so no rebalancing can happen anyway?

--Alex

Hello,

The expected result on a single-node cluster would be: no new indices once the low watermark is reached (with existing indices still accepting new data), and no new data accepted at all once the high watermark is reached.

The reason is that we have long-term mass-storage use cases (infrequent access) where we never want to fill more than 95% of the disk, so that other applications, such as our Node.js curator, can compute stale indices without consuming massive amounts of memory.

As it stands right now, we have watched Elasticsearch write to the final cylinder of our disk and then peg the system as it tries to write more.

Any thoughts?

  • Troy Heanssgen

Hey,

checking the source, it is explicitly mentioned that in the case of a single node the disk threshold decider is disabled and allocation is always allowed.

See https://github.com/elastic/elasticsearch/blob/master/core/src/main/java/org/elasticsearch/cluster/routing/allocation/decider/DiskThresholdDecider.java#L373
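In other words, the check boils down to something like this (a Python paraphrase of the behaviour described above, not actual Elasticsearch code; the function name is made up):

```python
def disk_threshold_applies(data_node_count, threshold_enabled=True):
    """Paraphrase of the linked decider logic: with a single data node
    there is nowhere to relocate shards to, so the disk threshold
    decider steps aside and allocation is always allowed."""
    if not threshold_enabled:
        return False
    return data_node_count > 1

print(disk_threshold_applies(1))  # False: single node, watermarks are ignored
print(disk_threshold_applies(2))  # True: watermarks are enforced
```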

--Alex

Hello,

First of all, thank you for the response. So if I understand correctly, there is absolutely no way, outside of manual/automatic external management, to limit the disk space Elasticsearch uses?

The source seems to agree with your conclusion; however, it seems fairly silly to let a database fill the entire disk just because it has no other node in the cluster to talk to.

Apart from recompiling the source, is there a different sane way to limit disk usage in ElasticSearch?
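For the record, the external-management fallback mentioned above would look something like this: a cron-driven watchdog that sets `index.blocks.read_only` once free space gets low. This is only a sketch; the URL, data path, and 5% threshold are assumptions, and `should_block`/`set_read_only` are hypothetical helpers, not an Elasticsearch API:

```python
import json
import os
import urllib.request

ES_URL = "http://localhost:9200"      # assumption: local single-node cluster
DATA_PATH = "/var/lib/elasticsearch"  # assumption: the data directory to watch
MIN_FREE_RATIO = 0.05                 # assumption: block writes below 5% free

def should_block(free_bytes, total_bytes, min_free_ratio=MIN_FREE_RATIO):
    """Pure check: True once free space drops below the configured ratio."""
    return free_bytes < total_bytes * min_free_ratio

def set_read_only(block):
    """Set index.blocks.read_only on all indices via the settings API."""
    body = json.dumps({"index.blocks.read_only": block}).encode()
    req = urllib.request.Request(ES_URL + "/_all/_settings",
                                 data=body, method="PUT")
    urllib.request.urlopen(req)

# Cron-style usage sketch:
# st = os.statvfs(DATA_PATH)
# if should_block(st.f_bavail * st.f_frsize, st.f_blocks * st.f_frsize):
#     set_read_only(True)
```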

  • Troy Heanssgen

Hey,

I opened https://github.com/elastic/elasticsearch/issues/23395

--Alex

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.