Elasticsearch using lots of CPU and disk, flipping to read-only during heavy use

dpitchford · October 24, 2023, 7:01pm

I am trying to understand behavior I am seeing from Elasticsearch 6.8 on a production site. During a period of heavy use (indexing large numbers of documents), Prometheus metrics are showing me that ES is using large amounts of CPU and a steadily increasing amount of disk space, but memory usage is only slightly elevated over the norm. In the ES logs, I see many messages like this:

[2023-10-20T15:15:01,932][INFO ][o.e.i.IndexingMemoryController] [elasticsearch-master-0] now throttling indexing for shard [[data_explorer][1]]: segment writing can't keep up

This continues until disk usage hits 95%, at which point the index is set to read-only / allow delete mode as described here and the remaining index requests fail. Disk usage then falls to a level slightly below that from before the indexing started. There are no error messages in the process that is invoking ES until the index is set to read-only mode. After this, the index has to be manually reset out of read-only mode.

I have three main questions:

Why does disk usage drop back to its original level after flipping to read-only mode?
Why does the read-only setting on the index have to be reset manually if the doc linked above says "the index block is automatically released when the disk utilization falls below the high watermark"?
What does the throttling seen in the ES logs mean, and what is causing it? Do the ES docs describe this process anywhere? (I have not been successful in finding an official explanation)

DavidTurner · October 24, 2023, 8:37pm

This version is really very old, it passed EOL several years ago. I strongly recommend upgrading to a supported version ASAP.

I would guess either a segment merge or a translog flush, you'd need to check indices stats to confirm.

Because the docs you link are for 7.17 and you're using a much older version.

The cluster is overloaded (likely IO-bound).

system · November 21, 2023, 8:37pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Elasticsearch index read only Elasticsearch	1	342	February 28, 2020
FORBIDDEN/12/index read-only / allow delete (api)] : indexes are set to "read only mode" Elasticsearch	4	915	September 23, 2019
[cluster_block_exception] - Disk allocation and usage help! Elasticsearch	2	566	July 18, 2019
Elasticsearch not handled more indexes, So goes to Read Only mode Elasticsearch	10	3106	August 15, 2019
ELK stuck in Read Only mode Elasticsearch	4	7902	November 21, 2018

Elasticsearch using lots of CPU and disk, flipping to read-only during heavy use

Related topics