Disk at 100% on a single-node cluster, how do I fix it?

I have a test single-node cluster.
I know what the problem is, but I can't seem to figure out how to get out of it and fix it without removing everything and starting over.

This node has all indices without replicas, because I executed this a while back:

PUT /_all/_settings
{
    "index" : {
        "number_of_replicas" : 0
    }
}

Now the / disk, which is also the Elasticsearch data path, is full, and I can't increase the disk size. How do I remove some unwanted or large older indices?

I can't even connect to it from the command line:

curl  -XGET  node11:9200/_cluster/health?pretty
{
  "error" : {
    "root_cause" : [
      {
        "type" : "master_not_discovered_exception",
        "reason" : null
      }
    ],
    "type" : "master_not_discovered_exception",
    "reason" : null
  },
  "status" : 503
}

And I know my elasticsearch/node folder has all the data.

Is there a solution in this situation?

What is the result of

_cat/indices/?s=pri.store.size:desc
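
For example, run against the same address as your curl above (the v parameter just adds column headers; this is only a sketch, it will of course only answer once the node responds at all):

curl -XGET "node11:9200/_cat/indices?v&s=pri.store.size:desc"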

Can you free up other, non-Elasticsearch disk space?

Even a few GB might help.

Hi,
I could and I did, to bring the system back up.

But now I want to know how to recover from this situation if there is no other way to remove any other logs/files from the system, and if I want to remove some indices to free up space.

Did you run the cat indices before you cleaned up?

Did you get a response?

The only way to clean up, AFAIK, is to DELETE indices directly.
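
For example (the index name below is just a placeholder; pick real ones from the _cat/indices output above):

curl -XDELETE "node11:9200/my-old-index-2021.01.01"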

I did a further test by filling up the disk to 100% manually, and I can't run any Elasticsearch command at all, can't communicate with the cluster at all.
Everything gives me the same error message.

What I believe is that if you ever get to 100% and can't remove anything, then the only option is to go into the Elasticsearch data dir and remove some index directories or Lucene files, picking them by their creation date. Which is very old-fashioned, but it works (I tested it).
Then once the system is up you will see that the index is unavailable, and you can delete it from Kibana.
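
Roughly the commands involved, as a sketch only: the paths assume a default package install (systemd service named elasticsearch, data under /var/lib/elasticsearch), and depending on the version the index directories live under nodes/0/indices or directly under indices. They are named by index UUID rather than index name, so double-check before removing anything.

# stop the node before touching the data dir
sudo systemctl stop elasticsearch

# list index directories oldest first; the names are index UUIDs
ls -ltr /var/lib/elasticsearch/nodes/0/indices

# remove the oldest one(s) to free space (the UUID below is a placeholder)
sudo rm -rf /var/lib/elasticsearch/nodes/0/indices/AbCdEfGhIjKlMnOpQrStUv

# start the node again, then delete the now-unavailable index via Kibana or the DELETE API
sudo systemctl start elasticsearch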

In production I don't want to be in that spot.

That is the most important statement in this whole thread :slight_smile:

Yep. I had some checks already in place, but didn't know that this could cause a lot of trouble.

What I have also thought of is putting a 1 or 2 GB dummy file in the data dir, since storage is cheap, and if the disk ever fills up we can remove that dummy file and get going on the required fix.
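
A minimal sketch of that idea (the path and size are just examples; the file only needs to live on the same filesystem as the data path):

# reserve ~2 GB of emergency headroom on the data disk
sudo fallocate -l 2G /var/lib/elasticsearch/emergency-headroom

# if the disk ever hits 100%, free the headroom so the node becomes responsive again
sudo rm /var/lib/elasticsearch/emergency-headroom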


You can / should set up Stack Management alerting to let you know when you get to 80% or 90%...
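
If you want a quick manual check in the meantime, the _cat allocation API shows per-node disk use (just a sketch of a spot check, not the Stack Management alerting rule itself):

curl -XGET "node11:9200/_cat/allocation?v&h=node,disk.used,disk.avail,disk.percent"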


It really depends on what you are going to do in production.

If you self-manage everything with VMs, I would say a good approach is for every data node to have a separate disk for data, and in that case storage is not that cheap.

Depending on the size of the disk, it can cost you the same as the VM itself if you are running in the cloud.

Also, a separate data disk makes it easier to configure and control the watermarks, and correctly configuring the watermarks will help you avoid this issue in production.
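
For reference, the watermarks are ordinary cluster settings, so something like this sets them (the thresholds below are only illustrative; pick values that leave enough headroom for your disk size):

PUT /_cluster/settings
{
    "persistent" : {
        "cluster.routing.allocation.disk.watermark.low" : "80%",
        "cluster.routing.allocation.disk.watermark.high" : "85%",
        "cluster.routing.allocation.disk.watermark.flood_stage" : "90%"
    }
}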

