How to partially delete an index

its-ogawa · April 6, 2021, 10:03am

How can I delete only the records that match certain criteria from an index in Elasticsearch?

I don't want to delete the entire index.
In other words, I am not looking for the following command:

curl -XDELETE localhost:9200/index_name?pretty=true

its-ogawa · April 6, 2021, 10:06am

In addition, please let me know if there is a way to do a test run (like a dry run, which checks what would happen without actually deleting anything).

dadoonet · April 6, 2021, 10:14am

Have a look at the Delete by query API.

For a dry run, do a _search instead with the exact same query.

its-ogawa · April 6, 2021, 10:21am

Thank you for answering my question.

Can this be done with curl?
I get an error with the following command.

# curl -XPOST 'localhost:9200/api-2021.04.06/_search' -d '{ "query": { "match": { "log.file.path":"/var/log/api/api-2021-02" } } }'
{"error":"Content-Type header [application/x-www-form-urlencoded] is not supported","status":406}
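
For reference, the 406 comes from the missing header rather than from the query: since Elasticsearch 6.0, a request with a body must declare its content type, and curl's `-d` defaults to `application/x-www-form-urlencoded`. Below is a minimal sketch of the same search with the header added, assuming the host and index from the question. The names `ES_URL`, `QUERY`, `search_url`, and `run_search` are only illustrative.

```shell
# Assumption: same host and index as in the question.
ES_URL="http://localhost:9200"
QUERY='{ "query": { "match": { "log.file.path": "/var/log/api/api-2021-02" } } }'

# Build the _search URL for the index given as the first argument.
search_url() {
  printf '%s/%s/_search?pretty' "$ES_URL" "$1"
}

# Same request as above, with an explicit JSON Content-Type header.
run_search() {
  curl -s -XPOST "$(search_url "$1")" \
    -H 'Content-Type: application/json' \
    -d "$QUERY"
}

# Usage against a running cluster:
#   run_search api-2021.04.06
```

Because this only searches, it also serves as the dry run: the hits it returns are the documents a `_delete_by_query` with the same body would target.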

its-ogawa · April 6, 2021, 11:19am

First of all, I decided to do as you said and sent a GET request from Kibana's Dev Tools console.

The following request works.

GET /api-2021.04.06/_search
{
  "query": {
    "bool": {
      "must": {
        "match": {
          "log.file.path":"/var/log/api/api-2021-02-*.log"
        }
      }
    }
  }
}

However, the all-important delete request does not work.

POST /api-2021.04.06/_delete_by_query
{
  "query": {
    "bool": {
      "must": {
        "match": {
          "log.file.path":"/var/log/api/api-2021-02-*.log"
        }
      }
    }
  }
}

{"statusCode":502,"error":"Bad Gateway","message":"Client request timeout"}

It means a timeout, but I don't believe it is that big a log. What could be the cause? Also, is there any way to extend the timeout period?
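
A 502 from the Kibana console most likely means that Kibana stopped waiting for Elasticsearch, not necessarily that the delete itself finished or failed. One way to avoid depending on the console timeout is to start the delete as a background task with `wait_for_completion=false` and then poll the Tasks API. This is a sketch under the assumption that the cluster is reachable on localhost:9200; the function names are hypothetical, and the task id has to be copied from the first response.

```shell
# Assumption: same host as in the question.
ES_URL="http://localhost:9200"
QUERY='{"query":{"match":{"log.file.path":"/var/log/api/api-2021-02-*.log"}}}'

# URL that starts the delete as a background task instead of blocking.
async_delete_url() {
  printf '%s/%s/_delete_by_query?wait_for_completion=false' "$ES_URL" "$1"
}

# URL for inspecting a task by the id returned from the request above.
task_url() {
  printf '%s/_tasks/%s' "$ES_URL" "$1"
}

# Start the delete; the JSON response contains a "task" field with the id.
start_delete() {
  curl -s -XPOST "$(async_delete_url "$1")" \
    -H 'Content-Type: application/json' -d "$QUERY"
}

# Check progress; the response reports "completed": true when done.
check_task() {
  curl -s -XGET "$(task_url "$1")"
}

# Usage against a running cluster:
#   start_delete api-2021.04.06
#   check_task <task id from the response>
```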

its-ogawa · April 6, 2021, 11:40am

If I run it from elasticsearch-head, I get another error.

"failures": [
{
... snip ...
"cause": {
"type": "cluster_block_exception",
"reason": "index [api-2021.04.06] blocked by: [TOO_MANY_REQUESTS/12/disk usage exceeded flood-stage watermark, index has read-only-allow-delete block];"
},
"status": 429
}

I saw this same error in Kibana when I had a lot of logs fetched by filebeat and the disk was getting tight.

However, this error is exactly why I am trying to delete the logs.
If the delete is also blocked, how can I free up disk space?
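
The `read-only-allow-delete` block still permits deleting whole indices, and that is what actually frees disk space. Once space is available, the block can be cleared by resetting `index.blocks.read_only_allow_delete` to `null` (Elasticsearch 7.4 and later also release it automatically when usage drops below the high watermark). A minimal sketch, assuming localhost:9200; the function names are illustrative, and which index is safe to delete is something only the cluster owner can decide.

```shell
# Assumption: same host as in the question.
ES_URL="http://localhost:9200"

# Settings body that removes the read-only-allow-delete block.
UNBLOCK_BODY='{"index.blocks.read_only_allow_delete": null}'

# Build the _settings URL for the index given as the first argument.
settings_url() {
  printf '%s/%s/_settings' "$ES_URL" "$1"
}

# Step 1: free space by deleting an index that is no longer needed.
delete_index() {
  curl -s -XDELETE "$ES_URL/$1"
}

# Step 2: clear the block on the index that should accept writes again.
clear_block() {
  curl -s -XPUT "$(settings_url "$1")" \
    -H 'Content-Type: application/json' -d "$UNBLOCK_BODY"
}

# Usage against a running cluster:
#   delete_index <an index you no longer need>
#   clear_block api-2021.04.06
```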

dadoonet · April 6, 2021, 11:48am

Deleting a document does not remove it immediately. It writes more data to disk to mark the document as deleted.

Then it eventually removes it when a merge happens.
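
If waiting for a background merge is not an option, the force merge API accepts `only_expunge_deletes=true`, which merges only the segments that contain deleted documents. The sketch below assumes localhost:9200, and the function names are hypothetical. Keep in mind that a merge needs temporary disk space of its own, so it may not help on a disk that is already nearly full.

```shell
# Assumption: same host as in the question.
ES_URL="http://localhost:9200"

# URL for a force merge that only reclaims space held by deleted documents.
forcemerge_url() {
  printf '%s/%s/_forcemerge?only_expunge_deletes=true' "$ES_URL" "$1"
}

# Trigger the merge; this request takes no body.
expunge_deletes() {
  curl -s -XPOST "$(forcemerge_url "$1")"
}

# Usage against a running cluster:
#   expunge_deletes api-2021.04.06
```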

Since you did not say so initially, could you describe what problem you want to fix by deleting some documents?

its-ogawa · April 6, 2021, 11:58am

I am sorry that I did not explain it well enough.

Since you did not say so initially, could you describe what problem you want to fix by deleting some documents?

What do you mean by this?

Does it mean that it is not practical to request a delete query from elasticsearch in this situation?

Please let me know if you have any good ideas to solve [TOO_MANY_REQUESTS/12/disk usage exceeded flood-stage watermark, index has read-only-allow-delete block].

its-ogawa · April 6, 2021, 12:21pm

Then it eventually removes it when a merge happens.
Perhaps I have run into the situation described here.

The following error has occurred.

"failures": [
{
... snip ...
"cause": {
"type": "version_conflict_engine_exception",
"reason": "[3NYwpngBPtRZ_HR4DA5t]: version conflict, required seqNo [11709405], primary term [1]. but no document was found",
"index_uuid": "4FRCHYUyTzWKrYs1fmr2Aw",
"shard": "0",
"index": "api-2021.04.06"
},
"status": 409
}

Is there any way to resolve the conflict?
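
For context, `_delete_by_query` works from a snapshot of the index taken when the request starts. If a document changes or disappears before it is processed (for example because an earlier, timed-out attempt already removed it), that document fails with a version conflict and, by default, the request aborts. Setting `conflicts=proceed` makes it count the conflicts and continue. A sketch assuming localhost:9200, with illustrative names:

```shell
# Assumption: same host as in the question.
ES_URL="http://localhost:9200"
QUERY='{"query":{"match":{"log.file.path":"/var/log/api/api-2021-02-*.log"}}}'

# URL for a delete by query that skips version conflicts instead of aborting.
tolerant_delete_url() {
  printf '%s/%s/_delete_by_query?conflicts=proceed' "$ES_URL" "$1"
}

# Run the delete; conflicts are reported in "version_conflicts" in the response.
tolerant_delete() {
  curl -s -XPOST "$(tolerant_delete_url "$1")" \
    -H 'Content-Type: application/json' -d "$QUERY"
}

# Usage against a running cluster:
#   tolerant_delete api-2021.04.06
```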

dadoonet · April 6, 2021, 3:56pm

What is the current status of your cluster?

What is the output of:

GET /
GET /_cat/nodes?v
GET /_cat/health?v
GET /_cat/indices?v

If some outputs are too big, please share them on gist.github.com and link them here.

its-ogawa · April 7, 2021, 1:31am

Thank you for your answer.

I'm very sorry, but I couldn't wait for your answer, so I deleted the corresponding index, and now I can't reproduce it.

I think all the current statuses are in a good state.
I will share the current status.

https://gist.github.com/its-ogawa/160825512afd43fe842cb8ac4c902431

If the status is not healthy, what does this command tell you?

its-ogawa · April 7, 2021, 7:17am

I have deleted the index and can no longer check it. Are the following requests correct?

GET /api-2021.04.06/_search
{
  "query": {
    "bool": {
      "must": {
        "match": {
          "log.file.path":"/var/log/api/api-2021-02-*.log"
        }
      }
    }
  }
}

However, the all-important delete request does not work.

POST /api-2021.04.06/_delete_by_query
{
  "query": {
    "bool": {
      "must": {
        "match": {
          "log.file.path":"/var/log/api/api-2021-02-*.log"
        }
      }
    }
  }
}

I would like to know this because in the future I may need to delete only some of the documents in an index.

dadoonet · April 7, 2021, 9:12am

It just gives me an overview of your cluster so that I have a better understanding before trying to dig into things.

Why?

If you are running out of disk space, I can understand that but otherwise I don't see why it would not work.

its-ogawa · April 9, 2021, 2:16am

If you are running out of disk space, I can understand that but otherwise I don't see why it would not work.

I'm sorry for the confusion.
Your advice turned out to be correct.

[TOO_MANY_REQUESTS/12/disk usage exceeded flood-stage watermark, index has read-only-allow-delete block].

The cause seems to be that Elasticsearch was blocking writes to the index because disk space was exhausted.

We have secured enough disk space and changed the destination of the index in elasticsearch.
After collecting the logs again and confirming that there were no errors, I ran the above command and it worked.
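
To catch this earlier next time, disk usage per node can be checked with `_cat/allocation` and compared against the disk watermarks. The sketch below assumes localhost:9200 and the default thresholds (low 85%, high 90%, flood-stage 95%); a cluster with custom `cluster.routing.allocation.disk.watermark.*` settings would need different numbers. The helper names are hypothetical.

```shell
# Assumption: same host as in the question.
ES_URL="http://localhost:9200"

# URL that lists disk usage per node (see the disk.percent column).
allocation_url() {
  printf '%s/_cat/allocation?v' "$ES_URL"
}

# Fetch the allocation table from a running cluster.
check_disk() {
  curl -s -XGET "$(allocation_url)"
}

# Classify an integer disk.percent value against the DEFAULT watermarks.
# The boundaries are approximate: the real checks compare exact usage.
watermark_level() {
  if [ "$1" -ge 95 ]; then
    echo "flood-stage"
  elif [ "$1" -ge 90 ]; then
    echo "high"
  elif [ "$1" -ge 85 ]; then
    echo "low"
  else
    echo "ok"
  fi
}

# Usage:
#   check_disk
#   watermark_level 96
```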

system · May 7, 2021, 2:16am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
