Removing Data Without Deleting Index While Using Filebeat on Elasticsearch

SFD13 · February 6, 2023, 7:35am

Hi everyone,

I am using a filebeat-* index with some fields on Elasticsearch. I want to remove all the data stored in it while keeping the index itself. In other words, the index name and its structure (the available fields) should remain, and only the documents should be deleted.

Some people say that

curl -XDELETE 0:9200/filebeat-*

could be useful, but I suspect that it would delete the index itself. How can I solve this problem?

Thanks,

mverbeek · February 6, 2023, 7:56am

Hello,

You can try the delete_by_query API to delete specific documents.

SFD13 · February 6, 2023, 8:24am

I am using Elasticsearch 7.17.6 offline. Might that cause any problems?

Would the following delete the data without deleting the index?

curl -XDELETE 0:9200/filebeat/*

When I process data with Zeek, the results are uploaded to Elasticsearch automatically. I want to delete the existing data before processing new data with new features for visualization.

mverbeek · February 6, 2023, 8:52am

If you use the normal DELETE API, you will delete the entire index. The delete_by_query API is also available in older versions, including 7.17.

Note that the delete_by_query API is a slow way to do this.

POST /test-index/_delete_by_query
{
  "query": {
    "match_all": {}
  }
}
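Since the request above is written in Kibana Console syntax, here is a rough curl equivalent for an offline host. This is only a sketch: the ES_URL and INDEX defaults and the helper names delete_by_query_url and delete_all_docs are assumptions made for illustration, not something taken from this thread.

```shell
# Assumed defaults; adjust to your own cluster and index name.
ES_URL="${ES_URL:-http://localhost:9200}"
INDEX="${INDEX:-filebeat-*}"

# Build the _delete_by_query URL for an index name or wildcard pattern.
# conflicts=proceed lets the request continue past version conflicts.
delete_by_query_url() {
  printf '%s/%s/_delete_by_query?conflicts=proceed' "$ES_URL" "$1"
}

# Delete every document matched by match_all.
# The index itself, with its mappings and settings, stays in place.
delete_all_docs() {
  curl -s -X POST "$(delete_by_query_url "$1")" \
    -H 'Content-Type: application/json' \
    -d '{"query": {"match_all": {}}}'
}

# Uncomment to run against a live cluster:
# delete_all_docs "$INDEX"
```

The match_all query selects every document, so on a large index this can take a while, which is the slowness mentioned above.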

warkolm · February 6, 2023, 11:03pm

Welcome to our community!

Please note that delete-by-query is super inefficient. Why exactly do you want to do this?

SFD13 · February 7, 2023, 7:49am

Actually, I am using Zeek offline to extract files from PCAPs. The extracted data is visualized in Kibana on top of Elasticsearch. I have a single index named 'filebeat'.

When I make content-related changes in my Zeek setup, I cannot see those changes in Elastic, because the data was already ingested (and visualized) before. That's why I want to completely delete the extracted data from Elastic. I want to delete all the data without deleting the index itself, so that Filebeat does not run into problems afterwards. @warkolm

Ayush_Mathur · February 7, 2023, 8:08am

@SFD13 you can safely delete the index if you are re-ingesting the whole dump of data into ES anyway. When Filebeat pushes the logs, ES checks whether the index is present and, if it is not, creates a new index for you.
The only issue could be a specific index name that your application relies on; in that case, the index name can be set in the Filebeat configuration.
PS: there is no point in deleting all the data and keeping the index. ES only creates an index when the very first document/event/log for that index is ingested and stored.

SFD13 · February 7, 2023, 8:29am

If I delete the index, will YAML files such as filebeat.yml or filebeat.reference.yml under /etc/ remain unchanged? So, as you said, if I delete the index with, e.g.,

curl -XDELETE http://localhost:9200/filebeat

and then when I start the service with

sudo filebeat setup

will it be updated in Elasticsearch? @Ayush_Mathur

Ayush_Mathur · February 7, 2023, 8:50am

Deleting an index and restarting Filebeat are two separate things. When you delete an index in Elasticsearch, it has no impact whatsoever on Filebeat. The only thing affected is your logs, which are deleted and will no longer be available in Kibana or any other client.

When you start Filebeat, it will run normally: reading the logs, enriching them with your processors (if any), and then sending them to Elasticsearch. By default, Filebeat sets the target index name on every event it sends (the _index value, which ingest pipelines see as ctx._index), telling ES in which index that particular document/event/log must be stored. Note that Filebeat itself does not store the log or document in the index; it is Elasticsearch that writes and stores it.

So essentially, when you delete the index, it is removed from ES. When you spin up Filebeat again, it sends logs to ES to be stored in the index named by _index. Elasticsearch checks whether that index exists. If it does, the document is added to it; if not, ES creates a new index with that name and stores the document there.

sudo filebeat setup essentially loads the index template (index settings and field mappings), the ILM policy, etc. into ES; these are applied whenever a new index is created from that template. Filebeat also performs this setup when it starts, but nothing changes if the templates and policies themselves have not been modified. This command does not affect the reading and sending of logs, although it does define how documents are stored and how indices are managed.
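Putting the pieces above together, a full reset might look roughly like the sketch below. The ES_URL default, the helper names index_url and delete_index, and the service stop step are assumptions added for illustration; the delete, setup, and start commands are the ones already mentioned in this thread.

```shell
# Assumed default; adjust to your own cluster.
ES_URL="${ES_URL:-http://localhost:9200}"

# URL of an index (or wildcard pattern) on the cluster.
index_url() {
  printf '%s/%s' "$ES_URL" "$1"
}

# Delete the index together with all of its documents.
delete_index() {
  curl -s -X DELETE "$(index_url "$1")"
}

# Reset sequence. Uncomment to run on a live host:
# sudo service filebeat stop     # assumed step: stop shipping while cleaning up
# delete_index 'filebeat'        # remove the old index and its data
# sudo filebeat setup            # reload the index template, ILM policy, etc.
# sudo service filebeat start    # the first new event recreates the index
```

None of these steps touch filebeat.yml or filebeat.reference.yml; the DELETE request only changes what is stored in Elasticsearch.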

SFD13 · February 13, 2023, 10:19am

After I delete an index, Elasticsearch will re-create it. Is that right?

For instance, assume I delete the index named filebeat. When I then run sudo service filebeat start, the index should be recreated and the data should flow into Elasticsearch again.

I assume that is what happens after I delete an index. @Ayush_Mathur

Ayush_Mathur · February 13, 2023, 10:26am

That's right, a new index will be re-created as soon as the first document is ingested for that index.
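One way to confirm this after restarting Filebeat is to list the matching indices and count their documents. This is a minimal sketch; the ES_URL default, the 'filebeat*' pattern, and the helper names cat_indices_url and count_url are assumptions for illustration.

```shell
# Assumed default; adjust to your own cluster.
ES_URL="${ES_URL:-http://localhost:9200}"

# URL that lists indices matching a pattern; ?v adds column headers.
cat_indices_url() {
  printf '%s/_cat/indices/%s?v' "$ES_URL" "$1"
}

# URL that returns the number of documents in an index or pattern.
count_url() {
  printf '%s/%s/_count' "$ES_URL" "$1"
}

# Uncomment to run against a live cluster:
# curl -s "$(cat_indices_url 'filebeat*')"
# curl -s "$(count_url 'filebeat*')"
```

If the listing shows no filebeat index yet, Filebeat has most likely not sent its first event since the delete.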

SFD13 · February 13, 2023, 10:49am

Thanks for your help and your time, @Ayush_Mathur

