I am collecting data from an IoT device every second and sending it to Elasticsearch. I want to keep only the last 7 days of data in my index pattern. How do I delete the older documents from my index?
I have a field called "event_ts", which is the time field, and I want to perform the delete operation based on it.
I don't want to create an index pattern for every day, because I am building a dashboard on top of the index pattern and I can't keep rebuilding the dashboard. Is there any way I can search all the records in my index pattern and delete records based on the time parameter?
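For reference, a delete-by-query against the `event_ts` field would look something like this (the index name `iot-data` is illustrative, not from the original post):

```
POST /iot-data/_delete_by_query
{
  "query": {
    "range": {
      "event_ts": {
        "lt": "now-7d/d"
      }
    }
  }
}
```

`now-7d/d` uses Elasticsearch date math: seven days ago, rounded down to the start of the day. Run on a schedule (e.g. daily via cron), this maintains a rolling seven-day window, though the discussion below explains why this is the expensive way to do it.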
@dadoonet is correct: there is no reason not to use a different index per day for your use case. Because Kibana builds visualizations and dashboards on top of an Index Pattern rather than a single index, it handles new daily indices transparently.
If you click on Index Patterns, you will be taken to another screen where you can Create Index Pattern. There you can define patterns that follow the foo-* example shared by @dadoonet.
Once created, when you go to build a new visualization in Kibana, your Index Pattern will appear in the list you select from.
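As a sketch of how the daily-index approach looks on the write side (index and field names here are illustrative), each document is written to a date-suffixed index, and a single pattern such as iot-* covers all of them in Kibana:

```
PUT /iot-2021.06.01/_doc/1
{
  "event_ts": "2021-06-01T09:15:00Z",
  "temperature": 21.4
}
```

Ingest tools such as Logstash can generate the date suffix from the event timestamp for you, so the writer doesn't need any special logic.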
To further round out the discussion: you can still perform a delete_by_query, but it is very inefficient for deleting data from indices compared with deleting entire indices. The difference is similar to the difference between these SQL pseudo-statements:

```sql
DELETE FROM table WHERE timestamp < now-7days
```

and

```sql
DROP TABLE table
```
The DELETE FROM statement has to run a query, perform a comparison on every document, and then set up a series of atomic DELETE operations for each match found, while the DROP statement is over and done in a single operation. The analogy isn't perfect, because SQL databases are designed to handle this sort of thing better than Elasticsearch is.
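In Elasticsearch terms, the DROP TABLE side of the analogy is a plain index deletion (index name illustrative):

```
DELETE /iot-2021.05.25
```

This is a single, near-instant operation that removes the whole day's data at once, with none of the per-document query, match, and delete work that delete_by_query incurs.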
Elasticsearch makes things worse still, because deleting a document doesn't immediately free resources; it only marks the document as deleted. The document isn't actually removed until the next segment merge, which involves yet another scan of the documents to decide which to keep and which to discard. That's at least two scans over all of your documents just to free the resources. Mismatched segment sizes, which is what document deletes produce, also make Lucene a bit less efficient.
Elasticsearch handles these scenarios well, but if you didn't have to delete documents from an index, it would be much, much more efficient, which is why @dadoonet recommended using daily indices—as do I.