Hi
I have a requirement where i need to delete old records from Elasticsearch
Index . Before i delete them i want to archive them to a different location
. I see that there are tools for exporting data (elasticsearch-knapsack)
and deleting index (curator) . Is there a tool which does following
Archive records based on query from an Index
Deletes above records from Index after successful Archive operation.
Yes, with the knapsack plugin, you can specify a query, and the result set
can be archived into tar.gz or zip archive. Knapsack has no feature for
automatic deletion of documents since the operation may lead to unexpected
data loss. Deletion should be done from shell after archiving has
succeeded, when an external validation of the created archive succeeds
(e.g. counting the number of entries in the archive). Knapsack may fail if
disk space is full or file write privilege is insufficient, so it is
essential to validate backups before proceeding. Best tool from shell to
kill documents is curl, either by index delete or delete by query command.
Hi
I have a requirement where i need to delete old records from Elasticsearch
Index . Before i delete them i want to archive them to a different location
. I see that there are tools for exporting data (elasticsearch-knapsack)
and deleting index (curator) . Is there a tool which does following
Archive records based on query from an Index
Deletes above records from Index after successful Archive operation.
On Wednesday, May 7, 2014 10:16:50 AM UTC-5, Jörg Prante wrote:
Yes, with the knapsack plugin, you can specify a query, and the result set
can be archived into tar.gz or zip archive. Knapsack has no feature for
automatic deletion of documents since the operation may lead to unexpected
data loss. Deletion should be done from shell after archiving has
succeeded, when an external validation of the created archive succeeds
(e.g. counting the number of entries in the archive). Knapsack may fail if
disk space is full or file write privilege is insufficient, so it is
essential to validate backups before proceeding. Best tool from shell to
kill documents is curl, either by index delete or delete by query command.
Jörg
On Wed, May 7, 2014 at 4:43 PM, Vijay Dodla <vijay....@gmail.com<javascript:>
wrote:
Hi
I have a requirement where i need to delete old records from
Elasticsearch Index . Before i delete them i want to archive them to a
different location . I see that there are tools for exporting data
(elasticsearch-knapsack) and deleting index (curator) . Is there a tool
which does following
Archive records based on query from an Index
Deletes above records from Index after successful Archive operation.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.