How can we see how long indexes can be retained for? Can we make a scheduled task for it? How can we maintain an index by looking into freeing up disk space in intervals?
1:
action: delete_indices
description: >-
Delete indices older than 60 days (based on index name), for metricbeat-
prefixed indices. Ignore the error if the filter does not result in an
actionable list of indices (ignore_empty_list) and exit cleanly.
options:
ignore_empty_list: True
timeout_override:
continue_if_exception: False
disable_action: False
filters:
- filtertype: pattern
kind: prefix
value: metricbeat-
exclude:
- filtertype: age
source: name
direction: older
timestring: '%Y.%m.%d'
unit: days
unit_count: 28
exclude:
Here unit_count is 28 which means that it will be the number of days worth of data that we would like to retain. can it be more than 28 days?
And if not in curator, how long can we retain data in indexes before it starts affecting performance?
How can we see how long indexes can be retained for?
Look at how much disk space your indices for a certain time period use, say one day or one week, then look at how much disk space your cluster has. Whether your cluster has enough RAM to keep all the indices open is a different matter which may only become evident in time.
Can we make a scheduled task for it?
Yes. The cluster I work with is on Linux and we curator is called from a cron job once a day.
Here unit_count is 28 which means that it will be the number of days worth of data that we would like to retain. can it be more than 28 days?
In what you posted unit_count value of 28 doesn't match description which says "older than 60 days". You can set it to any number of days you want. Just make sure the cluster has enough resources to handle that.
And if not in curator, how long can we retain data in indexes before it starts affecting performance?
I don't understand the "and if not in curator" part but how long you can keep data for before it affects performance depends on how much data you have and the resources your cluster has.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.