Curator: delay deleting shrunk indices

dandrestor · September 22, 2017, 7:53am

Hello,

I am trying to implement the rollover pattern, as described in this excellent blog post, using curator, and I'm having some trouble with the shrinking part.

I would like to:

Shrink the index, and create the shrunk index with zero replicas, then
perform the other actions on the shrunk index (forcemerge, etc.), then
create replicas for redundancy on the shrunk index, and
only then delete the original unshrunk index.

I'm having trouble doing this with curator, which seems to only support deleting the original index immediately following a successful shrink. I tried finding other workarounds but couldn't find any. I believe some features that enable this may be missing, and I have a few feature requests prepared, but I wanted to check first with the community if maybe I'm missing something?

I'm posting this in the Elasticsearch category, as there is no Curator category.

Thanks for your thoughts.
//Dan

PS: My action file (WIP):

actions:
  1:
    action: shrink
    description: Shrink index to one shard
    options:
      shrink_node: DETERMINISTIC
      node_filters:
        permit_masters: True
      number_of_shards: 1
      number_of_replicas: 0
      delete_after: True
      shrink_suffix: -inactive
      wait_for_active_shards: 1
      extra_settings:
        settings:
          index.codec: best_compression
    filters:
      - filtertype: pattern
        kind: suffix
        value: -inactive
        exclude: True
      - filtertype: pattern
        kind: prefix
        value: log-
      - filtertype: age
        unit: days
        unit_count: 7
        direction: older
        source: field_stats
        field: '@timestamp'
        stats_result: min_value
      - filtertype: count
        count: 1
        exclude: False
  2:
    action: forcemerge
    description: Forcemerge the index to one segment
    options:
      max_num_segments: 1
    filters:
      - filtertype: pattern
        kind: suffix
        value: -inactive
      - filtertype: forcemerged
        max_num_segments: 1
        exclude: True
  3:
    action: replicas
    description: Set forcemerged indexes to have replicas
    options:
      count: 0
      wait_for_completion: True
    filters:
      - filtertype: pattern
        kind: suffix
        value: -inactive
      - filtertype: forcemerged
        max_num_segments: 1

theuntergeek · September 22, 2017, 4:33pm

So, you're saying that you want the Shrink action to do all of those actions as part of the shrink action, and not require other actions/steps?

That's a bit scary in terms of how long it could take, but I see the appeal, especially if you're pressed for space (as deleting after doing those things would be more space consuming).

It's not a bad idea. Please submit a feature request to https://github.com/elastic/curator

The flow I'm imagining would be as follows:

Shrink takes place with 0 replicas
forceMerge this index after (ostensibly with best_compression)
update the number of replicas to number_of_replicas
delete the original index.

This will be a bit of a change from the current behavior, so it will need to be a special flag, in case other users don't want this as part of their sequence.

dandrestor · September 23, 2017, 10:48am

No, I'm not saying that. I actually believe the shrink action is a little bit monolithic in its current form, and I totally agree that it would take too long to handle the entire process under a single action.

A couple of features that could be interesting to have which would enable this pattern:

Have an option to enable assigning the index to an alias, following the successful and unsuccessful processing of that index by the shrink action. This could even be a good idea to have as a generic option for all actions. Subsequent actions could use the alias filter to perform cleanup or follow-up actions on indexes as required based on their alias membership.
Have a sort of "name transformation" option on certain index, such as the delete index action. One could then apply the delete action for all filters ending in, say, "-inactive", transform the index name using some regex, and then apply the action (delete) using the transformed name as a parameter. This would enable for instance selecting all "...-inactive" indexes using a filter, but deleting those that actually end in "...-active".

I think idea number one is way better as it's more modular. Let me know if you like either of those; in the meantime I'll open a feature request also.

//Dan

theuntergeek · September 23, 2017, 4:32pm

I do like option 1. It's like "tagging" an index. But since Elasticsearch does not have such a feature, and alias is a good way to tag it.

system · October 21, 2017, 4:33pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Unable to shrink index using curator Elasticsearch curator	13	900	May 10, 2022
Shrink ends in red index state Elasticsearch curator	7	1080	March 28, 2019
How to find rolled indices Eligible to Shrink programmatically Elasticsearch	6	418	March 4, 2019
Action Shrink n Reindex with Curator Elasticsearch curator	1	383	December 17, 2020
Shrink do not proceed because of number of shard filter Elasticsearch docker , curator	4	178	March 7, 2024

Curator: delay deleting shrunk indices

Related topics