Can Snapshots save index in a limited time

Skairik · May 16, 2023, 10:00am

Hello everyone,
I would like to know if it is possible to set up a snapshot policy that retrieves for example indexes only from the last 7 days. For example, I save my logs from my active directory with this format:

index => "winlogbeat-%{+YYYY.MM.dd}"

And I wanted to use this to recover that the last 7 daysI tested a few things with the settings like this:

PUT /_slm/policy/snap-hebdo-ad
{
  "schedule": "0 0 1 * * ?", 
  "name": "snap-hebdo-ad", 
  "repository": "AD", 
  "config": {
    "indices": [
      "winlogbeat-*"
    ],
    "metadata": {
      "taken_at": "now-7d/d"
    },
    "ignore_unavailable": true,
    "include_global_state": false,
    "partial": true
  },
  "retention": {
    "expire_after": "60d",
    "min_count": 1,
    "max_count": 10
  }
}

But nothing worked (it saves every winlobeat-*), so I would like to know already if what I am asking for is possible, and if it is, how!

FALEN · May 16, 2023, 10:15am

If your concern is storage usage, below should answer that;

How snapshots work (here)

Snapshots are automatically deduplicated to save storage space and reduce network transfer costs. To back up an index, a snapshot makes a copy of the index’s segments and stores them in the snapshot repository. Since segments are immutable, the snapshot only needs to copy any new segments created since the repository’s last snapshot.

Each snapshot is also logically independent. When you delete a snapshot, Elasticsearch only deletes the segments used exclusively by that snapshot. Elasticsearch doesn’t delete segments used by other snapshots in the repository.

If your concern is restore process, you can select indices manually while restoring snapshot. You don't need to restore all

Skairik · May 16, 2023, 10:29am

Thanks for the reply,

My concern is the use of storage, but the following information does not allow me to answer my question unless the answer is no since I have not seen what I am looking for anywhere.

FALEN · May 16, 2023, 10:48am

As noted above, ALL Elasticsearch snapshots are similiar to incremental. Basically the cluster (actually each node) looks at what segments it has to snapshot vs. what segments are already in the repository, and writes the missing ones. Plus a bunch of references, states, and other metadata. That’s it.

So, if you have daily, weekly snapshot jobs scheduled. You should not concern storage. Because storage used will not multiple with each snapshot data

Skairik · May 16, 2023, 12:04pm

Aaah! Actually present it this way is more logical, I thought badly because my first goal was to retrieve only my logs from last week and leave the others because they were test logs and were going to be deleted.
But if we remove this exceptional case indeed, you are right, given how the snapshot system works, I do not need to try to recover only certain logs.

Thanks for the reply again !

system · June 13, 2023, 12:04pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Snapshot Strategy for archival Elasticsearch	3	1053	September 7, 2017
Questions about backup strategy Elasticsearch	4	3435	May 27, 2019
Elasticsearch take snapshot then clean indices which older than 7 days Elasticsearch	1	390	January 5, 2021
Point in time recovery from a snapshot Elasticsearch	2	1671	December 12, 2019
Permanently retaining all snapshot segments Elasticsearch snapshot-and-restore	3	400	August 10, 2021

Can Snapshots save index in a limited time

How snapshots work (here)

Related topics