Filter for specific field values and create another index to store for longterm storage

C_Shah · February 27, 2026, 9:24pm

I have filebeat sending logs from multiple files into an elasticseash index. What is the best way for me to filter on those logs and only store those filtered logs in an index for longer time (longer ILM policy)?

k8s pod logs → goes to a file → filebeat pods read the file and send specific annotated logs to an index. Now, I want to filter for specific logs within elasticsearch and store only those for longer timeframe. What is the best way to do so?

I’ve looked into rollup and transform jobs within Kibana, but they seem to be more for aggregation than simply to store the same documents and being able to filter further to get to finding what happened to a problem. They don’t delete/remove the documents from the source, so we would have duplicated documents right?

Thank you so much, in advance!

Tortoise · March 1, 2026, 3:29am

Hello @C_Shah

Welcome to the Community!!

Yes we will have duplicated data for short time :

Filebeat → logs-raw (ILM 7 days)
                ↓
      Transform (continuous)
                ↓
     logs-long-retention (ILM 180 days)

We can also try to filter data at source end i.e. while indexing the raw data by which we will ignore the records during indexing for the first time & no duplicate data will be stored. For this it will be required to the review the usecase.

Thanks!!

Rafa_Silva · March 1, 2026, 9:22pm

You don’t need Transform for this scenario. Transform is more suited for aggregation/pivot use cases and would introduce unnecessary duplication and extra indexing overhead.
The cleaner approach is to make the retention decision at ingest time. Route documents to different indices (or data streams) based on a condition (e.g., specific field values), and attach different ILM policies to each destination.

In other words:
Send all logs to a short-retention index by default.
During ingest, evaluate your filter condition.
If it matches, route that document to a long-retention index instead.
This avoids duplication, keeps the architecture simple, and scales much better than copying documents afterward.
In general, it’s best to separate retention policies through ingest-time routing rather than post-processing.

C_Shah · May 5, 2026, 3:34pm

How would I setup a filter condition at ingest time? Is this ingest in elasticsearch or at filebeat level?

Topic		Replies	Views
Saving documents in multiple indexes/Retention policy based on documents fields (Looking for ideas) Elasticsearch	1	194	April 8, 2022
Increased log retention for set of logs within an index Elasticsearch	4	470	September 5, 2022
Can i "Filter" certain logs into a Special Index? Elasticsearch	2	355	November 5, 2019
Timespan filter inside Elasticsearch index Elasticsearch	1	591	November 20, 2017
Copy speciifc data to another index automatically Elasticsearch	2	361	October 11, 2022

Filter for specific field values and create another index to store for longterm storage

Related topics