Previously used Logstash to push data into Elastic. For some logs that had the most important data scattered around we used the Aggregate Filter Plugin to gather data across multiple lines and then push one single document to Elastic.
Pseudo Example
2026-05-26 12:00 (thread-01) [msgid:123]: Received file [hello.txt]
2026-05-26 12:00 (thread-01) [msgid:123]: File is from [some part]
2026-05-26 12:01 (thread-01) [msgid:123]: File moved to [some/dir] in 20ms
{
msgid: 123
file.name: hello.txt
part: some part
dir: some/dir
duration: 20
}
After moving to Custom Filestream Integration with ingest pipelines I found no Aggregate Filter Plugin replacement.
I thought a workaround could be to update the documents by replacing the _id, but it seems data streams have op_type=create, which means I'm not allowed to update documents when using data streams (according to AI) ?
Any thought on how to handle this scenario when using integration with ingest pipelines?