Elasticsearch transformation query

venkatkumar229 · September 6, 2024, 10:36am

We are working on a data processing pipeline that involves multiple transformations. Specifically, we have a use case where the first transformation runs and calculates documents for various systems, including system1. In this transformation, we categorize documents based on their presence as either "Primary only", "Secondary only", or "Both".

In our second transformation, we need to calculate or process documents again for system1. I’m concerned about how changes from "Primary only" to "Both" from the first transformation will be managed. Specifically:

For example- We have 10 logs havings document as primary only now ,if for 2 logs the status changes to both from the first transform.

Since the documents with the "Primary only" status are already indexed, what’s the best approach to ensure these documents are properly updated or removed when their status changes to "Both"? We want to ensure that only "Primary only" documents are retained in the second transformation output.

QuentinH · September 6, 2024, 1:22pm

Hi,

You may have multiple options to solve this problem. One that I can think of right now would be to chain transforms. Your logs would be your source for the first transform. This transform would write a new status (Primary/Secondary/Both) in your destination index. You could then use this destination index as the source of your second transform that would only consider updated status from your first transform. Depending on what you are trying to achieve, there may be better solutions.

system · October 4, 2024, 1:23pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Transform not updating documents Elasticsearch transforms	9	698	December 14, 2022
Mapping - transform: only for creating new and not for updating? Elasticsearch	10	3201	July 5, 2017
Continuous transform of a transform destination index Elasticsearch transforms	1	110	May 16, 2024
Default behavior of sorting in the transform Elasticsearch	2	13	September 18, 2024
Transform vs update vs recreating index for historical/last values indexes Elasticsearch	0	93	May 21, 2024

Elasticsearch transformation query

Related topics