Backwards update for transform data

liorg2 · November 17, 2020, 10:54am

Hello

I wonder what the best practice is for filling in missing data after a transform fails (for example, because a new index was created with the wrong mapping, or for any other reason).

Example:
I created a new transform at 23:00
with the following group by: a date histogram of 1 minute and terms on server, plus an avg(load) aggregation.

At 00:00 a new index was created with the wrong mapping: server should be a keyword, but it's a text field.

The transform failed, and until it was fixed at 10am, data was missing from the dest index.

What is the recommended way of handling such a case? (Can it be automated?)

thanks

Hendrik_Muhs · November 17, 2020, 11:45am

Transforms use checkpoints. Because the transform failed, checkpointing did not proceed; it stopped at the point where the transform failed. Once you restart the transform, it continues from that checkpoint.

To restart the transform you first need to bring it into the stopped state by using

POST _transform/{id}/_stop?force=true

Afterwards you can start it again.
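For readers who want to script the two calls, here is a minimal Python sketch of the stop-then-start sequence. The helper names, the base URL and the absence of authentication are assumptions for illustration only; a real cluster will usually need credentials and TLS settings, and the underlying mapping problem still has to be fixed by hand before the restart can succeed.

```python
import urllib.request


def restart_requests(base_url, transform_id):
    """Return the (method, url) pairs for a force-stop followed by a start."""
    root = f"{base_url.rstrip('/')}/_transform/{transform_id}"
    return [
        ("POST", f"{root}/_stop?force=true"),  # bring the failed transform into the stopped state
        ("POST", f"{root}/_start"),            # resume from the last checkpoint
    ]


def restart_transform(base_url, transform_id):
    """Send both requests in order (hypothetical helper, no auth configured)."""
    for method, url in restart_requests(base_url, transform_id):
        request = urllib.request.Request(url, method=method)
        with urllib.request.urlopen(request) as response:
            response.read()
```

Calling `restart_transform("http://localhost:9200", "my-transform")` would issue both requests against a local node; `my-transform` is a placeholder id.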

There is no way to automate this, because the transform treats this error as a permanent problem that requires a user to fix it. This differs from a temporary problem, e.g. a temporary outage of the node that holds the data. If such a failure happens, the transform retries up to 10 times.

liorg2 · November 17, 2020, 4:16pm

Thanks a lot. May I also ask what the best option is for alerting on a failed transform?

Hendrik_Muhs · November 17, 2020, 7:16pm

This is a good question. As far as I know, the best option right now is to use Watcher with the http input.

I suggest configuring it against the _transform/{transform_id}/_stats endpoint and checking that the state is not failed.

(Note: it's called http input, but it can speak https.)
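Outside of Watcher, the same check can be sketched in a few lines of Python, for example to run from a cron job. This assumes the stats response contains a `transforms` list whose entries carry `id` and `state` fields; the function names and the unauthenticated request are illustrative assumptions, not part of any official client.

```python
import json
import urllib.request


def failed_transforms(stats_body):
    """Return the ids of all transforms reported in the failed state."""
    return [
        entry.get("id", "")
        for entry in stats_body.get("transforms", [])
        if entry.get("state") == "failed"
    ]


def fetch_stats(base_url, transform_id):
    """Fetch the stats document for one transform (hypothetical helper, no auth)."""
    url = f"{base_url.rstrip('/')}/_transform/{transform_id}/_stats"
    with urllib.request.urlopen(url) as response:
        return json.load(response)
```

A caller would pass the result of `fetch_stats(...)` to `failed_transforms(...)` and raise an alert whenever the returned list is not empty.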

I will follow up with the team on whether we can provide a better solution in the future, e.g. a transform-wide state. Feel free to open a GitHub issue as an enhancement request if you have a concrete idea of how it should look.
