To delete old raw data without impacting the continuous lifetime cardinality aggregation transform, it was suggested that I stack two transforms: the first aggregates the raw data into hourly summaries, and its output is fed into a second transform that aggregates the hourly summaries into a lifetime total.
I am not sure how to set this up in practice, because the output of the hourly cardinality transform is just a single integer (the number of unique values seen in that hour), not a data structure like an HLL sketch that can be rolled up to a higher level. Is stacking the right way to do this, or is there a different approach?
I just re-read my previous post and it sounds misleading; sorry for not being clear. It should have said that the stacked approach does not work for cardinality: distinct counts cannot be summarized this way, because a count from one bucket cannot be combined with a count from another without double-counting values that appear in both.
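A minimal sketch of why summing per-bucket distinct counts fails (plain Python, with made-up hourly data): a value that appears in two different hours is counted once per hour, so the sum of the hourly integers exceeds the true lifetime cardinality.

```python
# Hypothetical example data: user IDs seen in two consecutive hours.
hour_1 = ["alice", "bob", "carol"]
hour_2 = ["bob", "carol", "dave"]

# What the stacked transforms would produce: a plain integer per hour...
hourly_counts = [len(set(hour_1)), len(set(hour_2))]  # [3, 3]

# ...and their sum, which double-counts "bob" and "carol".
naive_lifetime = sum(hourly_counts)  # 6

# The true lifetime cardinality needs the underlying values (or a
# mergeable sketch), not just the per-hour integers.
true_lifetime = len(set(hour_1) | set(hour_2))  # 4

print(naive_lifetime, true_lifetime)  # 6 4
```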
There have been ideas about making HLL a native data type in Lucene, so that the sketch itself could be stored and updated rather than only a final integer count being returned.
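To illustrate what a mergeable HLL type would buy, here is a sketch using the Apache DataSketches Python library. This is not something Elasticsearch or Lucene exposes today; the library choice, the `LG_K` precision value, and the sample data are assumptions purely for illustration of the mergeability property the Lucene idea would rely on.

```python
from datasketches import hll_sketch, hll_union

LG_K = 12  # precision parameter (2^12 internal buckets); chosen arbitrarily here

# Build one HLL sketch per hourly bucket from that hour's raw values.
hour_1 = hll_sketch(LG_K)
for value in ["alice", "bob", "carol"]:
    hour_1.update(value)

hour_2 = hll_sketch(LG_K)
for value in ["bob", "carol", "dave"]:
    hour_2.update(value)

# Because HLL sketches are mergeable, the lifetime cardinality can be
# rolled up from the hourly sketches after the raw data is deleted.
union = hll_union(LG_K)
union.update(hour_1)
union.update(hour_2)

print(round(union.get_result().get_estimate()))  # ~4, not 6
```

This is exactly the property the plain integer output lacks: the union of two sketches estimates the cardinality of the combined value set, so duplicates across hours are not double-counted.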