I want to group (aggregate) records on the basis of a specific ID. At the moment, my data has an event START and END record identified by the IDs. START and END record have different fields in them which I want to combine in one row.
The current mapping is similar to the attached image:
Here, Batch_ID is the field which I want to use for combining. I want to combine the fields from both START and END records in such a way that one row has important fields from both records.
Joining events while indexing might not be possible as the duration of time between both these events can be in hours. Regarding the use of table vis or transforms, any useful link will be highly appreciated. Though, I will explore these options on my own as well.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.