Rollup strategy in Elastic

I am looking for a feasible way to roll up data I have stored in Elasticsearch. The records are time-series based and can be grouped by timestamp, host, and URL path. What I had in mind was a cron job that looks at all records one day old that have not yet been merged into a granularity. It would bulk-write the new merged records into the same index and, once that completes, run a delete-by-query to remove the raw documents (those with no granularity) within that date range. Eventually I would configure the cron job to run at monthly/yearly granularities as well, and to collapse data to single-document granularity once it reaches a certain age. A rough sketch of the job is below.
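To make the idea concrete, here is a rough sketch of the job using the Python `elasticsearch` client and a composite aggregation (so this assumes Elasticsearch 6.1+). The index name (`metrics`), the field names (`@timestamp`, `host`, `path`, `value`, `granularity`), and the sum metric are placeholders for my actual mapping:

```python
from datetime import datetime, timedelta, timezone

from elasticsearch import Elasticsearch
from elasticsearch.helpers import bulk

es = Elasticsearch("http://localhost:9200")
INDEX = "metrics"  # placeholder index name

# Target window: the most recently completed day.
day_start = (datetime.now(timezone.utc) - timedelta(days=1)).replace(
    hour=0, minute=0, second=0, microsecond=0
)
day_end = day_start + timedelta(days=1)

# Match only raw documents (no granularity assigned yet) inside the window.
raw_docs = {
    "bool": {
        "filter": [
            {"range": {"@timestamp": {
                "gte": day_start.isoformat(), "lt": day_end.isoformat()}}}
        ],
        "must_not": [{"exists": {"field": "granularity"}}],
    }
}

# Page through every (host, path) bucket with a composite aggregation,
# bulk-writing one rolled-up document per bucket back into the same index.
after_key = None
while True:
    composite = {
        "size": 1000,
        "sources": [
            {"host": {"terms": {"field": "host"}}},
            {"path": {"terms": {"field": "path"}}},
        ],
    }
    if after_key:
        composite["after"] = after_key

    resp = es.search(
        index=INDEX,
        size=0,
        query=raw_docs,
        aggs={
            "groups": {
                "composite": composite,
                "aggs": {"value_sum": {"sum": {"field": "value"}}},
            }
        },
    )

    groups = resp["aggregations"]["groups"]
    bulk(
        es,
        (
            {
                "_index": INDEX,
                "_source": {
                    "@timestamp": day_start.isoformat(),
                    "host": b["key"]["host"],
                    "path": b["key"]["path"],
                    "value": b["value_sum"]["value"],
                    "doc_count": b["doc_count"],
                    "granularity": "1d",
                },
            }
            for b in groups["buckets"]
        ),
    )

    after_key = groups.get("after_key")
    if after_key is None:
        break

# Once the rollups are safely written, remove the raw documents for the window.
es.delete_by_query(index=INDEX, query=raw_docs, conflicts="proceed")
```

The composite aggregation is what makes me think a single (paginated) query might be enough: it streams buckets page by page via `after_key` rather than materializing all of them in one response.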

What I am unsure about is the strategy needed to aggregate the data. When there are millions of records to aggregate, is this something I can handle with a single Elasticsearch query that fetches the aggregations, or will I have to use something like Hadoop/MapReduce to read and aggregate the data? Would it be better to store each granularity in a separate index to make the rollup job easier?
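If separate indices turn out to be the better layout, I imagine the cleanup step at least gets much cheaper, since dropping a whole raw index replaces the delete-by-query. A minimal sketch of that variant, with hypothetical index names (`metrics-raw-YYYY.MM.dd` per day, `metrics-rollup-1d` per granularity):

```python
from datetime import datetime, timedelta, timezone

from elasticsearch import Elasticsearch

es = Elasticsearch("http://localhost:9200")

# Hypothetical layout: one raw index per day, one index per rollup granularity.
day = datetime.now(timezone.utc) - timedelta(days=1)
raw_index = day.strftime("metrics-raw-%Y.%m.%d")
rollup_index = "metrics-rollup-1d"

# ... run the same composite aggregation as in the sketch above against
#     raw_index, bulk-writing one document per bucket into rollup_index ...

# Cleanup becomes a cheap index delete instead of a delete-by-query
# over millions of documents.
es.indices.delete(index=raw_index)
```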
