Heartbeat data roll-up

sderungs · April 16, 2020, 6:21am

Hi

I was wondering if there are any good practices around roll-up for data in Elasticsearch coming from Heartbeat or if anyone has experience with it.

Some background:
We're collecting uptime data from multiple systems, and the amount of data can get quite big (monitors are set to collect data every 15 seconds). However, our need for data granularity/resolution decreases over time: for the past 7 days a granularity of 15 seconds is good, but the same is not true for data 1 month back, where buckets of 5-minute averages would be enough. For data even further back (3 months in the past and older), buckets of 60-minute averages would be enough.
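To put rough numbers on the resolutions mentioned above, here is a small back-of-the-envelope sketch. It assumes one document per monitor per check for raw data and one document per monitor per bucket after roll-up; real document counts may differ (for example, when a monitor checks several hosts).

```python
SECONDS_PER_DAY = 24 * 60 * 60

def docs_per_day(interval_seconds: int) -> int:
    """Documents per monitor per day at a given interval (assumes one doc per interval)."""
    return SECONDS_PER_DAY // interval_seconds

# The three resolutions discussed in the post, expressed in seconds.
tiers = {"raw (15 s)": 15, "5-minute buckets": 300, "60-minute buckets": 3600}

for name, interval in tiers.items():
    print(f"{name}: {docs_per_day(interval)} documents per monitor per day")
```

Under these assumptions, going from 15-second data to 5-minute buckets is a 20x reduction, and going to 60-minute buckets is a 240x reduction.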

The ideas so far:

1. The Heartbeat index contains raw data for the past 2 weeks
2. A roll-up job aggregates data into 5-minute buckets
3. ILM takes care of deleting raw data older than 2 weeks
4. Another roll-up job aggregates the roll-up index from point 2 above into another index with 60-minute buckets
5. ILM takes care of deleting rolled-up data from point 2
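As an illustration of the ILM part of this plan, the sketch below builds the request body for a delete-only ILM policy and prints it in a form that could be pasted into Kibana Dev Tools. The policy name `heartbeat-raw-retention` is hypothetical, and the 14-day value simply mirrors the "2 weeks" above. Note that `min_age` is measured from rollover if the index is rolled over, otherwise from index creation, so the effective retention may be somewhat longer than 14 days.

```python
import json

def delete_after_policy(min_age: str) -> dict:
    """Build an ILM policy body whose only phase deletes the index after min_age."""
    return {
        "policy": {
            "phases": {
                "delete": {
                    "min_age": min_age,
                    "actions": {"delete": {}},
                }
            }
        }
    }

# Hypothetical policy name; 14d mirrors the two-week raw-data retention in the post.
raw_policy = delete_after_policy("14d")
print("PUT _ilm/policy/heartbeat-raw-retention")
print(json.dumps(raw_policy, indent=2))
```

The same helper could be reused with a longer `min_age` for the 5-minute roll-up index from point 5, assuming that index is managed by its own policy.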

Questions:

1. Are 4. and 5. even possible?
2. Does the Uptime app in Kibana support roll-up indices?
3. How do you maintain your Heartbeat data? I'm struggling to get a reasonable setup...

Cheers,
Stefan

P.S. The entire Elastic environment is running on version 7.6.

system · May 14, 2020, 6:21am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Andrew_Cholakian1 · May 27, 2020, 4:16pm

Sorry we missed this. I've moved it to the Heartbeat forum, which is more appropriate.

This is a great idea for a feature. There's nothing stopping you from doing rollups today, but the Uptime UI is not built to support rolled-up data, because we currently depend on the schema Heartbeat sends. You'd need to use custom dashboards with the rolled-up data.

That said, it's something that we'll probably want to add in the future, although it's not a frequent request. It'd help to have some

To answer your questions:

1. I believe those are both possible.
2. No, it does not.
3. It'd be great to know some of the parameters for your setup. We find that different people have different expectations for data usage, data fidelity, retention etc. If you could provide more detail here that'd be hugely useful for us to know as we choose which features to prioritize.

WRT what a good rollup job would look like: off the top of my head, what I think you'd want to do is roll up the summary.up and summary.down fields and aggregate by monitor.id. Those count how many individual monitors were up/down for a given check. You could simply sum those within a bucket for a given monitor. You might also want an avg for monitor.duration.us to get the overall timing. Is that enough to get going on?
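The suggestion above could be sketched as a rollup job configuration along the following lines. This is a minimal sketch, not a tested setup: the job name, rollup index name, cron schedule, `delay`, and `page_size` are all assumptions chosen for illustration. Only the fields (`summary.up`, `summary.down`, `monitor.id`, `monitor.duration.us`) and the 5-minute interval come from the thread. The rollup index is named `rollup-heartbeat-5m` here because the rollup index must not match the job's own `index_pattern`.

```python
import json

def heartbeat_rollup_job(index_pattern: str, rollup_index: str, interval: str) -> dict:
    """Build a rollup job body: sum up/down counts and average duration per monitor."""
    return {
        "index_pattern": index_pattern,
        "rollup_index": rollup_index,
        "cron": "0 */5 * * * ?",  # assumed schedule: run every 5 minutes
        "page_size": 1000,         # assumed batch size
        "groups": {
            # One bucket per interval, keyed on the event timestamp.
            "date_histogram": {
                "field": "@timestamp",
                "fixed_interval": interval,
                "delay": "1m",     # assumed allowance for late-arriving documents
            },
            # Keep monitors separate within each time bucket.
            "terms": {"fields": ["monitor.id"]},
        },
        "metrics": [
            {"field": "summary.up", "metrics": ["sum"]},
            {"field": "summary.down", "metrics": ["sum"]},
            {"field": "monitor.duration.us", "metrics": ["avg"]},
        ],
    }

job = heartbeat_rollup_job("heartbeat-*", "rollup-heartbeat-5m", "5m")
print("PUT _rollup/job/heartbeat-5m")  # hypothetical job name
print(json.dumps(job, indent=2))
```

The printed request can be pasted into Kibana Dev Tools; the job then still has to be started separately. Dashboards built on the rollup index would read the summed up/down counts and the average duration per monitor per bucket.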
