Hi there!
Weeks ago I opened a discussion about a similar problem but then I found a workaround.
Now I can't find one (I do not want to use external script when there's a specific ES function for aggregating data).
Now, I have some data split in 24 indices per day (e.g. my_index-20190700, my_index-20190701...) each one for a specific hour of the day and each collecting something like 700k docs (3.5 Gb). Those indices have a retention period of 24 hours.
What I'm trying to do is to schedule a Rollup Job to run at 00.00 each day and collect the daily data, so data from indices my_index-*.
I created a Rollup Job like the following:
{
"config": {
"id": "daily_rollup_job",
"index_pattern": "my_index-*",
"rollup_index": "daily_index_rollup",
"cron": "0 0 0 * * ?",
"groups": {
"date_histogram": {
"interval": "24h",
"field": "my_interesting_date_field",
"time_zone": "UTC"
},
"terms": {
"fields": [
"my_interesting_date_field.keyword",
"another_interesting_field1.keyword",
"another_interesting_field2.keyword",
"another_interesting_field3.keyword"
]
}
},
"metrics": [],
"timeout": "20s",
"page_size": 10000
}
It did run for a couple of days but then it stopped.
Last time it run was on July 1st.
What is the problem with these Rollup Jobs? Why do they stop running with no apparent reason?
Do I have to increase the Delay for such an amount of data? Which is the exact use of the Delay parameter?
Thank you in advance!