Index Rollup delay effect

mcguacon · May 9, 2019, 2:18pm

I previously experimented with a rollup job that ran every 30 minutes, with an interval of 1 hour, and a delay of 12 hours.

I wanted to experiment with rolling up older data. I have a new job that runs every 30 minutes, with an interval of 1 hour, and a delay of 7 days. After approx. 18 hours, I see no data has been rolled up. Do I need to wait a week for data to appear? Or am I missing something else? The previous job worked as expected.

polyfractal · May 17, 2019, 3:30pm

Hey @mcguacon, apologies for the delay (on a business trip right now with minimal connectivity).

This is correct. The indexer wakes up on the cron schedule, looks at the most recent timestamp in the index, and decides whether it needs to process data based on the last position and the delay value. In this case, the delay means no documents will be generated until 7 days have passed.

The indexer isn't doing any work or buffering things up in memory, it's just going back to sleep until the delay has passed.

Hope that helps!

mcguacon · May 17, 2019, 5:59pm

That does help, thank you!
So just to clarify, if I make a job now with a 7d delay, it won't retroactively roll up all the data in my index that's 7 days old right now?

polyfractal · May 22, 2019, 1:28pm

Hey, sorry for the delay (heh), was traveling for work.

It will, actually. The way the indexer works is like this:

Indexer starts. Is there a persisted checkpoint for this job? If no, start from the beginning of the index; otherwise, pick up from the most recent checkpoint.
Is there data that is older than now - delay? If yes, start rolling up that data. If no, go back to sleep.
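The two steps above can be sketched roughly as follows. This is a simplified illustration of the decision only, not the actual indexer code: the function name `next_rollup_window` and its parameters are made up for this example, and details such as paging and rounding to histogram buckets are ignored.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional, Tuple


def next_rollup_window(
    now: datetime,
    delay: timedelta,
    oldest_doc: datetime,
    checkpoint: Optional[datetime] = None,
) -> Optional[Tuple[datetime, datetime]]:
    """Return the (start, end) range to roll up on this wake-up, or None to sleep.

    Hypothetical helper: a sketch of the logic described in the thread.
    """
    # Step 1: resume from the persisted checkpoint if there is one,
    # otherwise start from the beginning of the index.
    start = checkpoint if checkpoint is not None else oldest_doc
    # Step 2: only data older than (now - delay) is eligible for rollup.
    cutoff = now - delay
    if start >= cutoff:
        return None  # nothing old enough yet, go back to sleep
    return (start, cutoff)


now = datetime(2019, 5, 22, tzinfo=timezone.utc)

# New job, 7d delay, index already holds 30 days of data:
# everything up to 7 days ago is eligible immediately.
window = next_rollup_window(now, timedelta(days=7), oldest_doc=now - timedelta(days=30))

# New job, 7d delay, index only holds 1 day of data: nothing to do yet.
idle = next_rollup_window(now, timedelta(days=7), oldest_doc=now - timedelta(days=1))
```

Under these assumptions, a freshly created job does not have to wait out the delay before touching existing data; the delay only holds back the most recent slice of the index.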

So for example, if your index has three weeks of data, the first two weeks and one day will be rolled up, then the remaining 6 days will block until another day's worth of data is ingested (assuming the index ends with a "now'ish" timestamp).

If instead you had three weeks of data, but all the data was a year old, all the data would be rolled up because the most recent timestamp in the index is older than now - delay.

Hope that helps!

mcguacon · May 22, 2019, 1:50pm

Okay, I am understanding it correctly then. So, I am confused with my results then. This job was started about a week ago.

I have had success in the past with other rollup jobs, but other ones have ended up like this, essentially doing nothing. Anecdotally, jobs I configure via the wizard don't work, while jobs I create through the API in the dev tools seem to work fine. We are on 6.5.

polyfractal · May 22, 2019, 2:19pm

Hmm. That is indeed suspicious.

Can you paste the rollup configuration that the UI generated (GET _xpack/rollup/job/<job_id>)? I'm wondering if maybe the cron is being misconfigured by the UI, and so it isn't triggering very often?

Is there anything about rollups in the server logs?

mcguacon · May 22, 2019, 5:15pm

result of that API call

{
  "jobs" : [
    {
      "config" : {
        "id" : "process_rollup",
        "index_pattern" : "metricbeat-process-*",
        "rollup_index" : "process_rollup",
        "cron" : "0 0 22 * * ?",
        "groups" : {
          "date_histogram" : {
            "interval" : "60m",
            "field" : "@timestamp",
            "delay" : "3d",
            "time_zone" : "UTC"
          },
          "terms" : {
            "fields" : [
              "sentry.server",
              "system.process.name"
            ]
          }
        },
        "metrics" : [
          {
            "field" : "system.process.cpu.total.norm.pct",
            "metrics" : [
              "avg",
              "max",
              "min"
            ]
          },
          {
            "field" : "system.process.memory.size",
            "metrics" : [
              "avg",
              "min",
              "max"
            ]
          }
        ],
        "timeout" : "20s",
        "page_size" : 1000
      },
      "status" : {
        "job_state" : "started",
        "upgraded_doc_id" : true
      },
      "stats" : {
        "pages_processed" : 3,
        "documents_processed" : 0,
        "rollups_indexed" : 0,
        "trigger_count" : 3
      }
    }
  ]
}

polyfractal · May 23, 2019, 2:41pm

Hmm. Config looks ok. Do you see anything in the server logs?

Sometimes the job can run into issues (incorrectly mapped field, like trying to average a string, etc) and will log the exception.

How long has this job been "running"? trigger_count: 3 means the cron has only fired three times, so if the job has been running for longer than three days there could be something wrong with the task itself.
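As a rough way to run that check, the sketch below counts how many times a once-a-day cron such as `0 0 22 * * ?` should have fired since the job was started and compares that with `trigger_count` from the job stats. The helper names and the start date are assumptions made for illustration (the thread only says the job was started "about a week ago"), and all times are treated as UTC.

```python
from datetime import datetime, timedelta


def expected_daily_triggers(started: datetime, now: datetime, hour: int = 22) -> int:
    """Count firings of a once-a-day cron at `hour`:00 in the range (started, now]."""
    # First scheduled firing strictly after the job was started.
    first = started.replace(hour=hour, minute=0, second=0, microsecond=0)
    if first <= started:
        first += timedelta(days=1)
    if first > now:
        return 0
    # One firing at `first`, plus one for every full day after it.
    return (now - first).days + 1


def looks_stalled(stats: dict, expected_triggers: int) -> bool:
    """True if the cron has fired fewer times than the schedule implies."""
    return stats["trigger_count"] < expected_triggers


# Stats copied from the job output posted in the thread.
stats = {
    "pages_processed": 3,
    "documents_processed": 0,
    "rollups_indexed": 0,
    "trigger_count": 3,
}

# Hypothetical start time, roughly a week before the stats were posted.
started = datetime(2019, 5, 15, 9, 0)
now = datetime(2019, 5, 22, 13, 50)

expected = expected_daily_triggers(started, now)
stalled = looks_stalled(stats, expected)
```

With this assumed start date the schedule implies seven firings against the three reported, which is the kind of gap that would point at the task itself rather than the delay setting.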

mcguacon · May 23, 2019, 4:24pm

Any kind of key words or phrases to search for? My kibana.log is about 1.5G

polyfractal · May 23, 2019, 5:45pm

Ah sorry, it would be in the elasticsearch server log, not the kibana.log

As for keywords, anything mentioning "rollup" or "AsyncTwoPhaseIndexer"
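For a large log, a small filter like the one below may be easier than searching by hand. It is only a sketch: the function name is made up, the match is a plain case-insensitive substring test on the two keywords above, and the sample lines are invented placeholders rather than real Elasticsearch log output.

```python
from typing import Iterable, List

# Keywords suggested in the thread, lowercased for case-insensitive matching.
KEYWORDS = ("rollup", "asynctwophaseindexer")


def rollup_log_lines(lines: Iterable[str]) -> List[str]:
    """Return the lines that mention any of the rollup-related keywords."""
    return [
        line.rstrip("\n")
        for line in lines
        if any(keyword in line.lower() for keyword in KEYWORDS)
    ]


# Invented sample lines, only to show the filter's behavior.
sample = [
    "line about something unrelated\n",
    "line mentioning AsyncTwoPhaseIndexer and an exception\n",
    "line mentioning the Rollup job process_rollup\n",
]

matches = rollup_log_lines(sample)
```

To run it against a real log, pass an open file object instead of `sample`, for example `rollup_log_lines(open(path))`, where `path` is wherever your Elasticsearch server log lives on that node. Because the file is read line by line, a multi-gigabyte log does not need to fit in memory.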

system · June 20, 2019, 5:45pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
