Endpoint Agent clock problem in sleep mode

simoner · November 22, 2023, 1:44pm

Hello,
I found a problem with Endpoint agent and sleep mode.
When my computer wakes itself up from sleep mode Endpoint Agent logs:

{"file":{"line":140,"name":"Entry.cpp"}}},"message":"Entry.cpp:140 The system clock advanced more than 5 seconds in an expected 1 second window (previous time=2023-11-22T12:00:05.0Z, now=2023-11-22T13:06:11.0Z)","process":{"pid":4416,"thread":{"id":4617}}}

and every endpoint telemetries from now on will have a big discrepancy between event.ingested and event.created:

[...]
"event": {
      "agent_id_status": "verified",
      "ingested": "2023-11-22T13:07:20Z",
      "created": "2023-11-22T11:59:12.6194252Z",
      "module": "endpoint",
[...]

it seems that the endpoint does not detect the clock change, which the operating system does:

<syslog>
Nov 22 14:06:10 rakel systemd-resolved[4355]: Clock change detected. Flushing caches.

My machine is an Asus ZenBook
My OS is Ubuntu 22.04.3 LTS
Elastic version and agent 8.10.4

Norrie · November 23, 2023, 12:35am

Hello @simoner!

Thanks for reaching out and providing the logs and the event showing the discrepancy.

Endpoint calculates the created time by querying the OS for the system time at the event creation time. The ingested time is set when the event is ingested into Elasticsearch.

Between event creation and ingestion, Endpoint can buffer the event locally. If the host has gone to sleep, locally buffered or queued events are expected to have a large discrepancy between these two timestamps.

You can test this by observing if the issue is resolved for new events generated after the host has awakened and had time for the queues to flush.

{"file":{"line":140,"name":"Entry.cpp"}}},"message":"Entry.cpp:140 The system clock advanced more than 5 seconds in an expected 1 second window (previous time=2023-11-22T12:00:05.0Z, now=2023-11-22T13:06:11.0Z)","process":{"pid":4416,"thread":{"id":4617}}}

For further context, the log that you have observed here is Endpoint detecting that sleep mode has occurred by comparing the system time against a saved value on a periodic interval.

simoner · November 23, 2023, 9:57am

Hello Norrie,
thanks for your answer.
Regarding your observation:

You can test this by observing if the issue is resolved for new events generated after the host has awakened and had time for the queues to flush.

the problem is right here:
new events (after the host has awakened) continue to have a large discrepancy, to be more precise, if we define t0 (time entering sleep mode) and t1 (awakening time), all events generated after awakening (and forever until I restart machine) have an event.ingested t1+x and event.created t0 +x.

I left my computer running for several hours after waking up and the problem persists, so I don't think I can depend on the cache.

Norrie · November 23, 2023, 7:56pm

Thanks for going through the effort of running the test and validating that fishing the buffer did not resolve the issue.

To take this further, could you please provide a diagnostic dump of the running Endpoint after the issue has occurred?

Verify and share Endpoint's version (sudo /opt/Elastic/Endpoint/elastic-endpoint version)
Collect the diagnostics (sudo /opt/Elastic/Endpoint/elastic-endpoint diagnostics)
Share the resulting zip file via upload.

I will DM you an upload link to share the diagnostic package.

Norrie · November 30, 2023, 11:14pm

Hi Simone,

We have received your logs and are conducting an investigation.
In the meantime, we think you could work around this issue by using the kprobe data capture mode. This can be found in the advanced settings of the Defend integration.

To do this wou will need to set linux.advanced.kernel.capture_mode to kprobe.

system · December 28, 2023, 11:14pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Transactions per hour elastic agent Elasticsearch elastic-agent	3	462	October 15, 2020
Endpoint agent consistent 90+% CPU for some PCs Endpoint Security	16	11862	March 17, 2021
Elastic Endpoint Security with Elastic Agent Endpoint Security	16	3040	November 10, 2020
Issues with Elastic Agent Beats elastic-agent	3	400	December 23, 2020
No Host events Endpoint Security Endpoint Security	2	459	November 7, 2022

Endpoint Agent clock problem in sleep mode

Related topics