Howdy! We're using the Elastic Agent's synthetic agent running in Docker to monitor some web properties. There are no visible issues or errors, but after some time the agent "dies", for lack of a better term. After some investigation, it looks like the container is running out of disk space because the directory /usr/share/elastic-agent/data/elastic-agent-1d05ba/install/heartbeat-7.17.1-linux-x86_64 is filling up with core.XXX files, which I can only assume are core dumps.
Is there some way we can diagnose this, or something we might be doing wrong?
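As a first diagnostic step, it may help to confirm how many core.XXX files have piled up and how much space they take. The following is only a minimal sketch: the helper name `summarize_core_files` is hypothetical, and it assumes the files sit directly in the directory you pass in and follow the core.XXX naming described above.

```python
from pathlib import Path


def summarize_core_files(directory):
    """Return (count, total_bytes) for files named core.* directly inside directory."""
    # Only regular files whose names start with "core." are counted;
    # subdirectories are not searched.
    files = [p for p in Path(directory).glob("core.*") if p.is_file()]
    return len(files), sum(p.stat().st_size for p in files)


# Example call (the path is whatever directory is filling up in your container):
# count, total = summarize_core_files("/path/to/heartbeat/install/dir")
# print(f"{count} core files, {total / 1024 ** 2:.1f} MiB")
```

Running this periodically inside the container would show whether the count grows with each synthetic run, which would support the core dump theory.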
Reading over that issue, that's exactly what we're running into. Came to the same conclusion that it was a weird SIGABRT in Chrome, and ended up disabling core dumps in the container to resolve it temporarily.
Which method did you use to disable core dumps in the container? Last I checked, our container was set to disable them, but I'm wondering if you're using a different setting, or if you had overridden that.
Additionally, I'm considering opening an issue around doing a better job enforcing the ulimit for core dumps, but it'd be great if we can repro this first.
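To check what core dump limit a process in the container actually sees, one option is to read `RLIMIT_CORE` from inside it. This is a rough sketch using Python's standard `resource` module (Unix only); the function names are hypothetical, and it assumes Python is available in the container, which may not be the case for the stock image.

```python
import resource


def core_dump_limits():
    """Return the (soft, hard) RLIMIT_CORE values for the current process."""
    return resource.getrlimit(resource.RLIMIT_CORE)


def disable_core_dumps():
    """Set the soft and hard core limits to 0 and return the new limits."""
    # Lowering a limit never needs extra privileges. The new limit applies to
    # this process and to children it starts afterwards, not to processes
    # that are already running.
    resource.setrlimit(resource.RLIMIT_CORE, (0, 0))
    return core_dump_limits()
```

A soft limit of 0 means no core files should be written by that process. Because limits are inherited at process start, a call like `disable_core_dumps()` would only affect the browser if it ran in a parent process such as an entrypoint wrapper; whether that fits this setup is an assumption.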
I'll be honest, I'm not sure what the default might be.
For context, we're running a relatively vanilla AWS Elastic Container Service cluster, with the container compute being provided by Fargate. So, I'm not super familiar with what the defaults are aside from the odd issue. You could probably reproduce with a bare-bones cluster and the lowest tier of task compute, and just run a synthetic at a short interval.
Glad to see it's being upgraded! I'm a big Playwright fan and reached for some recent APIs when writing the tests in question, and ran into sharp edges as a result (especially since I ran into another now solved(?) issue where errors weren't bubbling up in an easily visible way).