Induced CPU load by APM nodejs agent if elastic server saturates

Javier_Aracil · May 11, 2021, 4:51pm

Hi All,

We are instrumenting a nodejs backend with Elastic APM agent. At some point, the elastic server cannot cope with the incoming messages from the agent. I wonder what happens in that case. I understand that such incoming messages will be dropped, and the backend CPU load will not be affected because of this.

Generally speaking, what CPU load can the agent induce in the backend?

Thanks very much,

Javier

trentm · May 11, 2021, 5:47pm

Hello Javier,

Thanks for the question. I'm assuming by "elastic server" here you mean the APM server (i.e. Configuration options | APM Node.js Agent Reference [4.x] | Elastic) to which the APM agent sends event data.

The first important thing to note is that how the Node.js APM agent copes with an overloaded or non-responding APM server was significantly changed in v3.14.0 of the agent (Node.js Agent version 3.x | APM Node.js Agent Reference [4.x] | Elastic).

That is correct (at least with v3.14.0). The APM agent will buffer up to 1024 (Configuration options | APM Node.js Agent Reference [4.x] | Elastic) events and start dropping events if the APM server won't receive them fast enough.

That is a hard question to quantify in general. Often it depends a lot on the application being instrumented. Some disorganized thoughts:

The async_hooks mechanism that the agent uses does definitely have some overhead on the application. From recent anecdotal observations, if the application is very heavy on async operations (lots of timers, nextTick, setImmediate, and especially Promise usage), that overhead can be higher.
If you are experiencing too-high CPU overhead from the agent, my first suggestions would be to consider setting transactionSampleRate (Configuration options | APM Node.js Agent Reference [master] | Elastic) to a value less than 1, and captureSpanStackTraces=false (Configuration options | APM Node.js Agent Reference [master] | Elastic). If the application produces lots of spans, the latter can help a lot. Coming versions of the agent will have some improvements on stacktrace collection overhead and will probably change default settings to not capture stacktraces for all spans (likely just for slower ones).
Otherwise working through Performance Tuning | APM Node.js Agent Reference [master] | Elastic can help.

I hope that helps answer your question.

Cheers,
Trent

Javier_Aracil · May 11, 2021, 5:53pm

Thanks Trent, yes, APM server. We will proceed as suggested, best

Javier

system · June 1, 2021, 1:53pm

This topic was automatically closed 20 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Heavy CPU usage in APM Agents when APM-servers goes down APM	7	1087	February 5, 2019
Does Elastic APM for node js uses different thread for sending events or the same node thread? APM	4	1015	February 1, 2018
Very high CPU usage on some projects APM php	3	583	March 30, 2023
High CPU usage with Python agent APM	4	686	June 18, 2018
Intake very slow (more than 10secondes) APM server	5	945	April 29, 2021

Induced CPU load by APM nodejs agent if elastic server saturates

Related topics