Connection timed out problem

bob96589 · September 17, 2020, 2:53am

Kibana version: 7.4.1

Elasticsearch version: 7.4.1

APM Server version: 7.6.1

APM Agent language and version: java
https://apm-ci.elastic.co/blue/organizations/jenkins/apm-agent-java%2Fapm-agent-java-mbp/detail/PR-1326/1/artifacts/

elasticapm.properties

environment=twtpehswj2ui01
application_packages=com.delta
server_urls=http://10.148.208.47:8090
log_level=TRACE
log_file=AGENT_HOME/../logs/elastic-apm.log
log_file_size=10mb
server_timeout=0

Description of the problem including expected versus actual behavior. Please include screenshots (if relevant):

Sometime it receives apm data correctly, but there are time periods that can not receive the data (there are empty spaces in the chart below). And then I check the log (attached below). It says "Connection timed out".

But I try to hit http://10.148.208.47:8090/intake/v2/events by postman and it response 202. So I think connection is not a problem.

Any suggestion how to fix the problem?
Thanks.

Apm view in Kibana

Error log

2020-09-17 08:57:58,900 [elastic-apm-server-reporter] ERROR co.elastic.apm.agent.report.IntakeV2ReportingEventHandler - Failed to handle event of type TRANSACTION with this error: Connection timed out (Connection timed out)
2020-09-17 09:12:32,628 [elastic-apm-server-reporter] ERROR co.elastic.apm.agent.report.IntakeV2ReportingEventHandler - Failed to handle event of type METRICS with this error: Connection timed out (Connection timed out)
2020-09-17 09:14:39,860 [elastic-apm-server-reporter] ERROR co.elastic.apm.agent.report.IntakeV2ReportingEventHandler - Failed to handle event of type METRICS with this error: Connection timed out (Connection timed out)
2020-09-17 09:16:48,116 [elastic-apm-server-reporter] ERROR co.elastic.apm.agent.report.IntakeV2ReportingEventHandler - Failed to handle event of type METRICS with this error: Connection timed out (Connection timed out)
2020-09-17 09:18:59,571 [elastic-apm-server-reporter] ERROR co.elastic.apm.agent.report.IntakeV2ReportingEventHandler - Failed to handle event of type METRICS with this error: Connection timed out (Connection timed out)

Entire log

gist.github.com

https://gist.github.com/bob96589/3b4d017218e48041da98f242fa6ec65c

elastic-apm.log

2020-09-17 08:55:25,413 [main] DEBUG co.elastic.apm.agent.impl.payload.SystemInfo - container ID is 6fae965720316bddcb3ad8507f074e1ed811c98ba10af371877b20c088b8dcf5
2020-09-17 08:55:25,453 [main] INFO  co.elastic.apm.agent.util.JmxUtils - Found JVM-specific OperatingSystemMXBean interface: com.sun.management.OperatingSystemMXBean
2020-09-17 08:55:25,565 [main] INFO  co.elastic.apm.agent.configuration.StartupInfo - Starting Elastic APM 1.18.1.RC1-SNAPSHOT as tomcat-application on Java 1.8.0_212 Runtime version: 1.8.0_212-8u212-b01-1~deb9u1-b01 VM version: 25.212-b01 (Oracle Corporation) Linux 3.10.0-1062.el7.x86_64
2020-09-17 08:55:25,565 [main] DEBUG co.elastic.apm.agent.configuration.StartupInfo - environment: 'twtpehswj2ui01' (source: /usr/local/tomcat/lib/elasticapm.properties)
2020-09-17 08:55:25,565 [main] DEBUG co.elastic.apm.agent.configuration.StartupInfo - server_urls: 'http://10.148.208.47:8090' (source: /usr/local/tomcat/lib/elasticapm.properties)
2020-09-17 08:55:25,565 [main] DEBUG co.elastic.apm.agent.configuration.StartupInfo - server_timeout: '0s' (source: /usr/local/tomcat/lib/elasticapm.properties)
2020-09-17 08:55:25,566 [main] WARN  co.elastic.apm.agent.configuration.StartupInfo - DEPRECATION WARNING: server_timeout: '0' (source: /usr/local/tomcat/lib/elasticapm.properties) is not using a time unit. Please use one of 'ms', 's' or 'm'.
2020-09-17 08:55:25,566 [main] DEBUG co.elastic.apm.agent.configuration.StartupInfo - application_packages: 'com.delta' (source: /usr/local/tomcat/lib/elasticapm.properties)
2020-09-17 08:55:25,566 [main] DEBUG co.elastic.apm.agent.configuration.StartupInfo - log_level: 'TRACE' (source: /usr/local/tomcat/lib/elasticapm.properties)
2020-09-17 08:55:25,566 [main] DEBUG co.elastic.apm.agent.configuration.StartupInfo - log_file: '_AGENT_HOME_/../logs/elastic-apm.log' (source: /usr/local/tomcat/lib/elasticapm.properties)

This file has been truncated. show original

Sylvain_Juge · September 17, 2020, 7:08am

From this line in the logs, it seems that you have a very optimistic timeout for server connection 0s, which probably explains why do you get so much Connection timed out errors. Can you try with a value higher than zero like 1s or with the default one (5s) ? Also, please note that this value should have a unit and is not just a number.

bob96589 · September 18, 2020, 12:56am

Hi,

Previously, I've tried to set server_timeout to 5s and 60s. And the problem still exists.

And then I found a post.

If a request to the APM server takes longer than the configured timeout, the request is cancelled and the event (exception or transaction) is discarded. Set to 0 to disable timeouts.

That's why I tried to set server_timeout to zero and without unit. I want to disable the timeout functionality. But it seems that this can not solve the problem.

Sylvain_Juge · September 21, 2020, 7:28am

Hi @bob96589,

Could you check in your server logs during the time frame where no data appears to be sent ?

If there is no visible activity during those time frames, it means the agent might not have been able to reach the server at all, which would indicate more a network issue rather than an issue with the agent. Increasing log level server-side might be required.

I assume that you only have a single apm-server instance, and thus my hypotheses are the following:

if you have a single agent, if there is nothing in server logs after increasing log level, that means there is an issue on the network
if you have more than one agent, if there is nothing in server logs, the issue is still on the network, but more on the server side (as no other agent seem able to reach it)
if you have more than one agent and some of them are able to reach the server, that means the issue might be on the network on agent side, or that there is a bug in the agent.

system · October 12, 2020, 3:28am

This topic was automatically closed 20 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Peridiocally Java APM Agent experiences errors with connection to APM server APM java	7	834	August 10, 2023
Elastic APM - Error sending data to APM server: Read timed out, response code is -1 APM java , server	1	43	February 28, 2025
Failed to submit message: 'Connection to APM Server timed out (url: http://apm-server:8200/intake/v2/events, timeout: 10000.0 seconds) APM python , server	2	1223	February 17, 2023
Elastic apm agent Read timed out issue APM java	5	2537	April 8, 2021
APM agent showing timeout Error (Connection Refused) APM	19	10149	April 17, 2019

Connection timed out problem

Related topics