Hello,
I am having an issue on a few boxes reporting to my elastic stack,
They look to be getting blocked/timing out with the below error in the metricbeat logs,
2018-08-07T12:33:49.810+0100 ERROR pipeline/output.go:74 Failed to connect: Get http://ELASTIC_DOMAIN:9200: dial tcp SERVER_IP:9200: getsockopt: connection timed out
I then ran a metricbeat test which returned the below,
elasticsearch: http://ELASTIC_DOMAIN:9200...
parse url... OK
connection...
parse host... OK
dns lookup... OK
addresses: SERVER_IP
dial up... ERROR dial tcp SERVER_IP:9200: getsockopt: connection timed out
Can any one advise what might be the issue? I have 243 boxes that connect fine and im unsure of the amount that are having this issue. running a GET on the elasticsearch domain returns fine in 25-75MS.
243 doesn't look like a big number for this but... could you be hitting some connection limit in your network or your hosts?
Could you try to connect from the affected hosts to the elasticsearch server or to other hosts using other tool like curl or telnet?
This looks like a connectivity problem. Are both machines in the same network? is it possible that some firewall is blocking connections from some servers but not others?
Hello @jsoriano,
Nothing looks to be getting blocked at Firewall level etc.
I think I will test building a replica elastic stack and change the host on a server with issues to point to the new one to see if that works.
I'll keep posting here with any updates.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.