Network connections not being accepted in all cases

dendle · June 29, 2016, 2:20pm

Hello Folks,

I have an elasticsearch cluster (v1.7.5) on Ubuntu (4 data, 3 masters).

Normal operation looks fine - nice response time.

This is the case when used behind a load balancer, or when I stipulate all node IPs in my cluster and the client (my application) connects directly to each node.

However, intermittently, elasticsearch does not accept new connections. By this I mean, the TCP 3 way handshake is not completed; the initial SYN frame arrives at the host, on the correct port, but no SYN, ACK frame is sent back.

I have marvel, and looked at the request queues (LISTENER THREAD POOL REJECTED), and they seem fine (0 rejections).

I cannot see anything in the logs that ever said anything to do with network issues, and I was wondering if
a) Anyone has similar issues
b) Anyone can point me in a direction for debugging further

Thanks for your assistance in advance!

Kind Regards,
Matt

warkolm · July 1, 2016, 4:11am

It sounds like a LB issue, is there any sort of connection reset/timeout values that you can change?

dendle · July 1, 2016, 12:25pm

Hi Mark,

I've ruled out the load balancer itself, by configuring the NEST client with the data node IPs directly.

In this configuration, the issue still occurs.

Supporting services were able to trace the un-answered SYN packet up until it gets swalled by dev/null or similar.

dendle · August 3, 2016, 4:17pm

Has no-one had this type of issue before?

Perhaps someone could tell me if elasticsearch - under extreme loads - ever ignores a connection attempt?

And by connection attempt, I mean the Tcp three way handshake - first leg (SYN).

Or does it answer with Service Unavailable? (I have seen it do this before)

Im sorry to be so vague, but I dont know if this is some kind of failure condition, or a network issue with my host (Microsoft Azure).

Any ideas would be greatly appreciated!

Regards,
Matt

Topic		Replies	Views
Dropped HTTP Connections when Indexing Elasticsearch	2	375	July 6, 2017
Elasticsearch cluster instability Elasticsearch	13	2820	July 6, 2017
Cluster connection issues when the machines hosting the nodes are restarted for service maintanance Elasticsearch	7	1011	July 6, 2017
Elasticsearch nodes behaving strangely, timeouts, discovery, etc. (solution) Elasticsearch	1	350	July 6, 2017
Connection refused after installing new version of elasticsearch Elasticsearch	5	1058	July 5, 2017

Network connections not being accepted in all cases

Related topics