- Search Guard and Elasticsearch version - 5.6.9
- 32GB of max heap configured
- Installed and used enterprise modules, if any - No
- JVM version and operating system version - 1.8, CentOS
- Other installed Elasticsearch or Kibana plugins, if any - None
Configuration: Loadbalancer node and 3 data nodes.
Loadbalancer: ES node for Querying and for data ingest.
Nginx between Client and ES cluster
50 clients connecting to a single Elasticsearch coordinating node every 10seconds
Problem Statement:
Client connections fail with the below exception
message: Error running query: ConnectionTimeout caused by - ConnectTimeout(HTTPSConnectionPool(host='xxxx', port=9200): Max retries exceeded with url: /metrics-*/_search?_source_include=Timestamp%2C%2A&ignore_unavailable=true&scroll=30s&size=10000 (Caused by ConnectTimeoutError(<requests.packages.urllib3.connection.VerifiedHTTPSConnection object at 0x2044e90>, 'Connection to xx.yy timed out. (connect timeout=20)')))
num_hits: 19
num_matches: 3
traceback: [
"Traceback (most recent call last):",
" File "/var/lib/elastalert/elastalert-0.1.29/elastalert/elastalert.py", line 390, in get_hits",
" **extra_args",
" File "/usr/lib/python2.7/site-packages/elasticsearch/client/utils.py", line 73, in _wrapped",
" return func(args, params=params, **kwargs)",
" File "/usr/lib/python2.7/site-packages/elasticsearch/client/init.py", line 623, in search",
" doc_type, '_search'), params=params, body=body)",
" File "/usr/lib/python2.7/site-packages/elasticsearch/transport.py", line 312, in perform_request",
" status, headers, data = connection.perform_request(method, url, params, body, ignore=ignore, timeout=timeout)",
" File "/usr/lib/python2.7/site-packages/elasticsearch/connection/http_requests.py", line 84, in perform_request",
" raise ConnectionTimeout('TIMEOUT', str(e), e)",
"ConnectionTimeout: ConnectionTimeout caused by - ConnectTimeout(HTTPSConnectionPool(host='xx.yy', port=9200): Max retries exceeded with url: /metrics-/_search?_source_include=Timestamp%2C%2A&ignore_unavailable=true&scroll=30s&size=10000 (Caused by ConnectTimeoutError(<requests.packages.urllib3.connection.VerifiedHTTPSConnection object at 0x2044e90>, 'Connection to xx.yy timed out. (connect timeout=20)')))"
]
Snapshot of JVM after searchguard install and connection timeout which surfaced.