Filebeat 7.4.0 does not recover when it fails to connect with k8s API

GreenKnight15 · October 19, 2019, 11:14pm

I am using the filebeat elastic helm chart https://github.com/elastic/helm-charts/tree/master/filebeat under an Istio service mesh.

As of filebeat 7.4.0 with the new k8s client Update kubernetes watcher to use official client-go libraries by vjsamuel · Pull Request #13051 · elastic/beats · GitHub filebeat starts faster than the Istio side car which blocks outbound requests to the k8s API. As a result filebeat never recovers the k8s connection and I lose all k8s meta data on my log packets.

│ 2019-10-17T20:33:29.733Z ERROR kubernetes/util.go:85 kubernetes: Querying for pod failed with error: Get https://10.100.0.1:443/api/v1/namespaces/bootstrap/pods/bootstrap-filebeat-x-pj8n9: dial tcp 10.100.0.1:443: connect: connection refused │
│ E1017 20:33:29.734464 1 reflector.go:125] github.com/elastic/beats/libbeat/common/kubernetes/watcher.go:235: Failed to list *v1.Pod: Get https://10.100.0.1:443/api/v1/pods?fieldSelector=spec.nodeName%3Dlocalhost&limit=500&resour │
│ ceVersion=0: dial tcp 10.100.0.1:443: connect: connection refused

I found a workaround by applying a sleep before starting filebeat, but I don't want it to be permanent.

A similar issue is describe in Kubernetes autodiscover provider fails silently if it can't connect to k8s · Issue #13081 · elastic/beats · GitHub

Will there be a retry or exponential back off added in upcoming versions?

pmercado · October 21, 2019, 10:44am

Hi @GreenKnight15,

let me try to reproduce / follow the code.
I'm not sure of if beats have a policy of retrying or failing, looks like it is mostly mandated by the library being used.

If you feel so, open an issue where that behaviour can be discussed while I guess out how it currently works.

pmercado · October 21, 2019, 11:25am

Hi again @GreenKnight15 ,

there are 2 potential features affected by this issue

autodiscover
add kubernetes metadata
a watcher that is setup for kubernetes metricsets to enrich events

I think we can do something there, but we need the to get the product designers involved, can you please create a GH issue?

thanks!

GreenKnight15 · October 21, 2019, 3:31pm

I wend ahead and opened a GH ticket

system · November 18, 2019, 5:31pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Filebeat errors while connecting to kubernetes API on start filebeat POD Beats filebeat	1	907	August 22, 2019
FIlebeat failed to connect to backoff(elasticsearch(http://elasticsearch:9200) Beats filebeat	1	1230	November 8, 2019
Connect: connection refused Beats docker , metricbeat	1	463	May 15, 2023
Beats can't reach Elastic Service Elastic Cloud on Kubernetes (ECK) docker	3	814	November 15, 2021
Filebeat 7.4.1 PODs of k8s daemonset constantly crash Beats docker , filebeat	7	1365	December 31, 2019

Filebeat 7.4.0 does not recover when it fails to connect with k8s API

Related Topics