I am running elastic-operator 1.0.0-beta1 and hitting the following issue.
image: docker.elastic.co/eck/eck-operator:1.0.0-beta1
❯ kubectl get pods -n elasticsearch
NAME                           READY   STATUS             RESTARTS   AGE
ct-es-es-data-nodes-0          1/1     Running            0          19d
ct-es-es-data-nodes-1          1/1     Running            0          19d
ct-es-es-data-nodes-2          1/1     Running            0          19d
ct-kibana-kb-c89445c75-cvvjf   1/1     Running            1          19d
elastic-operator-0             0/1     CrashLoopBackOff   5565       19d
I suspect an EKS upgrade performed 19 days ago restarted all the AWS nodes in the cluster. Since then the operator pod has been stuck in CrashLoopBackOff and logs the following on every start:
❯ kubectl logs elastic-operator-0 -n elasticsearch
{"level":"info","@timestamp":"2020-04-15T17:55:27.084Z","logger":"manager","message":"Setting up client for manager","ver":"1.0.0-beta1-84792e30"}
{"level":"info","@timestamp":"2020-04-15T17:55:27.084Z","logger":"manager","message":"Setting up scheme","ver":"1.0.0-beta1-84792e30"}
{"level":"info","@timestamp":"2020-04-15T17:55:27.085Z","logger":"manager","message":"Setting up manager","ver":"1.0.0-beta1-84792e30"}
{"level":"info","@timestamp":"2020-04-15T17:55:27.590Z","logger":"controller-runtime.metrics","message":"metrics server is starting to listen","ver":"1.0.0-beta1-84792e30","addr":":0"}
{"level":"error","@timestamp":"2020-04-15T17:55:27.592Z","logger":"manager","message":"unable to get operator info","ver":"1.0.0-beta1-84792e30","error":"configmaps \"elastic-operator-uuid\" is forbidden: User \"system:serviceaccount:elasticsearch:elastic-operator\" cannot get resource \"configmaps\" in API group \"\" in the namespace \"elasticsearch\"","stacktrace":"github.com/go-logr/zapr.(*zapLogger).Error\n\t/go/pkg/mod/github.com/go-logr/zapr@v0.1.0/zapr.go:128\ngithub.com/elastic/cloud-on-k8s/cmd/manager.execute\n\t/go/src/github.com/elastic/cloud-on-k8s/cmd/manager/main.go:254\ngithub.com/elastic/cloud-on-k8s/cmd/manager.glob..func1\n\t/go/src/github.com/elastic/cloud-on-k8s/cmd/manager/main.go:74\ngithub.com/spf13/cobra.(*Command).execute\n\t/go/pkg/mod/github.com/spf13/cobra@v0.0.5/command.go:830\ngithub.com/spf13/cobra.(*Command).ExecuteC\n\t/go/pkg/mod/github.com/spf13/cobra@v0.0.5/command.go:914\ngithub.com/spf13/cobra.(*Command).Execute\n\t/go/pkg/mod/github.com/spf13/cobra@v0.0.5/command.go:864\nmain.main\n\t/go/src/github.com/elastic/cloud-on-k8s/cmd/main.go:27\nruntime.main\n\t/usr/local/go/src/runtime/proc.go:203"}
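The error in the log is an RBAC denial: the operator's service account is not allowed to get configmaps in the elasticsearch namespace. One way to confirm this from the client side is sketched below, assuming the service account and namespace are exactly those named in the error message. The helper name check_operator_access is hypothetical; it only wraps the standard `kubectl auth can-i` command with impersonation, which itself requires that the current user is allowed to impersonate service accounts.

```shell
# Service account and namespace copied from the "forbidden" error in the log.
NAMESPACE="elasticsearch"
SERVICE_ACCOUNT="system:serviceaccount:elasticsearch:elastic-operator"

# Hypothetical helper: asks the API server whether the operator's service
# account may perform the given verb ($1) on the given resource ($2).
# kubectl prints "yes" or "no" for the impersonated account.
check_operator_access() {
  kubectl auth can-i "$1" "$2" --as="$SERVICE_ACCOUNT" -n "$NAMESPACE"
}
```

Running `check_operator_access get configmaps` would be expected to print "no" while the problem persists. If it does, the Role/ClusterRole and bindings for the operator's service account are the next thing to inspect. This check is read-only and does not touch the Elasticsearch or Kibana pods.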
Can someone help me understand the cause, and how to recover the operator pod without impacting the existing ES cluster?