I noticed that some of my pods are going down and new ones are being created.
What is the most suitable way to detect, and set an alert for, any K8s deployment in which one or more pods frequently go down and are replaced by new ones?
For example, I would like to detect resources that go down and come back up x times within a time duration y, using Elastic Observability (Metrics and Logs).
I use ELK 7.7 with Metricbeat (Kubernetes module) and kube-state-metrics.
In our upcoming 7.9 release we'll have support for alerting on X down checks over Y time duration on a per monitor basis. I think that will cover this case!
I am sorry, I don't understand your question about the "uptime app" or "metric app".
For example, I deploy application ABC with 5 replicas onto Kubernetes.
I would like to detect when any replica of the ABC pod goes down and restarts too frequently (such as 3 restarts within the last 2 minutes).
I currently use ELK 7.7 with Metricbeat (Kubernetes module) and kube-state-metrics.
Thanks,
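
To make the "x restarts within y time" condition concrete, here is a minimal Python sketch of the check itself, independent of any Elastic alerting feature. It assumes you have already pulled per-container samples of a cumulative restart counter (for example the value kube-state-metrics reports, which I believe Metricbeat exposes as `kubernetes.container.status.restarts`; treat that field name as an assumption and verify it in your index). The names `Sample` and `is_flapping` are hypothetical and used only for illustration.

```python
from datetime import datetime, timedelta
from typing import Iterable, Tuple

# One observation of a container: (timestamp, cumulative restart count).
# Assumption: the counter only increases for a given container.
Sample = Tuple[datetime, int]


def is_flapping(samples: Iterable[Sample], max_restarts: int, window: timedelta) -> bool:
    """Return True if the restart counter rose by at least `max_restarts`
    between two samples that are no more than `window` apart."""
    ordered = sorted(samples)
    start = 0
    for end in range(len(ordered)):
        # Shrink the window from the left until it spans at most `window`.
        while ordered[end][0] - ordered[start][0] > window:
            start += 1
        # With a non-decreasing counter, the oldest sample in the window
        # gives the largest increase up to the current sample.
        if ordered[end][1] - ordered[start][1] >= max_restarts:
            return True
    return False


if __name__ == "__main__":
    t0 = datetime(2020, 7, 1, 12, 0, 0)
    # Hypothetical data: one restart every 30 seconds.
    samples = [(t0 + timedelta(seconds=30 * i), i) for i in range(4)]
    print(is_flapping(samples, max_restarts=3, window=timedelta(minutes=2)))
```

For the ABC example, you would run this once per pod or container with `max_restarts=3` and `window=timedelta(minutes=2)`. Note that a pod replaced by the Deployment gets a new name and a fresh counter, so this sketch only covers restarts of the same container; counting newly created pods would need a separate check.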