Proactive ELK stack monitoring

Zachary_Buckholz · August 20, 2015, 12:14pm

I am looking for ideas on how to send an automated UUID through the ELK stack and make sure it arrives in the elasticsearch cluster.

Basically some sort of elasticsearch process that generates the UUID inserts a record for it, then another external process possibly from Rundeck grabs that record and flips a switch saying it's active. That Rundeck job then logs into a server and uses the Linux 'logger' command to generate a syslog message with that UUID.

Once the switch is flipped elasticsearch starts some sort of timer, if that timer expires before that UUID makes its way back through the ELK stack an alert is generated.

Sort of like a bullet or echo ping being sent waiting for a response.

This tests multiple points of failure at one time.

Suggestions? Does anyone know of something similar to this already out there?

Thanks
Zach

magnusbaeck · August 20, 2015, 12:24pm

I'm going to use Lovebeat to detect significant changes in the frequency of messages flowing through the system. You can use the heartbeat input plugin to generate heartbeats or a cron job if you don't want to dependent on Logstash itself, then use the statsd or graphite output to send the beats to Lovebeat. Such a setup will indirectly check whether ES is accepting the messages since problems with the elasticsearch output will halt the whole pipeline.

This method can also be used to monitor the actual messages sources, i.e. you can easily detect if Logstash on host X has been hosed and its messages no longer come through.

Zachary_Buckholz · August 20, 2015, 2:48pm

Thanks for the pointer to Lovebeat. I have been looking at it and it seems very promising for my use case. Have you used Lovebeat before? It seems like the only way to add Services is via the NewServices call from the web UI.

Do you have a way to create services via a REST call? The javascript used in the web UI shows how it would be done, but I am not sure if the Lovebeat backend would accept a remote request to create a new service. Do you know if this is possible?

Thanks
Zach

magnusbaeck · August 20, 2015, 2:55pm

I haven't used Lovebeat for real work yet, but it's on my list for next week. I've just dabbled with it and written a couple of source code patches.

You don't have to pre-create services. You can just start posting heartbeats (via the HTTP, statsd, or Graphite protocols). See the examples towards the end of the readme file.

Zachary_Buckholz · August 20, 2015, 3:18pm

Yes I just tried it and sure enough it was magic. Thanks again for the pointers.

otisg · August 20, 2015, 10:07pm

Hi,

Hm, not sure if I got it, but are you trying to test that ES is up and running? If so, I know our SPM, and I assume all other modern monitoring tools can do that for you.... in SPM we call them Heartbeat Alerts - basically alerts that notify you if SPM doesn't hear from your ES cluster for some time. I assume you could do this even with Nagios.

Otis

Monitoring * Alerting * Anomaly Detection * Centralized Log Management
Solr & Elasticsearch Support * http://sematext.com/

Topic		Replies	Views
Monitor inactivity in log files Logstash	15	3563	July 6, 2017
Monitoring Log Activity into Logstash Logstash	2	898	July 6, 2017
How to send heartbeats to Elasticsearch in a secured network? Logstash ilm-index-lifecycle-management	14	419	March 22, 2024
Ping remote services for availability and log results to Elasticsearch or send to Logstash Beats heartbeat	3	1560	October 19, 2020
ELK web APIs Logstash	5	929	December 13, 2017

Proactive ELK stack monitoring

Otis

Related topics