Is it possible to limit http-poller to a single node in a Logstash cluster?

jba · April 16, 2023, 12:51pm

I have just discovered the http-poller plugin and it seems like it could replace some custom scripts that we have for getting monitoring data (e.g. index growth) into Elasticsearch. But it is not clear to me if I can restrict the poller to only run on 1 node in a Logstash cluster (without having to make a special configuration for the chosen node - which goes against the whole idea of clustering).

Maybe it is not important to only do the polling from one node, maybe it is not important to avoid having copies of the polled data in the index? But if there was a way to avoid 3 nodes from polling almost the exact same data, I would be interested in knowing how.

leandrojmp · April 16, 2023, 3:53pm

Can you provide more context on what you are trying to do and what is your issue? It is not clear since you didn't share any configuration or any logs.

Logstash does not run as a cluster, each logstash instance is independent from each other, so it is not clear what you mean with Logstash cluster.

jba · April 17, 2023, 2:26pm

By cluster I mean that we have 3 identically configured Logstash-nodes accepting input from a large number of application-nodes, and sending data to a Elasticsearch cluster. The 3 Logstash nodes might not technically be a cluster, but we take great care never to stop all of them at the same time so that our applications will always be able to ship/beat their logs to the Elasticsearch cluster.

The 3 Logstash nodes are monitored by a product called CheckMK that runs a little shell/curl script that retrieves performance data about the pipeline with something like this:

pipelines=$(curl --silent --fail ${api_user:+--user ${api_user}:${api_pass}} "${base_url}/_node/stats/pipelines")
queuesize=$(echo $pipelines | jq -r '.pipelines[].queue.capacity.queue_size_in_bytes')

And if the queue size grows above 50 / 90 % of the max queue size, CheckMK will raise an alarm.

My intention was to have Logstash do the HTTP polling and write the result to an index which Grafana (another monitoring tool) would look at and raise the alarm.

Now that I think about, I actually need all 3 to report their local queue size, but I imagine that there are other situations where I do not want to have 3 nodes all polling the same HTTP source and reporting almost identical results (almost because a small delay might produce a slightly different result from the HTTP source).

leandrojmp · April 17, 2023, 2:42pm

There is nothing in Logstash that would allow you to do that, as mentioned before logstash does not work as a cluster, every logstash instance is independent from each other, if you only want one instance polling and endpoint, then you need to have your polling configuration in only one instance.

Badger · April 17, 2023, 2:47pm

Perhaps have each logstash instance monitor its own metrics. Use absent_over_time to check if it stops monitoring.

jba · April 18, 2023, 8:06am

Thank you.

system · May 16, 2023, 8:06am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Logstash HTTP Filter sending once per node Logstash	3	94	April 11, 2024
Logstash HTTP polling performance Logstash	2	616	July 6, 2017
Logstash http input plugin configuration of same URL on multiple nodes Logstash	2	328	September 17, 2019
Logstash configuration with multiple http_poller did'nt ran for some indices Logstash	1	130	January 28, 2024
Multiple logstash nodes gathering log by http_poller prevent duplicates Logstash	2	400	May 28, 2020

Is it possible to limit http-poller to a single node in a Logstash cluster?

Related topics