Best load balancing solution for logstash service

Saikiran_Pulijala · May 21, 2024, 9:03am

Hi Team,

Currently we have logstash deployed on AWS ECS with service discovery (DNS), which creates DNS records pointing to task containers, We are pointing filebeat to these Domain names, with this setup Due to DNS TTL consumers (filebeat) is pointing to the same containers until TTL expires and resulting other containers being idle, this solution is not effectively use logstash service. Can we use application load balancer to serve logstash requests? or another approach to load balance logstash traffic.

Architecture:

tomcat (filebeat monitors log file changes) -> logstash -> elasticsearch

Thanks & Regards,
Saikiran Pulijala

Saikiran_Pulijala · May 21, 2024, 10:56am

I have tried hosting logstash service on ECS with Application load balancers, but filebeat is trying to reach load balancer dns, getting these errors:

{"log.level":"error","@timestamp":"2024-05-21T10:04:25.977+0530","log.logger":"publisher_pipeline_output","log.origin":{"file.name":"pipeline/client_worker.go","file.line":174},"message":"failed to publish events: client is not connected","service.name":"filebeat","ecs.version":"1.6.0"}
{"log.level":"info","@timestamp":"2024-05-21T10:04:25.977+0530","log.logger":"publisher_pipeline_output","log.origin":{"file.name":"pipeline/client_worker.go","file.line":137},"message":"Connecting to backoff(async(tcp://internal-prod-unnati-internal-alb-1641767585.ap-south-1.elb.amazonaws.com:5044))","service.name":"filebeat","ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-05-21T10:07:12.028+0530","log.logger":"logstash","log.origin":{"file.name":"logstash/async.go","file.line":280},"message":"Failed to publish events caused by: lumberjack protocol error","service.name":"filebeat","ecs.version":"1.6.0"}
{"log.level":"error","@timestamp":"2024-05-21T10:07:12.028+0530","log.logger":"logstash","log.origin":{"file.name":"logstash/async.go","file.line":280},"message":"Failed to publish events caused by: lumberjack protocol error","service.name":"filebeat","ecs.version":"1.6.0"}

ashishtiwari1993 · May 21, 2024, 11:06am

HI @Saikiran_Pulijala,

You can add load balancing in filebeat output hosts.

output.logstash:
  hosts: ["localhost:5044", "localhost:5045"]
  loadbalance: true

Check more on logstash scalability.

Badger · May 21, 2024, 11:20am

Generally, no. Typically your load balancer does not balance application requests, it balances connection requests.

This is not a trivial difference. Imagine you have 4 beats each load-balancing across the same two logstash instances. If you restart one of the logstash instances then it can take over a minute to get the JVM back up. In that time all of the beats may connect to the other logstash instance. You will have 4 beats all talking to one logstash, and one logstash idle, and the balancing architecture working as designed!

leandrojmp · May 21, 2024, 12:44pm

Application Load Balancers like the AWS one normally only works for HTTP or HTTPS, beats does not use HTTP or HTTPS, it uses a proprietary protocol over TCP, so you need a Network Load Balancer, not an Application Load Balancer.

That's the reason for the errors you got.

You can create a network load balancer pointing to your logstash hosts, but you also need some settings in your logstash output in your filebeats.

Basically you need to add these settings:

pipelining: 0
loadbalance: false
ttl: 2m

Since you will have only one host in the output.logstash.hosts settings, loadbalance will be set to false, the ttl value is the amount of time that beats will try a new connection, this is required when you have logstash behind load balancers to avoid having uneven distributions as the beats connection to logstash is sticky, and pipelining is set to 0 to make the ttl option work.

The documentation has more information about those settings.

Saikiran_Pulijala · May 22, 2024, 5:40am

@ashishtiwari1993 ,
Thanks for the quick response, In our use case we have load balancer DNS as common connection point to logstash container behind ALB.

Topic		Replies	Views
Loadbalancing with filebeat Beats filebeat	3	258	August 1, 2018
How to configure filebeat for logstash cluster environment? Logstash	4	414	May 5, 2020
Filebeats logs -> aws elb of logstash Beats filebeat	5	3352	January 11, 2018
Load balancing data from Filebeat to Logstash using Nginx Logstash	11	3592	November 4, 2022
Vertical Scalability of Logstash in AWS - ECS Logstash	2	795	March 22, 2018

Best load balancing solution for logstash service

Related topics