A question about the Logstash S3 input plugin

pk.241011 · October 10, 2023, 11:37pm

Hi All,

We run Logstash on multiple EC2 instances behind a load balancer for reliability, and we are thinking of using the S3 input plugin. Since the servers are created by AWS auto scaling, they are all identical.

I am trying to get some clarity on how the S3 input plugin behaves when multiple Logstash instances poll the same S3 bucket and prefix.

My understanding is that this should be fine, since each S3 object's key looks like a file path but is not actually a file path.

My configuration would look like the sample below. Each S3 object is deleted after it is read, so there is no need to track the last handled file.

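input {
	s3 {
		bucket => "testbucket"
		prefix => "get/this/data"
		region => "us-east-2"
		# Remove each object from the bucket after it has been read
		delete => true
		# Seconds to wait between polls of the bucket
		interval => 100
		# Do not persist the last-processed timestamp
		sincedb_path => "/dev/null"
		additional_settings => {
			"force_path_style" => true
			"follow_redirects" => false
		}
	}
}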

leandrojmp · October 11, 2023, 12:31am

This input does not support that. It can lead to duplicates, because you cannot guarantee that multiple Logstash instances will not try to read the same object in S3 at the same time.
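To illustrate the race, here is a minimal Python sketch. It is a toy model, not the plugin's actual code: the dict `bucket` stands in for S3, and the helper names (`list_keys`, `process`, `delete`) are made up for this example. It assumes both instances list the prefix before either one has deleted anything.

```python
# Toy model: an in-memory dict stands in for the S3 bucket (assumption).
bucket = {
    "get/this/data/a.log": "line-a",
    "get/this/data/b.log": "line-b",
}

def list_keys(prefix):
    """Return the keys currently under the prefix, like one polling pass."""
    return sorted(k for k in bucket if k.startswith(prefix))

def process(instance, keys, emitted):
    """Read each listed object and emit an event for it."""
    for key in keys:
        body = bucket.get(key)
        if body is None:
            continue  # already deleted by another instance
        emitted.append((instance, key, body))

def delete(keys):
    """Delete the objects after reading, mirroring delete => true."""
    for key in keys:
        bucket.pop(key, None)

emitted = []
# Both instances poll before either has deleted anything.
seen_by_1 = list_keys("get/this/data")
seen_by_2 = list_keys("get/this/data")
process("logstash-1", seen_by_1, emitted)
process("logstash-2", seen_by_2, emitted)
delete(seen_by_1)
delete(seen_by_2)

# Every object was emitted once per instance, so each one is duplicated.
duplicates = len(emitted) - len({key for _, key, _ in emitted})
print(duplicates)
```

Deleting the object after reading does not help here, because nothing stops the second instance from reading it before the first instance's delete happens.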

If you need multiple instances reading the same bucket, you should use something that supports it. One option is Filebeat with the AWS S3 input with SQS configured, which is also the recommended way to consume logs from S3 buckets.
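As a rough sketch of why a queue helps, the Python below models a queue with a visibility timeout. `ToyQueue` is a hypothetical in-memory stand-in, not the AWS or Filebeat API, and it assumes one message per new S3 object. A message handed to one consumer is hidden from the others until it is deleted or the timeout expires.

```python
class ToyQueue:
    """In-memory stand-in for a queue with a visibility timeout (hypothetical)."""

    def __init__(self, messages, visibility_timeout):
        self.visibility_timeout = visibility_timeout
        # Map each message to the time at which it becomes visible again.
        self.visible_at = {m: 0 for m in messages}

    def receive(self, now):
        """Hand out one visible message and hide it from other consumers."""
        for msg, t in self.visible_at.items():
            if t <= now:
                self.visible_at[msg] = now + self.visibility_timeout
                return msg
        return None  # nothing visible right now

    def delete(self, msg):
        """Acknowledge a message once it has been processed."""
        self.visible_at.pop(msg, None)


queue = ToyQueue(
    ["get/this/data/a.log", "get/this/data/b.log"],
    visibility_timeout=300,
)

now = 0
first = queue.receive(now)   # taken by consumer 1
second = queue.receive(now)  # consumer 2 gets a different message
handled = [("consumer-1", first), ("consumer-2", second)]

queue.delete(first)
queue.delete(second)
leftover = queue.receive(now)  # queue is drained
```

In this model each object notification is handled by exactly one consumer. Real SQS standard queues deliver at least once, so occasional duplicates are still possible there, but the instances no longer race over a shared bucket listing.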

pk.241011 · October 11, 2023, 1:04am

Thanks. Makes things a lot clearer. I will go through the link you posted.

system · November 8, 2023, 1:04am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
