Filebeat reading logs from S3

Hi,

I'm trying to ingest AWS logs that are stored in a centralised S3 bucket. I configured an SQS queue to receive the S3 notifications so Filebeat can fetch the files and push them to an Elastic Cloud index.

I'm facing the below problems:

  1. When I look at the indexed logs, each line from the log file is stored as a separate document.
  2. I get a gzip invalid header error while ingesting the WAF and CloudTrail logs.

ERROR:
2020-04-01T19:16:31.002+0530  WARN   [s3]  s3/input.go:277  Processing message failed, updating visibility timeout
2020-04-01T19:16:31.011+0530  INFO   [s3]  s3/input.go:282  Message visibility timeout updated to 300
2020-04-01T19:16:31.035+0530  INFO   [s3]  s3/input.go:282  Message visibility timeout updated to 300
2020-04-01T19:16:31.035+0530  ERROR  [s3]  s3/input.go:447  gzip.NewReader failed: gzip: invalid header
2020-04-01T19:16:31.035+0530  ERROR  [s3]  s3/input.go:386  createEventsFromS3Info failed for folder/XXXXXXXXXX/waf_logs/date/filename.gz: gzip.NewReader failed: gzip: invalid header

Hey!

Could you share your configuration please?

Also, please have a look at the docs and make sure you haven't missed anything like the required permissions.
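
For reference, the s3 input needs read access to both the SQS queue and the bucket objects. Roughly something like the policy below (a sketch only; the queue ARN matches your redacted queue, and your-bucket is a placeholder for your own bucket name):

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": [
        "sqs:ReceiveMessage",
        "sqs:ChangeMessageVisibility",
        "sqs:DeleteMessage"
      ],
      "Resource": "arn:aws:sqs:us-west-2:XXXXXXXXXX:sqs-name"
    },
    {
      "Effect": "Allow",
      "Action": "s3:GetObject",
      "Resource": "arn:aws:s3:::your-bucket/*"
    }
  ]
}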

Thanks!

Hi @ChrsMark,

Thanks for your response!

This is my config:

filebeat.inputs:
 - type: s3
   queue_url: https://sqs.us-west-2.amazonaws.com/XXXXXXXXXX/sqs-name
   visibility_timeout: 300s
   credential_profile_name: default

cloud.id: "cloudid"
cloud.auth: "elastic:{password}"

And yes, my AWS profile has admin access.

Thanks!

Thanks!

Could you also share the complete log output of Filebeat? Please run it in debug mode with ./filebeat -e -d "*".

C.

@ChrsMark

Can we set up a call to discuss this?

Thanks!

@ChrsMark Or could you give me a sample config file to get logs from an S3 bucket that contains CloudTrail, CloudFront, VPC Flow Logs, CloudWatch, and WAF logs?

@ChrsMark Is there any other module available to collect logs from an S3 bucket?

Thanks!

@Nithya Thanks for creating this issue here.

filebeat.inputs:
 - type: s3
   queue_url: https://sqs.us-west-2.amazonaws.com/XXXXXXXXXX/sqs-name
   visibility_timeout: 300s
   credential_profile_name: default
   expand_event_list_from_field: Records

cloud.id: "cloudid"
cloud.auth: "elastic:{password}"

CloudTrail logs are in JSON format, with the individual events nested under a top-level Records array, so expand_event_list_from_field: Records is needed to decode the JSON and split each file into separate events.
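
Schematically, each CloudTrail file in S3 looks like this (the field values here are illustrative; real records carry many more fields):

{
  "Records": [
    {"eventVersion": "1.05", "eventSource": "signin.amazonaws.com", "eventName": "ConsoleLogin", ...},
    {"eventVersion": "1.05", "eventSource": "s3.amazonaws.com", "eventName": "GetObject", ...}
  ]
}

With expand_event_list_from_field: Records, each element of that array becomes its own document instead of the whole file being indexed as one event.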

Or you can use the cloudtrail fileset directly in Filebeat. You can run ./filebeat modules enable aws and then in modules.d/aws.yml you should see a section for CloudTrail logs.
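
If you go the module route, the relevant part of modules.d/aws.yml would look roughly like this (a sketch; double-check the variable names against the aws module docs for your Filebeat version, and substitute your own queue URL):

- module: aws
  cloudtrail:
    enabled: true
    # SQS queue receiving notifications for the CloudTrail S3 bucket
    var.queue_url: https://sqs.us-west-2.amazonaws.com/XXXXXXXXXX/sqs-name
    var.credential_profile_name: default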

Hi @Kaiyan_Sheng,

How can I read files that have the content type application/octet-stream?

I'm streaming the CloudWatch and WAF logs from multiple accounts to a common S3 bucket using Firehose, and the resulting objects have the content type application/octet-stream.

And which content types does Filebeat accept?

Thanks!

Hi @Kaiyan_Sheng, @ChrsMark,

Can you check the above comment?

Thanks,
Nithya

@Nithya Sorry for the late response! Right now the S3 input in Filebeat reads files with bufio.NewReader unless the content type is application/x-gzip, in which case it uses gzip.NewReader instead. There is no special reader for application/octet-stream yet.
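
To illustrate that branching, here is a minimal Go sketch of the behaviour described above (not the actual s3/input.go source):

package main

import (
	"bufio"
	"compress/gzip"
	"fmt"
	"io"
	"strings"
)

// newReader picks a reader the way described above: gzip.NewReader only for
// application/x-gzip; everything else, including application/octet-stream,
// falls through to bufio.NewReader and is read as plain text.
func newReader(contentType string, body io.Reader) (*bufio.Reader, error) {
	if contentType == "application/x-gzip" {
		gz, err := gzip.NewReader(body) // returns "gzip: invalid header" if body isn't actually gzipped
		if err != nil {
			return nil, fmt.Errorf("gzip.NewReader failed: %w", err)
		}
		return bufio.NewReader(gz), nil
	}
	return bufio.NewReader(body), nil
}

func main() {
	// The octet-stream branch never errors, so the error can be ignored here.
	r, _ := newReader("application/octet-stream", strings.NewReader("a log line\n"))
	line, _ := r.ReadString('\n')
	fmt.Print(line)
}

So an octet-stream object whose contents are actually gzipped would currently be read as raw bytes rather than decompressed.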

What error message do you see when you try the config below?

filebeat.inputs:
 - type: s3
   queue_url: https://sqs.us-west-2.amazonaws.com/XXXXXXXXXX/sqs-name
   visibility_timeout: 300s
