How to configure GCS as a Filebeat input

Hello Team,

We are storing our audit logs in a GCS bucket. We would like to ingest them into Elasticsearch on demand - not on a regular schedule - using Filebeat. I have checked the S3 option, which lets us use S3-like storage as an input via providers.

I'm using the following configuration, but it is not writing any data, even though the configuration passes when I test it with Filebeat.

I suspect my input configuration is wrong in some way. Please check the following and help me understand what's wrong.

filebeat.inputs:
  - type: gcp
    project_id: gcp-project-xxx
    bucket_name: log-bucket
    credentials_file: /tmp/service-account-key.json

output.elasticsearch:
  hosts: "https://es-test-xxx.aivencloud.com"
  username: "avnadmin"
  password: "xxxxx"
  indices:
    - index: 'restore-test'

There is no gcp input; it's gcp-pubsub, and your config isn't valid for that. See GCP Pub/Sub input | Filebeat Reference [7.16] | Elastic on how to configure it. If all you want to do is read log files from an S3-compatible bucket, see the aws-s3 input on how to poll a bucket: AWS S3 input | Filebeat Reference [7.16] | Elastic.
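For reference, a minimal gcp-pubsub configuration would look something like this (the topic and subscription names here are hypothetical); note that it reads messages from a Pub/Sub subscription, not objects from a bucket:

filebeat.inputs:
- type: gcp-pubsub
  project_id: gcp-project-xxx
  topic: audit-logs-topic
  subscription.name: filebeat-audit-sub
  credentials_file: /tmp/service-account-key.json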

First I thought the same, @legoguy1000, until I found this: AWS S3 input | Filebeat Reference [master] | Elastic

I'm confused - what on that page made you think you couldn't poll a GCP bucket that is S3 compatible? See amazon web services - How to access Google Cloud Storage bucket using aws-cli - Stack Overflow on how to do it via the AWS CLI.

Apologies for the miscommunication on my end.

What I meant was that we could use a GCP provider in the Filebeat input configuration the way we are using it for S3, as follows:

filebeat.inputs:
- type: aws-s3
  non_aws_bucket_name: test-s3-bucket
  number_of_workers: 5
  bucket_list_interval: 300s
  access_key_id: xxxxxxx
  secret_access_key: xxxxxxx
  endpoint: https://s3.example.com:9000
  expand_event_list_from_field: Records

I just want to know how we can do the same to fetch GCS objects into Elasticsearch.

You would configure it like so. You can also test with the AWS CLI, using the link from the previous post, to verify the keys and endpoint.

filebeat.inputs:
- type: aws-s3
  non_aws_bucket_name: test-s3-bucket
  number_of_workers: 5
  bucket_list_interval: 300s
  access_key_id: xxxxxxx
  secret_access_key: xxxxxxx
  endpoint: https://storage.googleapis.com
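For a GCS bucket, the access_key_id and secret_access_key would be HMAC keys generated under the Interoperability settings in GCS, not the service-account JSON. As a quick sanity check of the keys and endpoint (bucket name is just an example):

aws s3 ls s3://test-s3-bucket --endpoint-url https://storage.googleapis.com

with AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY set to the HMAC key pair.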

Thanks @legoguy1000
I tried the above configuration; however, I don't understand why it is still checking for bucket_arn or queue_url even though I provided non_aws_bucket_name.

error:

WARN	[aws-s3]	awss3/config.go:54	neither queue_url nor bucket_arn were provided, input aws-s3 will stop
INFO	[crawler]	beater/crawler.go:141	Starting input (ID: 17738867761700079737)
INFO	[crawler]	beater/crawler.go:108	Loading and starting Inputs completed. Enabled inputs: 1
INFO	[input.aws-s3]	compat/compat.go:111	Input aws-s3 starting	{"id": "F62D1E3EA5C30879"}
INFO	[input.aws-s3]	compat/compat.go:124	Input 'aws-s3' stopped	{"id": "F62D1E3EA5C30879"}

Should we make any changes to the type? Perhaps changing aws-s3 to gcp-gcs (I'm not sure).

My apologies - the ability to poll non-AWS buckets was only added in 8.0.0 and wasn't backported to 7.x. You'll have to wait until 8.0 is released to do what I explained. To provide a bit more clarification: you can't just change the input name. There is a specific list of inputs that can be used: Configure inputs | Filebeat Reference [7.16] | Elastic.
