I use a custom nginx log format with some additional fields, so the stock pipeline in the Filebeat 7.x nginx module fails with a "grok parse failure".
My platform is CentOS 7.x.
Now I'm trying to figure out the best way to reuse the module while only changing the pipeline's parser line.
Option 1:
Overwrite default.json with my custom one.
Pros: quick and easy.
Cons: has to be redone after each upgrade, so a simple yum update no longer suffices; besides, overwriting files under /usr is bad practice.
Option 2:
Copy the whole nginx module to mycom_nginx, change default.json there, then enable mycom_nginx and disable the nginx module.
Pros: quick and easy.
Cons: have to keep the copy in sync whenever the module changes upstream, and have to understand all the module's details, such as the machine learning part.
I don't like either of these, but I can't figure out how to keep using the nginx module while just pointing it at a different pipeline file from /etc/filebeat/modules.d/nginx.yml.
The problem is that, in the end, you are no longer using the stock nginx module, so any solution will involve maintaining code. Your first option is probably the least error-prone. The key point is that the pipeline is a JSON file, so you can update its array of Grok patterns with a simple script that you run after every upgrade.
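A minimal sketch of that "re-patch after each upgrade" script. The pipeline path and the pattern names below are assumptions, not taken from a real install; substitute the actual path of your Filebeat version and your own Grok pattern.

```python
import json

# Assumed location of the module's ingest pipeline; verify on your system.
PIPELINE = "/usr/share/filebeat/module/nginx/access/ingest/default.json"

def prepend_grok_pattern(pipeline, pattern):
    """Insert `pattern` first in the pipeline's grok processor so it is
    tried before the stock patterns (idempotent across re-runs)."""
    for proc in pipeline["processors"]:
        if "grok" in proc:
            patterns = proc["grok"]["patterns"]
            if pattern not in patterns:
                patterns.insert(0, pattern)
            return pipeline
    raise ValueError("no grok processor found in pipeline")

# Example against a stripped-down pipeline; in practice you would
# json.load(open(PIPELINE)), patch, and json.dump it back.
example = {"processors": [{"grok": {"patterns": ["STOCK_PATTERN"]}}]}
patched = prepend_grok_pattern(example, "MY_CUSTOM_PATTERN")
```

Because the custom pattern is inserted first and only when missing, running the script after every yum update keeps the file patched without duplicating entries.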
You can also try using some processors in the input part (before the module) if you think you can "extract" the non-standard data from the incoming line before it reaches the ingest pipeline: https://www.elastic.co/guide/en/beats/filebeat/current/defining-processors.html Note that these are Filebeat processors; do not confuse them with Ingest Node processors.
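For example, a Filebeat `script` processor can rewrite `message` so the stock grok pattern still matches. This is only a sketch: it assumes your extra data is appended at the end of each line as ` my_extra="..."` (a made-up marker), and you would adapt the string handling to your real format.

```yaml
processors:
  - script:
      lang: javascript
      source: >
        function process(event) {
          var msg = event.Get("message");
          // Assumed custom suffix: ... my_extra="value"
          var idx = msg.indexOf(' my_extra="');
          if (idx !== -1) {
            // keep the custom value in its own field...
            event.Put("nginx_custom.extra", msg.substring(idx + 11, msg.length - 1));
            // ...and hand the module a standard-looking line
            event.Put("message", msg.substring(0, idx));
          }
        }
```

The trade-off is the same as before: you still maintain custom logic, but it lives in your own config instead of in files that yum owns.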