Is it better to have one pipeline or multiple pipelines?

Hello

From the start, I've implemented the Elastic Stack using Logstash as the receiver of logs and the sender of those logs to Elasticsearch.

I've always implemented it using multiple pipelines, each defined in its own configuration file.

This means that on the Elastic Stack server (a single node) I have to open a separate port for each of my pipelines and configurations.

One case I have been discussing with a coworker is syslog files.

Since each provider sends syslog in a different format, I have a configuration file for each one, filter the events as needed, and output them to an Elasticsearch index.
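For example, one of those per-provider files looks roughly like this (the port, grok pattern, and index name are just placeholders):

input {
    udp {
        port => 5515    # a dedicated port for this provider
    }
}
filter {
    grok {
        # provider-specific parsing
        match => { "message" => "%{SYSLOGLINE}" }
    }
}
output {
    elasticsearch {
        hosts => ["localhost:9200"]
        index => "syslog-provider-a-%{+YYYY.MM.dd}"
    }
}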

The other way this could be done is to have one file listening on just one port, with many filters inside so that each format is mutated in its own way. Similarly, in the output section, I would send each one to its index as needed.

Personally, I find a huge configuration file confusing and hard to manage, which is why I separated everything.

But I do want to know: is this wrong? Should I just put it all in one file?

Thank you

I think this is why you have IF conditionals and filters: you need something to distinguish each log from the others and route them with the IF statements.

The advantage of multiple pipelines is that the pipelines are completely isolated from one another, so your events won't get mixed up if you forget a conditional or something like that.

Multiple pipelines is a feature that was implemented in version 6.x; before that, to run different pipelines you needed to run separate Logstash instances or have a lot of conditionals in one big file.

In the example you gave, where you have many syslog sources with different formats, you can try pipeline-to-pipeline communication: this way you would have only one input and use conditionals to direct the messages to the other pipelines.

For example:

input {
    udp {
        port => 5514
    }
}
output {
    if "stringA" in [message] {
        pipeline {
            send_to => "pipeline1"
        }
    }
    if "stringB" in [message] {
        pipeline {
            send_to => "pipeline2"
        }
    }
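    # events that match neither conditional are not sent anywhere and are discarded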
}

Then you would need two other pipelines, pipeline1 and pipeline2, each with the following format (using the matching address):

input {
    pipeline {
        address => "pipeline1"
    }
}
filter { ... }
output { ... }

This way you have just one input and just one listening port, and you can isolate your pipelines using the internal communication between them.
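You also need to declare the three pipelines in pipelines.yml, in the Logstash settings directory. A minimal sketch, assuming the config files live under /etc/logstash/conf.d (the ids and paths here are hypothetical):

# pipelines.yml: one entry per pipeline
- pipeline.id: distributor
  path.config: "/etc/logstash/conf.d/distributor.conf"
- pipeline.id: pipeline1
  path.config: "/etc/logstash/conf.d/pipeline1.conf"
- pipeline.id: pipeline2
  path.config: "/etc/logstash/conf.d/pipeline2.conf"

Note that send_to and address are virtual addresses, independent of the pipeline.id values; they only need to match each other.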

If you want to keep everything in just one pipeline and use conditionals, you can organize it better by splitting it into multiple files.

For example:

000-input.conf
100-filters-syslogA.conf
110-filters-syslogB.conf
120-filters-syslogC.conf
...
XXX-filters-syslogN.conf
999-output.conf 

Then each one of the filter files would have the following format:

filter {
    # condition to isolate this source's events
    if "stringA" in [message] {
        # this source's filters, for example:
        grok {
            match => { "message" => "%{SYSLOGLINE}" }
        }
    }
}

Hello

Thank you for your explanation. I also did not know that pipeline-to-pipeline communication was possible.

My files are a bit different; I do the input, filter, and output all in the same file. Is this incorrect/worse/etc.?

It makes no difference to Logstash; it is more about how you want to organize your pipeline.

When Logstash starts, it merges all the inputs, filters, and outputs for your pipeline, so it doesn't matter whether they are in one file or in multiple files.

Using multiple files helps when you have really big pipelines, since you can edit just a specific part.
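For example, a single pipeline can point at a whole directory of config files. A minimal sketch of the pipelines.yml entry, assuming the usual conf.d layout (the id and path are hypothetical):

- pipeline.id: main
  # all matching files are concatenated in alphabetical order,
  # which is why numeric prefixes like 000- and 999- control ordering
  path.config: "/etc/logstash/conf.d/*.conf"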

Understood.

So, in your opinion, is it better or worse to have several Logstash syslog configuration files, each listening on a different port, or just one?

It is your choice; there is no right or wrong, nor better or worse.

It depends entirely on your use case and how you want to organize your pipelines. Some people prefer to have one input for each source; others prefer to have just one input and use conditionals to filter. It is a personal choice.
