If/else within Logstash output plugin

g.le · June 16, 2019, 11:24am

Hello,
I'm having a Logstash configuration similar to the below:

input {
  beats {
    port => 5044
  }
}
filter {
  clone {
    clones => ["local-dc"]
    add_tag => ["cloned"]
  }
}
output {
  if "cloned" in [tags] {
    elasticsearch { hosts => ["elastic-01:9200"] }
  }
  else {
    elasticsearch { hosts => ["elastic-02:9200"] }
  }
}

Assuming elastic-01 is temporarily unreachable, I would expect only the messages tagged with "cloned" to fail.
Nevertheless, all messages fail.

Is that normal?

Christian_Dahlqvist · June 16, 2019, 1:09pm

Yes. Logstash require all outputs to succeed before the batch is considered complete do that is expected behaviour.

g.le · June 16, 2019, 1:22pm

@Christian_Dahlqvist: Thank you for your feedback on this!
Any workarounds (besides using distinct Logstash instances per Elasticsearch destination) ?

wwalker · June 16, 2019, 3:02pm

What are you trying to accomplish? If you want to clone events and send them to the same server, just send them to a different index in the elasticsearch output.

g.le · June 16, 2019, 3:16pm

What I'm trying to accomplish is to send logs to two distinct Elasticsearch clusters.
If that requires two distinct Logstash instances, then so be it.

Christian_Dahlqvist · June 16, 2019, 3:41pm

You can have two different output pipelines (distributor pattern) within a single Logstash instance, both backed by separate persistent queues.

g.le · June 16, 2019, 4:12pm

@Christian_Dahlqvist: That is an interesting approach (albeit a beta feature).

What is not clear from the documents though is whether all filtering now needs to take place within the contents of config/pipelines.yml.
Do we need to move filtering logic away from the .conf file?

Christian_Dahlqvist · June 16, 2019, 4:14pm

The pipelines feature is just a different way of organizing your config.

wwalker · June 16, 2019, 5:09pm

That seems like an odd design decision, what's the rationale behind it?

Christian_Dahlqvist · June 16, 2019, 6:04pm

It is designed to prevent data loss. If not all outputs were required to succeed any of them could fail and drop data at any time.

wwalker · June 18, 2019, 12:00am

Wouldn't that be where DLQ comes in to pick up failed entries?

Badger · June 18, 2019, 12:19am

That would depend on why it failed, I believe. DLQ does not queue everything that fails.

Christian_Dahlqvist · June 18, 2019, 4:58am

DLQ is only supported by the Elasticsearch output plugin as far as I know and only queues documents where Elasticsearch reported an error, not when Elasticsearch was not available.

g.le · June 19, 2019, 4:33pm

Thanks everyone for the responses.

Given that I had a spare Logstash server doing nothing, I ended up duplicating everything:

Two Filebeat instances residing on the host that produces the logs.
Each Filebeat instance ships logs to a dedicated Logstash node.
Each Logstash node pushes documents to a dedicated Elasticsearch cluster.

Not the most elegant approach, but it was easy to configure without spending much time converting an existing (and huge) Logstash configuration into pipelines.
The regression test would have lasted weeks for no reason.

system · July 17, 2019, 4:33pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Sending an unfiltered copy Logstash	15	1948	June 4, 2018
Clarification on if/else behavior in output Logstash	3	386	October 21, 2019
Pipelines or "if else" conditions for output Logstash	13	15858	April 26, 2018
The final stage of the event pipeline with an "if" statement Logstash	3	409	August 27, 2018
Conditional in output filter fails on Linux Logstash	9	396	April 22, 2020

If/else within Logstash output plugin

Related topics