Multiline Codec stacking logs from different sources

Maxwell_Flanders · July 8, 2016, 3:10pm

Hi there guys,

We are using a logging setup in which many different servers write their logs to a Kafka topic with two partitions. Two instances of Logstash on different VMs then read from that topic using the logstash-kafka input and a multiline codec.

We have noticed a very strange case: we thought we were losing logs, but in fact discovered that some of our logs from one server were being appended to logs from another server. What could cause this? Our Kafka topic has two partitions, and we have two consumers with one thread each, which I believe is the recommended balance of consumers to partitions.

On another front, doesn't the multiline codec have a concept of stream identity? Why would lines from two different servers/sources ever come into contact with each other? Is there a setting I can change to make sure they do not?
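To make the question concrete, here is a minimal Python sketch of the behavior we think we are seeing. It is not Logstash code and makes no claim about how the codec is implemented; the continuation rule (a line starting with whitespace belongs to the previous event), the server names, and the sample lines are all made up for illustration. It contrasts a single shared multiline buffer with one buffer per source, which is what I mean by stream identity.

```python
import re
from collections import defaultdict

# Hypothetical rule: a line that starts with whitespace continues the previous event.
CONTINUATION = re.compile(r"^\s")

# Made-up sample: lines from two servers interleaved in one partition.
RECORDS = [
    ("server-a", "ERROR boom"),
    ("server-b", "INFO started"),
    ("server-a", "  at foo()"),
    ("server-b", "INFO ready"),
]


def merge_shared(records):
    """Merge multiline events using ONE buffer for every source."""
    events, buffer = [], []
    for _source, line in records:
        if buffer and not CONTINUATION.match(line):
            # A new event starts: flush whatever was buffered, whoever wrote it.
            events.append("\n".join(buffer))
            buffer = []
        buffer.append(line)
    if buffer:
        events.append("\n".join(buffer))
    return events


def merge_per_source(records):
    """Merge multiline events using one buffer PER source (stream identity)."""
    buffers = defaultdict(list)
    events = []
    for source, line in records:
        buffer = buffers[source]
        if buffer and not CONTINUATION.match(line):
            events.append((source, "\n".join(buffer)))
            buffer.clear()
        buffer.append(line)
    # Flush whatever is still buffered for each source.
    for source, buffer in buffers.items():
        if buffer:
            events.append((source, "\n".join(buffer)))
    return events


if __name__ == "__main__":
    print(merge_shared(RECORDS))
    print(merge_per_source(RECORDS))
```

With the shared buffer, the stack-trace line from server-a ends up attached to the "INFO started" event from server-b, which matches the stacking we observe. With one buffer per source, it stays with "ERROR boom".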

Any help is appreciated; this issue is really putting us in a weird spot.

Thanks!

Glen_Smith · July 8, 2016, 5:12pm

Moved to Logstash

Maxwell_Flanders · July 8, 2016, 6:11pm

Thanks!
