Partial line reading when restarting Filebeat

eclionast · April 3, 2017, 2:43pm

Hello,

Currently we have the following setup:
Filebeat reads log files and sends the content to Kafka. One log line results in one Kafka event.
On the other side, a Logstash instance reads the events from Kafka, parses them, and sends the resulting documents to Elasticsearch.
Filebeat runs inside a Docker container and reads the log files from a Docker data container.
We are using Filebeat 5.1.2 with Docker 1.11.2.

The problem we are encountering is as follows:
Although the offset in the Filebeat registry points to the start of a new log line, Filebeat starts reading somewhere in the middle of the previous log line after a restart. As a result, a partial log line is sent to Kafka, which causes parsing errors in Logstash.

According to the documentation on "How does Filebeat ensure At-Least-Once delivery?", Filebeat should simply resume reading from the stored offset upon restart.
What could cause the behavior we are experiencing?

Thank you

ruflin · April 4, 2017, 2:49pm

The first thing that comes to mind is that this could be related to encoding or special characters. But at the same time, the offset is in bytes, so it should not matter.

Can you share an example of the log files and the registry where this happens? Can you also share your config?
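
For anyone debugging the same symptom, here is a minimal sketch of how one might check whether the offsets stored in the registry really fall on line boundaries. It assumes the registry is a JSON array of objects that each carry a "source" path and a byte "offset", which is how the 5.x registry file is commonly laid out; verify this against your own file. The function names are made up for this example and are not part of Filebeat.

```python
import json


def offset_is_line_start(log_path, offset):
    """Return True if byte `offset` is 0 or the byte just before it is a newline."""
    if offset == 0:
        return True
    # Binary mode, because the registry offset is counted in bytes, not characters.
    with open(log_path, "rb") as fh:
        fh.seek(offset - 1)
        return fh.read(1) == b"\n"


def check_registry_offsets(registry_path):
    """Return (source, offset, is_line_start) for every entry in the registry.

    Assumption: the registry is a JSON array of objects with "source" and
    "offset" keys. Adjust the key names if your registry differs.
    """
    with open(registry_path, "r", encoding="utf-8") as fh:
        states = json.load(fh)

    results = []
    for state in states:
        source = state["source"]
        offset = state["offset"]
        results.append((source, offset, offset_is_line_start(source, offset)))
    return results
```

Calling check_registry_offsets() on a copy of the registry (taken while Filebeat is stopped) lists each file with its offset and a flag. If every flag is True but partial lines still show up after a restart, the stored offset itself is probably not the culprit.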

eclionast · April 4, 2017, 3:09pm

Hello Ruflin,

If it was an encoding problem, then we should have seen scrambled content in Kafka, I think. And as long as Filebeat was not stopped, the pipeline processed everything well.

Eventually we found a solution by setting close_eof: true, which works for us because the log files don't roll. Each file is written once and not updated afterwards.
With that option turned on, restarting the Filebeat Docker container no longer results in partial log lines being sent to Kafka.

Thank you very much for your reply.

ruflin · April 5, 2017, 1:58pm

Interesting, not sure why this solves the issue TBH. In case you see it again, ping me.

system · May 3, 2017, 1:59pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
