Missing logs with rotate log

Amos_Shahar · June 30, 2016, 12:01pm

hi,
I am using filebeat (version 1.2.3-1 on AWS linux AMI) to forward to logstash and I have missing logs (every time the file rotates I think).
the specific file is rotating every 20M and in the peak time it is rotating every 1-2 minutes, rotation name is:
filename.log, filename.log.1,filename.log.2 ....
Relevant yaml conf:
paths:
- /my_path/*.log
ignore_older: 24h
scan_frequency: 1s
tail_files: false

logstash:
hosts: ["ls-mydomain.com:4055"]
loadbalance: true

any idea what can be wrong? how to troubleshoot it?
In general, does filebeat can handle log rotate in such load? please note that there are many other files that filebeat is configured to ship but those that have no load are fine.

tried to change the parameters above but it is not solved.

Thanks,
Amos

magnusbaeck · July 1, 2016, 5:35am

Exactly how are the files rotated? Are they renamed? Or copied and truncated?

Amos_Shahar · July 1, 2016, 10:35pm

I am not sure. it uses log4j and as I mentioned the names are as follow:
filename.log
filename.log.1
filename.log.2
....

Amos

steffens · July 2, 2016, 11:50am

you glob pattern does not match the renamed files. Thusly filebeat has problems finding these rotated files.

Amos_Shahar · July 5, 2016, 8:19pm

I tried with all patterns ("log.*", ".log", "log*") but still have missing messages.
The only configuration that solve the problem is when I set the file rotation to a very big file (so there is no rotation) and than I get exactly ALL the messages. it seems that filebeat has issues with log rotating in high volume.
anyone any idea?

Amos

ruflin · July 6, 2016, 6:58am

It seems like you are hitting this bug here: https://github.com/elastic/beats/pull/1954

Could you try the nightly build to see if this resolves your problem? https://beats-nightlies.s3.amazonaws.com/index.html?prefix=filebeat/

The problem gets more sever as your scan_frequency is quite low.

Amos_Shahar · July 6, 2016, 12:19pm

Issue has been resolved with this version - Thanks!
filebeat-5.0.0-alpha5-SNAPSHOT-x86_64.rpm

When will you have a stable release with this bug fix?

Amos

ruflin · July 11, 2016, 8:21am

Glad it works with the most recent version. The first beta with these changes should be release in the next weeks.

Amos_Shahar · July 12, 2016, 6:41am

I am sorry but after a hour or two it stopped working again ....
It is a show stopper to the whole project. filebeat forward few logs every minute while I have more than 2000 log lines every minute.
What information do you need in order to help solving this issue?

Amos

ruflin · July 12, 2016, 8:16am

Can you post part of your log file? Please set the log level to at least INFO, best would be DEBUG to see all the details.

Amos_Shahar · July 12, 2016, 10:23am

I can send the log file and the content of some of the files to you but I prefer not to publish as there is customer information involved. do you have an email?

Thanks,
Amos

Amos_Shahar · July 12, 2016, 10:57am

you can see the filebeat INFO file at:
http://open-voip.org/images/0/08/Filebeat.txt
and the registry file:
http://open-voip.org/images/b/b7/Registry.txt

the problematic file is webSocket.log

Thanks,
Amos

ruflin · July 13, 2016, 6:28pm

Thanks for sharing some log files here. I had a quick look at the filebeat log and there is nothing really suspicious. It publishes very 30s between 30-60k events which sounds like enough to me to cover your case above.

Can you share the log lines from when you think that not all events are published? Or is that the case with the excerpt you shared? Can you share again the full config that you used for these tests?

Amos_Shahar · July 17, 2016, 11:38am

here is the conf file:

filebeat.prospectors:
- input_type: log
  paths:
    - /logs/tnet/webSocketEvents.log
    - /logs/tnet/FIX*.log
    - /logs/tnet/tomcatS*.log
    - /logs/tnet/dbPr*.log
  ignore_older: 2m
  fields:
    level: info

I tried few option with ignore_older parameter - all failed

Thanks,
Amos



output.logstash:
  hosts: ["ls-mydomain.com:4055"]

ruflin · July 18, 2016, 6:27am

Did you try not to use ignore_older?

Can you share the log lines from when you think that not all events are published? Or is that the case with the excerpt you shared?

system · July 21, 2016, 12:02pm

This topic was automatically closed after 21 days. New replies are no longer allowed.

Topic		Replies	Views
Filebeat missed one complete log file in log rotation Beats filebeat	10	1512	May 17, 2017
Filebeat missing some log lines Beats filebeat	4	2120	July 5, 2017
File beat reading data from rotated File not in prospect Beats filebeat	4	1018	July 5, 2017
Filebeat stops sending logs after logrotates Beats filebeat	4	2464	July 5, 2017
Log rotation and filebeat Beats filebeat	17	23135	August 20, 2018

Missing logs with rotate log

Related topics