Hi,
It's bit frustrating to understand what is Filebeat doing during startup. My situation is, I needed to ingest about 90k files from fs, which were simple xml logs. It happend quite fast. No problems at all.
Afterwards, filebeat.yml was limited to ingest only files not older thant 3 days (about 10k of files). However, filebeat is starting for more than 3 hours without sending new logs to logstash. Only records in log file are, lines like this:
2020-11-16T17:11:12.734+0100 INFO [monitoring] log/log.go:145 Non-zero metrics in the last 30s {"monitoring": {"metrics": {"beat":{"cpu":{"system":{"ticks":594515,"time":{"ms":3344}},"total":{"ticks":7365186,"time":{"ms":27140},"value":7365186},"user":{"ticks":6770671,"time":{"ms":23796}}},"handles":{"open":199},"info":{"ephemeral_id":"0eedf066-8019-4711-bb27-1c57570f73b2","uptime":{"ms":8670060}},"memstats":{"gc_next":326789872,"memory_alloc":234747080,"memory_total":1135321850328,"rss":27598848},"runtime":{"goroutines":21}},"filebeat":{"events":{"added":84,"done":84},"harvester":{"open_files":0,"running":0}},"libbeat":{"config":{"module":{"running":0}},"pipeline":{"clients":1,"events":{"active":1,"filtered":84,"total":84}}},"registrar":{"states":{"current":95781,"update":84},"writes":{"success":84,"total":84}}}}}
My filebeat.yml is:
# Change to true to enable this input configuration.
enabled: true
# Paths that should be crawled and fetched. Glob based paths.
paths:
#- /var/log/*.log
- E:\tmp\duplicate_response\*\*.xml
### Multiline options
multiline.type: pattern
multiline.pattern: '^<\?xml'
multiline.negate: true
multiline.match: after
close_eof: true
clean_inactive: 72h
clean_removed: true
ignore_older: 70h
The question is, when it will end "reindexing" of the registry file? File has 25MB and is growing...