Order of file generation and file reading

sharon.c · January 11, 2016, 8:12pm

I used filebeat logstash and elasticsearch together to port data from files to elasticsearch, I have a process to generate files in a folder, and filebeat reads them and sends data to logstash then to elasticsearch. The flow goes like this:
files (step 1) -> filebeat(step 2) -> logstash (step 3)- elasticsearch

It seems to me the order is very important, I need to start the application to generate files(step 1) first, and then start filebeat (step 2)

If I do it in an opposite order, start filebeat process first, and then start generating files , filebeat cannot read files. I used -e -d "*" to track the response when initiating the filebeat, the process responds like "filebeat is harvisting on the file". But it will never start reading the file.

So shall I assume that I have to start generating the files before starting filebeat to read the files?

msimos · January 11, 2016, 9:07pm

Hi,

Using a path like /path/to/files/* the prospector will scan the directory for new files every 10 seconds. So you shouldn't need to generate the files before starting Filebeat. You can also decrease the scan_frequency if you want Filebeat to check for new files quicker:

https://www.elastic.co/guide/en/beats/filebeat/current/filebeat-configuration-details.html#_scan_frequency

Topic		Replies	Views
Filebeat does not poll files in order Beats filebeat	5	2296	November 21, 2017
Can we make filebeat to read files in particular sequence Beats filebeat	6	2073	July 5, 2017
Monitoring files in a directory Beats filebeat	5	772	July 16, 2018
Filebeat is skipping some older files and starts to send new ones Elasticsearch	0	73	May 7, 2024
Filebeat is always read file from beginning Beats filebeat	2	1451	February 1, 2020

Order of file generation and file reading

Related topics