Recommendations for parsing 1000's ~10MB files to backfill elasticsearch

steffens · March 29, 2019, 3:53pm

What is the ratio between lines published and lines filtered out. The registry file keeps track of the file offset, but needs some IO to be written. If the ratio is somewhat 'bad', then the registry writes will slow down filebeat, as it also requires some fsync when writing the registry. Setting filebeat.registry_flush: 1s helps in this case (See registry_flush docs).

Topic		Replies	Views
High CPU Usage - Windows Beats filebeat	6	2134	July 8, 2016
Determination Filebeat -> Elasticsearch performance Beats elastic-stack-monitoring , filebeat	3	352	March 5, 2019
Filebeat include_lines performance v.s. grep Beats filebeat	2	1666	November 9, 2018
Filebeat 6.2 throughput and general performance Beats filebeat	7	4473	April 3, 2018
Suggestion improving filebeat performance Beats filebeat	3	1224	November 24, 2017

Recommendations for parsing 1000's ~10MB files to backfill elasticsearch

Related topics