My goal is to run this on systems where a running process generates multiple files and later removes them. I want to ship those files to a Logstash server and store them in a common location, basically a central log store.
How do I send more log lines at a time, or get Logstash to handle new lines faster as they arrive? The Logstash system has plenty of resources (memory/CPU).
At startup the whole file is synced in one go, but subsequent syncs are not fast enough: only a few new lines arrive every few seconds, and the file is opened and closed too frequently.
The issue is that either Filebeat slows down when sending data or Logstash is not writing the data fast enough.
On initial startup it does send all the thousands of lines in a few seconds (which tells me there is no resource or network problem on either side), but after that the data only arrives in batches. So I am trying to figure out which setting controls that and what value would suit my use case.
What are the versions of both Filebeat and Logstash?
They work in batches, so you can try changing the batch sizes to see whether that gets you to your desired ingestion rate.
You do not seem to have changed the Filebeat bulk_max_size; in later versions this has a default of 2048, while your Logstash batch size is 4096.
Both are maximum values: Filebeat and Logstash may emit smaller batches, and larger batches can also affect processing speed.
You also have 1 worker on both Filebeat and Logstash, which will limit ingestion if multiple files are being written at the same time.
Try setting the Logstash batch size to the same value as the one in Filebeat (2048).
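As a rough sketch of where these knobs live on the Filebeat side, the fragment below shows the Logstash output and in-memory queue settings in filebeat.yml. The host name is hypothetical and the values are only illustrative, not tuned recommendations; defaults differ between versions, so check the documentation for the release you run. The queue flush timeout is included on the assumption that it may be part of why lines show up only every few seconds.

```yaml
# filebeat.yml (fragment) -- illustrative values, assumptions not taken from your config
queue.mem:
  events: 4096            # capacity of the in-memory queue
  flush.min_events: 512   # hand events to the output once this many are queued...
  flush.timeout: 1s       # ...or after this long, whichever comes first

output.logstash:
  hosts: ["logstash.example.internal:5044"]  # hypothetical host
  worker: 2               # connections per configured host publishing in parallel
  bulk_max_size: 2048     # maximum number of events in one request to Logstash
```

A shorter flush timeout trades larger batches for lower latency, which seems closer to what you want for small, sporadic appends.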
I am testing Filebeat 8.19.15; Logstash, on the other hand, is 8.16.1. Do both need to be the same version? I can change them, as this is a POC.
I am now testing pipeline.batch.size set to 50, a smaller chunk, because the Filebeat side has only two or four files that need to send data, and they are not high volume. Eventually there will be many more source servers, about 50, each with two to four files that may get a few lines appended every few seconds and sometimes nothing for hours.
Basically, I am trying to create a central log location for these files.
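For reference, a minimal sketch of the Logstash side of that test, assuming the settings go in logstash.yml rather than in pipelines.yml or on the command line. Only the batch size of 50 comes from the test described above; the worker count and delay are illustrative values.

```yaml
# logstash.yml (fragment) -- only pipeline.batch.size reflects the test above
pipeline.workers: 2        # illustrative; worker threads running the filter and output stages
pipeline.batch.size: 50    # maximum events a worker collects before running filters and outputs
pipeline.batch.delay: 50   # milliseconds to wait for more events before dispatching an undersized batch
```

With low, bursty volume the delay matters more than the size, since a batch is dispatched once the delay elapses even if it is not full.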