I currently need to load some log files that are stored inside tar archive files. To get them into Filebeat, I use the gzcat tool to write the contents of the archives to stdout and pipe that into Filebeat's stdin input. For a 200 MB log file, this takes more than an hour. Is there any way to speed this up? I experimented with different buffer size and poolsize options, but it did not help much. I also tried an SSD instead of a regular hard drive, but that did not help much either. Any help would be greatly appreciated.
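For reference, here is a minimal sketch of the kind of streaming step described above, written in Python rather than with gzcat. It is not my exact setup: the function name `stream_tar_logs` and the 1 MiB chunk size are assumptions for illustration. It reads a (possibly compressed) tar archive as a forward-only stream and writes only the contents of the regular files to stdout, so the tar headers do not end up in the output.

```python
import shutil
import sys
import tarfile


def stream_tar_logs(archive_path, out=None, chunk_size=1024 * 1024):
    """Copy the contents of every regular file in a tar archive to `out`.

    `out` defaults to the binary stdout stream; `chunk_size` is an
    illustrative value, not a tuned recommendation.
    """
    if out is None:
        out = sys.stdout.buffer
    # "r|*" opens the archive as a sequential stream and detects the
    # compression automatically, so nothing is extracted to disk.
    with tarfile.open(archive_path, mode="r|*") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories, links, and other entries
            handle = archive.extractfile(member)
            if handle is None:
                continue
            # In stream mode each member must be read before moving on.
            shutil.copyfileobj(handle, out, chunk_size)
    out.flush()


if __name__ == "__main__":
    for path in sys.argv[1:]:
        stream_tar_logs(path)
```

Saved under a hypothetical name such as `stream_logs.py`, it could be run as `python stream_logs.py logs.tar.gz | filebeat -e -c filebeat.yml`, assuming `filebeat.yml` configures an input of type `stdin`. I have not measured whether this changes the throughput.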