We have a set logs file that roll over when it gets to a certain size; we keep 50 old files. We are using Log4Net to perform the log and roll over.
On average, we may have 2 to 10 files per day. We create daily index for the logs and removed the old index after certain days.
The issue is: the old indexes that are removed, somehow comes back in the next day. It appears as if filebeat re-processes the files that were read in previous day.
Note: The filebeat process is running continuously without stopping.
it appeared that the file had different inode id when it was rolled over. We put some condition to re-ingest file not older that 24hr to reduce duplication. Is there a better solution to minized duplication.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.