I manually edited the registry file on one of my Filebeat containers in an attempt to get it to reprocess specific files. Filebeat loads the edited registry, and the logs indicate that it no longer recognizes the files I removed from its list, but it still will not reprocess them. Is there another location that I need to update to get it to reprocess these files?
Deleting the registry completely and reprocessing everything is not an option: I have thousands of files that equate to billions of entries in Elastic.
The files are still available. They do not appear in the Filebeat log, and they are not in the registry either.
Additionally, the Filebeat container generates an extra registry file called registry.new when I make these changes, which doesn't make a whole lot of sense to me.
When writing the registry file, Filebeat first creates the file registry.new with the complete new contents. Right after serializing to registry.new, Filebeat uses a replace operation to atomically swap the old registry file for registry.new. The fact that registry.new is still around might point to an I/O error. Have you changed file ownership or permissions by accident when editing?
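To illustrate the write-then-replace pattern described above, here is a minimal Python sketch. It is not Filebeat's actual code; the function name `write_registry_atomically` and the JSON payload are assumptions made for illustration only.

```python
import json
import os


def write_registry_atomically(path, states):
    """Serialize `states` to `<path>.new`, then swap it into place.

    Hypothetical helper: it mirrors the pattern described in the post,
    not Filebeat's real implementation.
    """
    tmp_path = path + ".new"
    with open(tmp_path, "w", encoding="utf-8") as f:
        json.dump(states, f)
        f.flush()
        os.fsync(f.fileno())  # push the bytes to disk before the swap
    # os.replace is atomic on POSIX when both paths are on the same filesystem.
    # If it raises (for example, a permission error), the .new file stays behind.
    os.replace(tmp_path, path)
```

If the final replace step fails, the temporary file is never renamed, which would be consistent with a stray registry.new sitting next to the registry.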
I will check whether it has something to do with the registry.new file; that might be a weird artifact of Docker volumes. I do know that when I slice the entries out, the registry parse at Filebeat startup reports the correct number of tracked files, so I'm not sure that's where the issue is.
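For anyone wanting to double-check an edit like this before restarting, a rough Python sketch follows. It assumes the registry is a single JSON file containing an array of per-file state objects, each with a `source` path; that layout depends on the Filebeat version and is an assumption here, as is the helper name `remaining_entries`.

```python
import json


def remaining_entries(registry_path, removed_sources):
    """Return (total entry count, removed paths that are still listed).

    Assumption: the registry is a JSON array of objects with a "source" key.
    """
    with open(registry_path, encoding="utf-8") as f:
        states = json.load(f)
    tracked = {state.get("source") for state in states}
    still_listed = sorted(tracked & set(removed_sources))
    return len(states), still_listed
```

An empty second value only confirms that the paths are gone from the file on disk; it says nothing about why Filebeat still skips them.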
I might try the offset thing as well. Thanks for the ideas.