Thank you for your support. Then once I scan all the files using multiple FSCrawler instances, will running FSCrawler on the root directory continue tracking changes without performing a full re-scan?
I think you will need to keep it running as it was running for the first run.
I said "I think" because I'm not sure about it. I don't remember how ids are computed.
May be you could run it from the root and use different includes settings for each instance. And then run again from root without the includes setting... That way, all ids should be consistent.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.