Hi,
Since I have a field with a unique value for each of my CSV entries, how can I use that field's value as the fingerprint instead of generating one, so that duplicates are avoided?
I experimented with the csv filter for a while and was wondering about the above.