Essentially, I'm trying to count the number of 'single instance' documents as one value, and the number of 'duplicate instance' documents as another value. Essentially a count of a count I believe.
Thank you for this! I've been sidetracked recently but I'm going to look into your suggestion in the next few days. I'll report back here on any progress.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.