There's also similar entries for project A, B, C, D ect. These entries documents have an active of true or false, and a submission date from when this information was gathered.
So lets say I wanted to create a pie chart displaying the percentage of my charts that are currently active (the document for project A with the latest submission date where active=true). So I would have a single document per application that shows whether it's currently active or not as my data set. I want to find display the percentage of those projects that are active compared to the entire data set.
So the example would be a pie chart that's showing the percentage of active applications, but it's only using the latest entry for a given application in its data set. Does that make a little more sense now?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.