What are my options if I want to keep track of the size (in bytes) of the
original input documents? That is, the sum of sizes, in bytes, of all
their fields?
What are my options if I want to keep track of the size (in bytes) of
the original input documents? That is, the sum of sizes, in bytes, of
all their fields?
The best one IMO is to index a field along with your doc that has the
bytes counted in whatever way interests you.
With this solution you can add as many different sizes as you need.
On Monday, January 21, 2013 1:21:34 PM UTC-5, Drew Raines wrote:
Otis Gospodnetic wrote:
What are my options if I want to keep track of the size (in bytes) of
the original input documents? That is, the sum of sizes, in bytes, of
all their fields?
The best one IMO is to index a field along with your doc that has the
bytes counted in whatever way interests you.
The client doesn't have this information.... or at least I can't rely on it
to existing and be accurate.
So I need to figure out the size on the server side somewhere.
But I would like to avoid unwrapping JSON just to sum up all document's
fields' values.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.