What is the original size/volume of data send by a particular host?
What is the size/volume of data indexed into ES from a particular host
For (1) it is actually the size of the "message"
My thought was to calculate size per message and then multiply by number of messages per day from the host
Is my understanding correct that the "message" field which comes into ES (if there was no alteration) is the original _raw data from the client system?
That would give you a raw idea of the size of every document.
Then if you filter by host and run a sum agg to the size field, that'd give you an idea of the size of the related documents.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.