I have a stream of timeseries data that streams to ES once / device /
second. I would like to apply a facet (or aggregate) that buckets based on
time but produces the outputs of the stats facet on a value during that
time.
So something like 5 minute buckets from start_date to end_date
giving count, total, sum of squares, mean (average), minimum, maximum,
variance, and standard deviation values for those buckets.
Can this be done in the 0.9.x branch? If so, guidance on how and tradeoffs
will be helpful. How about 1.x and the new aggregates? Client is not going
live for a few months so the 1.x branch could be reasonable.
Thats exactly what the new Aggregations feature is for. Have you seen our
recent blog post for the 1.0 beta 2 release? Aggregations is included in
that release.
I would download the beta and try your example there and let us know how it
goes.
Here are the aggregations docs:
On Monday, December 9, 2013 1:38:12 PM UTC-8, Michael Sick wrote:
I have a stream of timeseries data that streams to ES once / device /
second. I would like to apply a facet (or aggregate) that buckets based on
time but produces the outputs of the stats facet on a value during that
time.
So something like 5 minute buckets from start_date to end_date
giving count, total, sum of squares, mean (average), minimum, maximum,
variance, and standard deviation values for those buckets.
Can this be done in the 0.9.x branch? If so, guidance on how and tradeoffs
will be helpful. How about 1.x and the new aggregates? Client is not going
live for a few months so the 1.x branch could be reasonable.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.