I'm playing around with the aggregations to get a better feel of what I can
don with them.
I was wondering how I would write the following aggregations/
I have entries listing user interactions and the ip at the time of
interaction.
Say I want to count for every user the number of different ips i have seen
for them.
Other question
I want to find the most seen ips for every user.
If this helps anyone I have found the query:
On a side it is relatively slow:
my dataset is about 2,5 million docs on a single node with 15gb ram;
the query:
On Tuesday, December 3, 2013 6:18:02 AM UTC+1, nicolas maillard wrote:
Hello everyone
I'm playing around with the aggregations to get a better feel of what I
can don with them.
I was wondering how I would write the following aggregations/
I have entries listing user interactions and the ip at the time of
interaction.
Say I want to count for every user the number of different ips i have seen
for them.
Other question
I want to find the most seen ips for every user.
The aggregations framework present in Elasticsearch 1.0 beta 2 is still at
an early stage and doesn't have all the optimizations that facets have got
over their years of existence. For example, if you compare terms facets
against terms aggregations on string terms, you may notice that terms
aggregations are significantly slower. The reason is that aggregations
don't know yet how to leverage terms ordinals in order to speed up the
generation of the buckets: this is something that will be addressed in the
1.0 release. There are other similar improvements that are planned for the
next weeks and performance numbers should hopefully get better in the next
releases.
definitly looking forward to this release. I'm testing out the usability on
some of our use cases and right now it is a little slow and very ogten
hitting the Ram limit even for this small table and a somwhat simple query.
none the less great feture and I am sure will event better by the time it
hits GA.
Thanks es for all the hard work
On Tuesday, December 3, 2013 10:34:05 AM UTC+1, Adrien Grand wrote:
Hi Nicolas,
The aggregations framework present in Elasticsearch 1.0 beta 2 is still at
an early stage and doesn't have all the optimizations that facets have got
over their years of existence. For example, if you compare terms facets
against terms aggregations on string terms, you may notice that terms
aggregations are significantly slower. The reason is that aggregations
don't know yet how to leverage terms ordinals in order to speed up the
generation of the buckets: this is something that will be addressed in the
1.0 release. There are other similar improvements that are planned for the
next weeks and performance numbers should hopefully get better in the next
releases.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.