Hi guys,
Thanks again for the quick replies, very much appreciated!!
We are using ES for the past year and from day 1 we haven’t had good
perforce for groovy script that use scripted metric aggregations.
Our data is not huge yet, we have 163 indices, 356 shards and 360M
documents but when we run the groovy script it can take up to 2-3 minutes.
From our understanding it should run much faster.
thus we are afraid from the future, when data becomes lots bigger, after
the beta stage
now i'm not sure whats the different between Hadoop and HDFS.
is Hadoop an engine that runs over HDFS?
btw, my complicated scenario, is that i have tons of events, with fields:
event type, user id, date,.. [lots more]...
for example :
{ userid:1, event_type:A,date:03/05/2015 14:25:01}
{ userid:1, event_type:T,date:03/05/2015 14:25:02}
{ userid:1, event_type:S,date:03/05/2015 14:25:03}
{ userid:1, event_type:Z,date:03/05/2015 14:25:04}
{ userid:1, event_type:B,date:03/05/2015 14:25:05}
in the query, i need to find specific flows of users, and not necessary in
a roll , for example: A->S->Z needs to return the user above, wither all
the relevant docs.
when using scripted metric aggregation, it takes a long time, and moreover-
takes lots of memory, and sometimes kill the ES
can Hadoop help me with it?
i thought of creating a list of events per user (currently i have a type
"events" in a daily index, with list of events ordered by date and time,
and the user is a field in this type )
thanks again!
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/b4fba6db-18ca-4365-af8e-9f95a8571075%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.