Time/Order based query

John2 · November 4, 2015, 11:24am

I am using elasticsearch to import log events. Suppose an event X occurs, followed by event Y. Is there a query that can be performed that will find all documents that match an X event followed by a Y event using the Query DSL? Could aggregations be used, or is this kind of query not supported in elasticsearch?

Thanks.

polyfractal · November 4, 2015, 1:56pm

At the moment, these types of queries are tough for Elasticsearch. Event X may be located on one shard, while Event Y may be on a different shard. Matching/sorting based on sequential causality would mean that both shards (on potentially different nodes) would have to coordinate their actions and communicate, which could be very expensive.

You might be able to accomplish something similar with the new pipeline aggregations, but not likely. These aggs work on the results of other aggregations (e.g. they operate on buckets, not documents), so you'd only be able to calculate stats on the sampled buckets.

You'll probably have better luck by designing some kind of "entity-centric indexing" scheme, where you save the sequential relationship in an "entity" and use that document to determine matches. Mark Harwood has a few presentations on the subject:

John2 · November 12, 2015, 1:29pm

Thanks for your reply. I had a feeling that it wouldn't be easy, if at all possible.

Topic		Replies	Views
Elasticsearch sequence pattern mining Elasticsearch	4	2441	July 12, 2017
Question for Aggregation Elasticsearch	1	392	July 5, 2017
Elasticsearch Query with pattern and time sequence Elasticsearch elastic-stack-machine-learning	4	1356	November 19, 2019
Searching by Temporal Proximity Elasticsearch	1	654	July 6, 2017
Dynamic pattern matching over a sequence of events Elasticsearch	5	1717	November 4, 2022

Time/Order based query

Related topics