Interval query using two fields

telastic · July 31, 2018, 4:52pm

Is there a way to count the number of documents that fall within an interval, based on two fields?

For example, say you had documents with start and end times. Instead of intervaling over one field (like the start time, in this case), I want intervals that count how many documents are completely or partially contained within each time interval, based off of their start and end times.

I understand how to do this for only one interval using ranges, like:

 "query": {
    "bool" : {
      "must" : [
        {
          "range" : {
            "start_time" : {
              "lt" :  "2018-04-12T00:05:00.000Z" (the end time of the interval)
            }
          }
        },
        {
          "range" : {
            "stop_time" : {
              "gt" :  "2018-04-12T00:04:45.000Z" (the start time of the interval)
            }
          }      
        }
        ...
}

This returns the correct values for one sub-interval, but I would ideally like to execute a query and get the correct values for a time range containing multiple intervals. Is this possible?

dadoonet · July 31, 2018, 5:14pm

I believe this what date_range data type is made for I believe. See https://www.elastic.co/guide/en/elasticsearch/reference/current/range.html

telastic · July 31, 2018, 8:33pm

Interesting, I didn't realize there are range data types. I looked into it, but this would still only allow me to get one interval at a time, right? Histogram and range aggregations don't seem to be compatible with range data types, and the range query only works on one range.

dadoonet · July 31, 2018, 9:13pm

Histogram and range aggregations don't seem to be compatible with range data types

Sure but you can always index a range and dates separately.

telastic · July 31, 2018, 9:20pm

Can you elaborate what you mean?

dadoonet · July 31, 2018, 9:58pm

Index:

{
  "start": "2018-04-12T00:04:45.000Z",
  "end": "2018-04-12T00:05:00.000Z",
  "range": {
    "from": "2018-04-12T00:04:45.000Z",
    "to": "2018-04-12T00:05:00.000Z"
  }
}

telastic · July 31, 2018, 10:01pm

But that still doesn't allow me to query over multiple intervals, right? I'd still only get the one interval, which isn't what I want.

dadoonet · August 1, 2018, 1:49am

I don't know if you can use an array of ranges.

@jpountz do you have an idea?

jpountz · August 1, 2018, 7:07am

We would like to make histogram and range aggregations able to work on range fields (https://github.com/elastic/elasticsearch/issues/23182) but this will take some time. In the meantime, your best option would be to use a filters aggregation I suppose, with one filter per range that you want to count, regardless of whether you run this range with two fields or a single range field.

system · August 29, 2018, 7:08am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Range values in a histogram Elasticsearch	2	262	July 6, 2017
Query documents using multiple time fields Kibana	2	617	July 17, 2017
Aggregation: Difference of Two Date Fields to Aggregate Timedeltas Elasticsearch	1	1962	May 30, 2017
Timeseries with startDate-endDate. Aggregations on different date intervals Elasticsearch	1	430	January 11, 2018
How to use date_histogram to aggregate documents that span multiple period based on two date fields Elasticsearch	1	361	March 12, 2021

Interval query using two fields

Related topics