Aggregation queries question

nathen.hinson · April 5, 2018, 3:31pm

Hello,

I have roughly 68 million documents across four indices that have identical mappings, for example:
{
'log_time': '2017-01-06T09:23:00Z',
'extra': {},
'used': False,
'who': u'Bruno',
'interaction_id': u'Fifty-five',
'what': u'message',
'value': u'=== Closed Without Comment ==='
}

I am looking to return the interaction_ids whose group of documents satisfies at least one of the following conditions: at least one document where 'what' is 'Status' and 'value' is 'New',
or at least one document where 'what' is 'Status' and 'value' is 'Closed' together with at least one document where 'what' is 'Queue' and 'value' is 'Breakfix'.

I have tried combinations of a nested filter aggregation, a straight pipeline bucket aggregation, and bucketing by the filter conditions followed by a child aggregation that buckets by interaction_id, all to no avail.

Would anyone have any suggestions for what I might try next?

With Thanks!

Mark_Harwood · April 5, 2018, 3:39pm

How many unique interaction_ids are there?

nathen.hinson · April 5, 2018, 3:42pm

About 3 million

nathen.hinson · April 5, 2018, 5:08pm

Sorry, I replied to the thread and not to you: it's roughly 3 million unique interaction_ids.

Mark_Harwood · April 6, 2018, 7:29am

Joins on high-cardinality fields are problematic for any distributed system. This video explains why and shows a way to work around it: https://youtu.be/yBf7oeJKH2Y
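
To make the grouping logic concrete, here is a minimal Python sketch that avoids a query-time join by first collapsing the event documents into one summary per interaction_id and then testing the conditions against each summary. It is only an illustration and not necessarily the approach from the video. It assumes the conditions read as "Status/New, or both Status/Closed and Queue/Breakfix", and the function names and sample documents are hypothetical. At 68 million documents the input would have to be streamed (for example from a scroll over the indices) rather than held in a list.

```python
from collections import defaultdict

# Conditions as read from the question (one possible interpretation).
STATUS_NEW = ("Status", "New")
STATUS_CLOSED = ("Status", "Closed")
QUEUE_BREAKFIX = ("Queue", "Breakfix")


def summarize(docs):
    """Collapse event documents into a set of (what, value) pairs per interaction_id."""
    seen = defaultdict(set)
    for doc in docs:
        seen[doc["interaction_id"]].add((doc["what"], doc["value"]))
    return seen


def matching_interactions(docs):
    """Return the sorted interaction_ids whose documents satisfy the conditions."""
    summaries = summarize(docs)
    return sorted(
        interaction_id
        for interaction_id, pairs in summaries.items()
        if STATUS_NEW in pairs
        or (STATUS_CLOSED in pairs and QUEUE_BREAKFIX in pairs)
    )


# Hypothetical sample data, shaped like the documents in the question.
sample = [
    {"interaction_id": "a", "what": "Status", "value": "New"},
    {"interaction_id": "b", "what": "Status", "value": "Closed"},
    {"interaction_id": "b", "what": "Queue", "value": "Breakfix"},
    {"interaction_id": "c", "what": "Status", "value": "Closed"},
    {"interaction_id": "d", "what": "message", "value": "=== Closed Without Comment ==="},
]

print(matching_interactions(sample))
```

Only the per-interaction summary needs to be kept, so memory grows with the roughly 3 million unique interaction_ids rather than with the total document count.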

nathen.hinson · April 9, 2018, 1:39pm

That's a great resource. Thanks Mark!

