Term query with search result

no_jihun · May 2, 2016, 2:44pm

Hi.

I know ES is not a kind of RDB and no rich support for Joins available.

But let me ask you this can be done with ES for check.

- index1 is index
- user is type,
- _id is auto generated
- 6 documents indexed.

/index1/user
{"user_no":1, "country":"us"}
{"user_no":2, "country":"jp"}
{"user_no":3, "country":"gb"}
{"user_no":4, "country":"in"}
{"user_no":5, "country":"id"}
{"user_no":6, "country":"us"}


- index2 is index
- activity is type
- _id is autogenerated
- 6 documents indexed.

/index2/activity
{"user_no":1, "point":10}
{"user_no":2, "point":20}
{"user_no":3, "point":13}
{"user_no":4, "point":23}
{"user_no":5, "point":44}
{"user_no":6, "point":19}

With data above, I want to do something like below in RDB.

select
  sum(point), count(*) 
from
  index2.activity
where 
  user_no in (
      select user_no from index1.user where country='us'
  )

I checked out https://www.elastic.co/guide/en/elasticsearch/reference/master/query-dsl-terms-query.html#query-dsl-terms-lookup
But It seems with 'Terms-loopk up', all terms(which will be used in IN cluase) should be exist in one document.

No way in ES?

Thanks.
Jihun.

nik9000 · May 2, 2016, 3:04pm

There are four imperfect ways that reflect the Elasticsearch's distributed system-ness:

Do it yourself in your application. This might be the worst way.
Parent/child. This works by routing all the document to the same shard that their parent will be on. Then you can join around parent and child. This is fairly niche but fairly useful. Not useful for when parents have lots of children though. I don't have a definition for "lots" sorry.
nested. This is like automatic denormalization but it still has all the performance characteristics of denormalization.
Manually denormalize. Make it look like:

/index/activity
{"point": 10, "user": {"number": 1, "county": "us"}}

None of these are great and, sadly, if you want to use parent/child you really need to experiment at the scale you expect. Here is a handy link about it.

nik9000 · May 2, 2016, 3:04pm

Or the best way, depending, I guess.

no_jihun · May 3, 2016, 1:09am

@nik9000 Got it. Thanks!

Topic		Replies	Views
Using Terms Query Elasticsearch	10	1679	April 5, 2018
Join on ElasticSearch Elasticsearch	11	2205	July 5, 2017
Join Possibilities for Nested / Parent-Child Elasticsearch	12	935	July 5, 2017
Simple sql type query in multiple index Elasticsearch	4	583	December 24, 2017
Multiple indices with different fields Elasticsearch	2	347	August 22, 2018

Term query with search result

Related topics