How to structure this index/query problem

martinjuhasz · February 20, 2020, 12:33pm

Hey,
i'm pretty new to using ES and wonder how one would solve my current problem, regarding how to index this best and how to query it.

As an example lets say i have two entities in my application. Users and Challenges.

What i want to do now is beeing able to list the following:

list all challenges for a user , either all, only the ones he declined or only the ones he accepted
list all challenges and how many have accpeted those

How would i index and query the part of accepting/declining challenges?

What i did for now is to index challenges and users, each in their own index. I'm able to query/filter etc challenges and users without problem.

What i fail to is to come up with a good way to query my additional needs.
Thanks for any hints about this!

dadoonet · February 20, 2020, 12:50pm

Welcome!

As all entities you are searching for are challenges, I'd only index challenges.

And each challenge would have all the information about the user who ran that challenge.

martinjuhasz · February 20, 2020, 12:52pm

so basically if i get you right... have one document per challenge which includes a property of all the id's of users that took the challenge and another prop for those who declined?

dadoonet · February 20, 2020, 1:39pm

I meant one document per challenge a single user ran.
The same challenge will be duplicated as many times as needed but with other user details.

My 2 cents.

martinjuhasz · February 20, 2020, 1:58pm

Ok interesting. How would i handle changes in the challenge entity then? f.e. one property changed (challenge name) across all documents with duplicated data?

Also how would one query "all challenges" not depending on a user without duplication?

dadoonet · February 20, 2020, 3:27pm

You can reindex all the documents that needs to be reindexed when a challenge changes.
You have to have an idea of the cost behind. Ie are you going to reindex 10m documents? Or less than that? How much?

Also does it make sense to change a challenge that happened in the past after a user has already submitted it? That's a business question of course.

About deduplication, may be aggregations would work for you.

Worse case, if you really need to have 2 distinct entities, have a look at the join data type also called parent/child. Although I'm not a big fan, your use case might be a good candidate for this feature.

system · March 19, 2020, 3:27pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
General question about indexing Elasticsearch	4	370	July 6, 2017
Call for ideas Elasticsearch	7	369	July 6, 2017
Searching for documents only if user has it Elasticsearch	2	302	July 6, 2017
Join kind of query Elasticsearch	5	301	July 6, 2017
Question about use case: gmail-like "I can only see my own emails"? Elasticsearch	11	459	July 6, 2017

How to structure this index/query problem

Related topics