How to index/search data in very-large subset of documents?

sangupta · June 12, 2015, 8:16am

Hi - this might be a newbie question but I could not find a related answer in my search.

The use-case contains a lot of documents (100 million+) each tagged with a unique case number. The number of case numbers are currently 100 thousand or so. Thus, each case number can have 1000+ documents easily.

We want to do a search within a group of case numbers for a given agent - say at the max say 2000 case numbers.

How do we model this:

Should we fire an IN query in case numbers during Search.
Or, should we have one document for a case number - and update the same search-document when a new user-document is added against that case number.

Thanks in advance.

Topic		Replies	Views
Architecture and performance question on searching small subsets of documents Elasticsearch	4	424	July 6, 2017
BulkIndexing Elastic Search elastic-app-search	2	427	November 22, 2019
Search to treat multiple documents as one Elasticsearch	2	707	July 5, 2017
Indexation of many documents in Appsearch Elastic Search elastic-app-search	1	457	October 16, 2019
Documents that only contain integers Elasticsearch	3	291	July 6, 2017

How to index/search data in very-large subset of documents?

Related topics