Warning this is probably not a good use case for Elasticsearch but...
I have an index which will containing millions of documents. We have built a real-time API which
allows customers search and return less than 1000 documents and a second API which will call a batch search API where customers will submit a search asynchronously and an app will search the index and when complete stream the resultss back to the consumer - our index and apps are deployed in AWS and the consumers are on prem. Consumers on the batch search may expect up to 2 million documents returned
I've tested the batch search API and is accurate at ~100K documents; however I have also searched using a criteria that should have return 1.6 million documents and missed returning a few hundred documents.
- How accurate should I expect the Elasticsearch response to be and 2) is there any method I can use to increase the accuracy of the response?
Thanks in advance