Hi, could use some help here.
A few days ago we ran into something on our ES 5.4 production cluster (4 nodes) with one of our indices (~500 GB): the exact same request to the same server was alternately returning different results and different total hits. The first call returned x total hits, then y, then x again, then y again, and so forth. Other requests on the same index showed the same issue, though the other indices in the cluster seemed unaffected.
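In case it helps with reproducing this, here is a rough sketch of a check for the behaviour: it sends the same search several times and tallies the `hits.total` values that come back. The helper names (`fetch_total`, `summarize`, `check`) are made up for illustration, and the URL, index name and query in the usage note are placeholders, not the real ones. It assumes the 5.x response format, where `hits.total` is a plain integer.

```python
import json
import urllib.request
from collections import Counter


def fetch_total(url, body):
    """POST one search request and return hits.total (an integer in ES 5.x)."""
    req = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["hits"]["total"]


def summarize(totals):
    """Count each distinct total and flag a strict x, y, x, y... pattern."""
    counts = Counter(totals)
    # "alternating" here means more than one distinct value was seen and
    # no two consecutive calls returned the same total.
    alternating = len(counts) > 1 and all(
        a != b for a, b in zip(totals, totals[1:])
    )
    return {"distinct": dict(counts), "alternating": alternating}


def check(url, body, n=10):
    """Send the identical request n times and summarize the totals."""
    return summarize([fetch_total(url, body) for _ in range(n)])
```

It could be called as, for example, `check("http://localhost:9200/my-index/_search", {"query": {"match_all": {}}})`. More than one key under `distinct` means the same request is producing different totals.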
We restored the index on a separate node and the problem persisted. Today, however, the issue disappeared practically by itself (we had reindexed a lot of the data, but not all of it). Still very worrying, of course.
Any idea what caused the problem, and how we could avoid it in the future?
The schema is quite big so I won't add it here, but let me know if more details are needed.