I have an ES cluster with ~500 indices and 3TB of data. I'm looking for the best way to sample data from this cluster into a much smaller test cluster for testing upgrades.
I don't need all the data in my test cluster. I just want to sample N number of documents from each index. Any good way to do that? I looked at logstash but didn't see any obvious way to get "the first 10K records" from the elasticsearch input plugin. Am I missing something? Some other tool out there that will do this for me?