Best way to "sample" an ES cluster for testing


(Matt Hughes) #1

I have an ES cluster with ~500 indices and 3TB of data. I'm looking for the best way to sample data from this cluster into a much smaller test cluster for testing upgrades.

I don't need all the data in my test cluster; I just want to sample N documents from each index. Is there a good way to do that? I looked at Logstash but didn't see any obvious way to get "the first 10K records" from the Elasticsearch input plugin. Am I missing something? Is there some other tool out there that will do this for me?
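
To make the goal concrete, here is a minimal sketch of one possible approach, which is an assumption and not something confirmed in this thread: run Elasticsearch's `_reindex` API with a `remote` source on the test cluster and cap each copy with `max_docs`. The helper name `build_sample_reindex_body`, the host `http://prod-es:9200`, and the index names are hypothetical. The sketch only builds the request bodies and sends nothing. It assumes a version that accepts `max_docs` (7.3 or later, as far as I know) and that the source host is listed under `reindex.remote.whitelist` on the test cluster.

```python
import json


def build_sample_reindex_body(index, remote_host, max_docs=10_000):
    """Build a _reindex request body that copies at most `max_docs`
    documents of `index` from a remote cluster into a same-named index.

    Hypothetical helper for illustration; it only builds the JSON body.
    """
    return {
        # Stop after this many documents have been copied.
        "max_docs": max_docs,
        "source": {
            # Remote (production) cluster to read from; assumed URL.
            "remote": {"host": remote_host},
            "index": index,
        },
        # Write into an index of the same name on the local (test) cluster.
        "dest": {"index": index},
    }


if __name__ == "__main__":
    # Placeholder index names; in practice the list would come from the
    # source cluster (for example via GET _cat/indices).
    for name in ["logs-2019.01", "logs-2019.02"]:
        body = build_sample_reindex_body(name, "http://prod-es:9200")
        print(f"POST _reindex  # sample of {name}")
        print(json.dumps(body, indent=2))
```

Note that this takes the first N documents the scroll returns rather than a random sample; adding a `query` under `source` would be one way to change which documents are picked.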

