If you use the Hadoop gateway to ship all your ES data to HDFS, is it
in a format amenable to running map-reduce jobs over, independently of
For example, it would be really useful to be able to do Pig queries
over the raw JSON document contents. Wonderdog (https://github.com/
infochimps/wonderdog) lets you do this via the ES cluster as a scan
query, but that will put load on ES. If the data's already being
written to the Hadoop cluster via a gateway, can you just analyse it
there? And if so, does anyone have an example?