Access Elasticsearch term information through ES-Hadoop API


Is it possible to get access to terms information that is indexed in ES and
run some kind of map-reduce on them using ES-Hadoop? For example, let's say
I have a CSV data set that was indexed into ES, with one particular field
that was setup to be analyzed using a snowball analyzer. So, can I read the
terms that were generated due to the snowball analyzer using ES-Hadoop?
Something similar to how the ES scripts give access to TF, TTF, etc.

Also, is it possible to get the generated ID for an ES document using the
ES-Hadoop APIs? This way, I could be able to query a document's term vector
by _id, and maybe do calculations on terms, one document at a time.


You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
To view this discussion on the web visit
For more options, visit