Elasticsearch - Spark: Retrieve only specific fields, not the whole document


Since retrieving aggregations is still not possible, I am looking for ways to reduce the I/O.
Retrieving millions of documents only to get the first and last one is really slow.
My new idea is to retrieve only the fields I need for my analysis rather than the whole document.

Is this possible with the ES Spark connector?

ES Version: 6.0.0
ES Spark Connector: elasticsearch-spark-20 6.0.0
Scala Version: 2.11.8
Spark Version: 2.2
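
To make the question concrete, here is a rough sketch of the kind of read I have in mind. It assumes the DataFrame API of the connector (the `org.elasticsearch.spark.sql` data source) and relies on Spark's column pruning: selecting only the needed columns should let the connector request just those fields from Elasticsearch, though I have not verified this on the versions above. The index name `my-index/doc`, the field names `timestamp` and `value`, and the node address are placeholders, not real names from my setup. For the RDD API, the connector documentation lists an `es.read.source.filter` setting that appears to serve the same purpose.

```scala
import org.apache.spark.sql.SparkSession

object FieldSubsetRead {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("es-field-subset")
      .getOrCreate()

    // Connection settings; host and port are placeholders for illustration.
    val esOptions = Map(
      "es.nodes" -> "localhost",
      "es.port"  -> "9200",
      "pushdown" -> "true" // let the connector translate Spark filters into ES queries
    )

    // Load the index as a DataFrame; "my-index/doc" is a hypothetical index/type.
    val docs = spark.read
      .format("org.elasticsearch.spark.sql")
      .options(esOptions)
      .load("my-index/doc")

    // Select only the fields needed for the analysis (hypothetical field names).
    // With column pruning, only these fields should be requested from ES.
    val slim = docs.select("timestamp", "value")

    slim.show(5)
    spark.stop()
  }
}
```

Running this requires the elasticsearch-spark-20 jar on the classpath and a reachable cluster.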
