Since retrieving aggregations is still not possible I am looking for ways to make the I/O smaller.
(retrieving millions of documents only to get the first and last is really slow)
My new thought would be to not retrieve the whole document but only the fields I need for my analysis
Is this possible with the ES Spark connector?
ES Version: 6.0.0
ES Spark Connector: elasticsearch-spark-20 6.0.0
Scala Version 2.11.8
Spark Version: 2.2