Elasticsearch - Spark Retrieve only specific fields and not the whole document

phytopius · January 10, 2018, 12:28pm

Since retrieving aggregations is still not possible, I am looking for ways to reduce the I/O
(retrieving millions of documents only to get the first and the last one is really slow).
My new idea is to retrieve only the fields I need for my analysis instead of the whole document.

Is this possible with the ES Spark connector?

ES Version: 6.0.0
ES Spark Connector: elasticsearch-spark-20 6.0.0
Scala Version: 2.11.8
Spark Version: 2.2
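
For reference, a minimal sketch of what field-limited reads might look like with the versions listed above. It is an illustration under assumptions, not a confirmed answer from this thread: the index/type `my-index/my-type`, the field names `timestamp` and `value`, and the node address `localhost:9200` are all hypothetical placeholders. It relies on two connector behaviours described in the es-hadoop configuration documentation: the `es.read.source.filter` setting for the schema-less RDD API, and projection pushdown for the Spark SQL (DataFrame) API, where a `select` should narrow the fields requested from Elasticsearch.

```scala
import org.apache.spark.sql.SparkSession
import org.elasticsearch.spark._ // adds esRDD to SparkContext

object EsFieldProjectionSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("es-field-projection-sketch")
      .config("es.nodes", "localhost:9200") // hypothetical cluster address
      .getOrCreate()

    // DataFrame API: the connector knows the schema, so selecting columns
    // lets it request only those fields from Elasticsearch.
    val df = spark.read
      .format("org.elasticsearch.spark.sql")
      .load("my-index/my-type")      // hypothetical index/type
      .select("timestamp", "value")  // hypothetical field names

    df.printSchema()

    // RDD API: no schema is available, so the wanted fields are listed
    // explicitly as a comma-separated string.
    val rdd = spark.sparkContext.esRDD(
      "my-index/my-type",
      Map("es.read.source.filter" -> "timestamp,value")
    )

    // Each element is (document id, map of the returned fields).
    rdd.take(5).foreach(println)

    spark.stop()
  }
}
```

The two variants are kept separate on purpose: according to the connector documentation, setting `es.read.source.filter` while the DataFrame integration tries to push down its own field list can raise an error, so the setting is only passed to the RDD read here.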

system · February 7, 2018, 12:30pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
