Compressions while reading from ES

lbadi · March 9, 2018, 8:46pm

Hi guys!
We are moving data between clouds using Spark and ES.
Is there any way to read using gzip compression? Any advice on how we can move a large amount of data daily from an active index to a Spark cluster hosted in another cloud?

james.baiera · March 17, 2018, 12:25am

This is a frequent ask from the community, and one that I am certainly on board with. Sadly, we do not support it at the moment. We're currently using the built-in Apache HTTP client implementation that ships with the Hadoop ecosystem libraries. In Hadoop 2.8+ this library is scrapped, so we're looking to align with the Elasticsearch project and adopt their low-level REST client (potentially shading it and packaging it with the connector). That library is modern enough to support compression options like gzip.
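For readers unfamiliar with what gzip support means at the HTTP level, here is a minimal sketch in plain Python. It is not es-hadoop code and not the connector's API. It only illustrates the general mechanism: the client sends an `Accept-Encoding: gzip` header, and it decompresses the body if the server replies with `Content-Encoding: gzip`. The helper names `build_search_request` and `decode_body`, the URL, and the index name are hypothetical, and the sketch assumes HTTP compression is enabled on the Elasticsearch side.

```python
import gzip
import json
import urllib.request


def build_search_request(base_url, index, query):
    """Build (but do not send) a _search request that asks for a gzip response.

    base_url and index are placeholders chosen for illustration.
    """
    body = json.dumps(query).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/{index}/_search",
        data=body,
        headers={
            "Content-Type": "application/json",
            # Tells the server the client can handle a gzip-compressed body.
            "Accept-Encoding": "gzip",
        },
        method="POST",
    )


def decode_body(raw, content_encoding):
    """Decompress the response body if it is gzip-encoded, then parse the JSON."""
    if content_encoding and content_encoding.lower() == "gzip":
        raw = gzip.decompress(raw)
    return json.loads(raw.decode("utf-8"))
```

To use it, you would pass the request to `urllib.request.urlopen`, then call `decode_body(resp.read(), resp.headers.get("Content-Encoding"))` on the response. Search hits repeat the same field names in every document, so they usually compress well, which is why this matters for cross-cloud transfers.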

