Hadoop-Elasticsearch - Avro Support

ajaybhatnagar · July 7, 2015, 2:36pm

Hi,

I am looking into possible integration of Elasticsearch and Hadoop using es-hadoop. Would like to know if it can support Avro format from/to Hadoop and Elasticsearch ?

If yes, are there any requirements / plugins needed to support or some special configuration requirements.
If it is not supported currently, are there any plans for this to be available ?

Thanks
Ajay

vijaym123 · July 8, 2015, 7:31am

REGISTER piggybank.jar and read your Avro data and then Store it on your ES cluster with EsStorage!

Eg:

REGISTER piggybank.jar;
records = LOAD '/input/data' USING org.apache.pig.piggybank.storage.avro.AvroStorage('no_schema_check',
'schema_file', 'examples/schema/test.avsc');
STORE records INTO 'library/book' USING org.elasticsearch.hadoop.pig.EsStorage('es.http.timeout=5m','es.index.auto.create=false' );
Cheers,
Vijay

ajaybhatnagar · July 8, 2015, 1:59pm

Thanks Vijay
We will give it a try.
Ajay

costin · July 9, 2015, 9:27am

Thanks for the example @vijaym123. es-hadoop supports any data format
supported/available in Hadoop. As in the example below, simply use the
appropriate Storage/Input/OuputFormat/Loader and you're set.
This is consistent across all the libraries supported by the connector -
Map/Reduce, Hive, Pig, Spark, Storm, etc...

Topic		Replies	Views
Indexing logs with es-hadoop Elasticsearch es-hadoop	2	1470	July 6, 2017
Using EsStorage for nested data Elasticsearch es-hadoop	1	794	July 6, 2017
Avro support in elasticsearch Elasticsearch	1	468	July 6, 2017
Integration of hadoop (specifically HDFS files) with ELK stack Elasticsearch es-hadoop	2	641	September 11, 2019
Use cases for es-hadoop Elasticsearch es-hadoop	3	1170	November 20, 2019

Hadoop-Elasticsearch - Avro Support

Related topics