Value got nulled when ingesting to ES from Hadoop using Spark

anung · December 9, 2020, 1:53pm

My environment:

Cloudera Hadoop 6.3
Elasticsearch 7.10
Spark 2.4.0
elasticsearch-spark-20_2.11-7.10.0.jar

My code is very simple:

import org.elasticsearch.spark.rdd.api.java.JavaEsSpark;

JavaEsSpark.saveToEs(rdd_tJava_1, "nyc_yellow_taxi/docs");

The write completes as expected and the record count is the same. But when I inspect the records, some columns are null, and they are null in every record.

These columns:

pulocationid, vendorid, dolocationid, dolocationid

are null.

Here is an example record in Hadoop:

And here is the output in Elasticsearch:

Any ideas?
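
One way to narrow this down might be to check whether the fields are already null on the Spark side before saveToEs is called. The following is only a minimal sketch under assumptions: it assumes each record can be viewed as a Map<String, Object>, which the post does not state, and the class name NullFieldCheck, the method nullFields, and the sample values are all hypothetical. It reports which expected field names are missing or null in a record, including the case where a key differs only by letter case.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of elasticsearch-hadoop.
final class NullFieldCheck {

    // Returns the expected field names that are absent from the record
    // or present with a null value. Key lookup is case-sensitive.
    static List<String> nullFields(Map<String, Object> record, List<String> expected) {
        List<String> missing = new ArrayList<>();
        for (String name : expected) {
            if (record.get(name) == null) {
                missing.add(name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Made-up sample record, for illustration only.
        Map<String, Object> record = new LinkedHashMap<>();
        record.put("vendorid", 1);
        record.put("PULocationID", 142); // key differs from "pulocationid" only by case
        record.put("dolocationid", null); // value already null before the write

        List<String> expected = Arrays.asList("vendorid", "pulocationid", "dolocationid");
        System.out.println(nullFields(record, expected)); // prints [pulocationid, dolocationid]
    }
}
```

If a check like this reports the fields as null for a few records sampled from the RDD, the values are lost before the write rather than during it. If it reports nothing, the field names or mapping on the Elasticsearch side would be the next place to look.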

