Es.read.field.as.array.include multiple values

Ramesh_Nuthalapati · May 8, 2020, 11:33pm

I understand from documentation that when I get this error

SparkException: Job aborted due to stage failure: Task 0 in stage 20.0 failed 4 times, most recent failure: Lost task 0.3 in stage 20.0 (TID 75, 10.244.241.30, executor 4): org.elasticsearch.hadoop.rest.EsHadoopParsingException: org.elasticsearch.hadoop.EsHadoopIllegalStateException: Field 'metadata.a.b.c.d' not found; typically this occurs with arrays which are not mapped as single value

I should do

spark.read.xx.
.option("es.read.field.as.array.include", "metadata.a.b.c,metadata.a.b.c.d,metadata.a.b.c.d.e")

But it keep throwing the error even I include the option above.

Apache Spark 2.4.5,
Scala 2.11
ES hadoop 7.6.2

Any thoughts?

james.baiera · May 14, 2020, 8:09pm

es.read.field.as.array.include is a finicky property that can cause some very confusing serialization errors in the connector. Can you post a mapping and an average document that you're trying to read? Are the c.d.e fields meant to be treated as 3 nested arrays?

system · June 11, 2020, 8:09pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Field not found; typically this occurs with arrays which are not mapped as single value Elasticsearch es-hadoop	9	6423	July 6, 2017
NullPointerException when settings "es.read.field.as.array.include" options Elasticsearch es-hadoop	7	1703	July 6, 2017
Handling array values while reading from elasticsearch in spark using elasticsearch-spark Elasticsearch es-hadoop	1	961	November 19, 2020
PySpark fails to read multiple nested levels of Elasticsearch index Elasticsearch es-hadoop	1	1273	November 18, 2017
Reading from ES using spark issues (colon in options) Elasticsearch	3	981	February 28, 2018

Es.read.field.as.array.include multiple values

Related topics