ScalaMapFieldExtractor Cannot extract value from entity

simonlucalandi · June 7, 2017, 8:19am

Hello there.
I have an RDD[SubscriptionInfo], where SubscriptionInfo is defined as a case class.

I'm trying to save the rdd to ES and I want to specify the _id to use, so I'm using something like this:

rdd.saveToEs("spark/docs", Map("es.mapping.id" -> "sub_id"))

but I get an error:

Caused by: org.elasticsearch.hadoop.EsHadoopIllegalArgumentException: [ScalaMapFieldExtractor for field [[sub_id]]] cannot extract value from entity [class java.lang.String] | instance [Some(SubscriptionInfo(ea9db3d23862862e68886de79bd2d489_4307394759,None,Some(List(Billing(Sun May 07 00:03:06 UTC 2017,2.0,None))),None,None,2.0,EmptyLTV()))]

What is the correct way to specify the mapping id in this case?

simonlucalandi · June 7, 2017, 12:21pm

It seems that the solution is to use an additional map stage, as suggested in https://discuss.elastic.co/t/cannot-extract-value-from-entity/66556

rdd.map({case Some(subscriptionInfo) => subscriptionInfo}).saveToEs("spark/docs", Map("es.mapping.id" -> "sub_id"))
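One caveat with the pattern above: a partial function passed to map throws a scala.MatchError as soon as it meets a None. A minimal sketch of a safer unwrap, shown on a plain Seq so it runs without Spark or Elasticsearch. The two-field SubscriptionInfo and the unwrap helper are hypothetical stand-ins, not the real class from the question:

```scala
// Hypothetical, trimmed-down stand-in for the case class in the question.
case class SubscriptionInfo(sub_id: String, amount: Double)

object UnwrapOptions {
  // collect with a partial function keeps the matching elements and
  // silently skips the rest, so None entries are dropped instead of
  // raising a MatchError the way map { case Some(x) => x } would.
  def unwrap(records: Seq[Option[SubscriptionInfo]]): Seq[SubscriptionInfo] =
    records.collect { case Some(info) => info }

  def main(args: Array[String]): Unit = {
    val records: Seq[Option[SubscriptionInfo]] = Seq(
      Some(SubscriptionInfo("a_1", 2.0)),
      None,
      Some(SubscriptionInfo("b_2", 3.5))
    )
    // Only the two defined records survive, each exposing sub_id directly.
    unwrap(records).foreach(println)
  }
}
```

As far as I know, Spark's RDD has the same partial-function overload of collect, so rdd.collect { case Some(info) => info }.saveToEs("spark/docs", Map("es.mapping.id" -> "sub_id")) should behave the same way on the cluster, assuming the RDD elements really are Option[SubscriptionInfo] as the error message suggests.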

james.baiera · June 7, 2017, 2:04pm

My advice would be to avoid using Option types with ES-Hadoop, since most of its logic is already null-aware.

