Error while indexing list to Elastic search using pyspark

Ravi_Ranjan · January 16, 2018, 1:00pm

Hi, we are facing an issue while reading a BSON file in PySpark and then indexing it into ES.
The error concerns certain keys in the BSON file whose values are arrays. The error is as follows:
Data of type java.util.ArrayList cannot be used.
There is an answer to a similar problem, but it was not helpful. The link is given below

How to get rid of this?

james.baiera · January 29, 2018, 4:40am

Could you share the full stack trace?

Ravi_Ranjan · January 29, 2018, 5:38am

Solved the issue. The error was with Spark not being able to handle the ArrayList type, so I converted all the ArrayLists in my data to tuples, which worked.
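
For anyone landing here later, the conversion described above might look roughly like the sketch below. It is not the code actually used in this thread: the helper name `lists_to_tuples` and the sample document are made up for illustration, and it assumes each record is a plain Python dict, possibly nested, before it is handed to the ES writer.

```python
def lists_to_tuples(value):
    """Recursively replace every list inside `value` with a tuple.

    Hypothetical helper, not part of PySpark or es-hadoop.
    """
    if isinstance(value, (list, tuple)):
        # Convert the sequence itself and anything nested inside it.
        return tuple(lists_to_tuples(item) for item in value)
    if isinstance(value, dict):
        # Keep the keys, convert the values.
        return {key: lists_to_tuples(item) for key, item in value.items()}
    # Scalars (str, int, None, ...) pass through unchanged.
    return value


# Made-up sample document standing in for one BSON record.
doc = {"id": 1, "tags": ["a", "b"], "nested": {"scores": [1, 2, [3, 4]]}}
converted = lists_to_tuples(doc)

# On an RDD of (key, dict) pairs this would presumably be applied per record
# before saving, for example:
#   rdd = rdd.map(lambda kv: (kv[0], lists_to_tuples(kv[1])))
```

The helper only changes the container type; the values and the field names stay the same.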

james.baiera · January 29, 2018, 5:39am

Great to hear, cheers!

system · February 26, 2018, 5:39am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.