Hi, we are facing an issue while reading a bson file as pyspark and then indexing it in ES.
The error is regarding the type of certain keys in the bson file which is an array. The error is as follows:
Data of type java.util.ArrayList cannot be used.
There is an answer to similar problem which is not helpful. The link is given below
Solved the issue. The error was with spark not being able to handle arrayList type. So i converted all the arraylists in my data to tuple which worked.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.