Python, Elasticsearch, and Apache Spark: simple data reading

I'm trying to read data from Elasticsearch using Apache Spark, the Hadoop connector, and Python, and it's giving me a real headache.

The Python code is:

from pyspark import SparkConf, SparkContext
from pyspark.sql import SQLContext

# Run locally and point the connector at the Elasticsearch node
conf = SparkConf()
conf.setMaster("local")
conf.setAppName("New APP")
conf.set("es.nodes", "192.168.0.3:9200")
sc = SparkContext(conf=conf)

# Read the index into a DataFrame through the elasticsearch-hadoop data source
sqlContext = SQLContext(sc)
df = sqlContext.read.format("org.elasticsearch.spark.sql") \
    .option("es.resource", "my.index-name-2022") \
    .load()

The launch command is:

.\spark-submit --master local --jars ..\mjars\elasticsearch-hadoop-8.2.2.jar  'E:\ts_rfd\rfd_pych\spark_test\main.py'

But I get:

Traceback (most recent call last):
  File "E:\ts_rfd\rfd_pych\spark_test\main.py", line 21, in <module>
    df = sqlContext.read.format("org.elasticsearch.spark.sql").option("es.resource", "my.index-name-2022").load()
  File "E:\rfd_user\spark-3.3.0-bin-hadoop3\python\lib\pyspark.zip\pyspark\sql\readwriter.py", line 210, in load
  File "E:\rfd_user\spark-3.3.0-bin-hadoop3\python\lib\py4j-0.10.9-src.zip\py4j\java_gateway.py", line 1304, in __call__
  File "E:\rfd_user\spark-3.3.0-bin-hadoop3\python\lib\pyspark.zip\pyspark\sql\utils.py", line 111, in deco
  File "E:\rfd_user\spark-3.3.0-bin-hadoop3\python\lib\py4j-0.10.9-src.zip\py4j\protocol.py", line 326, in get_return_value
py4j.protocol.Py4JJavaError: An error occurred while calling o45.load.
: java.lang.NoClassDefFoundError: scala/Product$class
        at org.elasticsearch.spark.sql.ElasticsearchRelation.<init>(DefaultSource.scala:228)
        at org.elasticsearch.spark.sql.DefaultSource.createRelation(DefaultSource.scala:97)
        at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:355)
        at org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:325)
        at org.apache.spark.sql.DataFrameReader.$anonfun$load$3(DataFrameReader.scala:307)
        at scala.Option.getOrElse(Option.scala:189)
        at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:307)
        at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:225)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
        at java.lang.reflect.Method.invoke(Unknown Source)
        at py4j.reflection.MethodInvoker.invoke(MethodInvoker.java:244)
        at py4j.reflection.ReflectionEngine.invoke(ReflectionEngine.java:357)
        at py4j.Gateway.invoke(Gateway.java:282)
        at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:132)
        at py4j.commands.CallCommand.execute(CallCommand.java:79)
        at py4j.GatewayConnection.run(GatewayConnection.java:238)
        at java.lang.Thread.run(Unknown Source)
Caused by: java.lang.ClassNotFoundException: scala.Product$class
        at java.net.URLClassLoader.findClass(Unknown Source)
        at java.lang.ClassLoader.loadClass(Unknown Source)
        at java.lang.ClassLoader.loadClass(Unknown Source)
        ... 19 more

My current setup is:

Apache Spark: 3.3.0-hadoop3
Elasticsearch: 8.2
Python: 3.9
Java: 1.8.0_333
Scala: 2.13.8
Elasticsearch Hadoop: 8.2.2
OS: Windows 10

This is driving me crazy; I have already tried swapping in many different versions of these dependencies.

Many thanks!

Hi @Silver137. I think the problem is a Scala version compatibility issue. Unfortunately, neither Spark nor Scala artifacts are generally compatible across versions. The connector classes that ship in the big Hadoop jar (elasticsearch-hadoop-8.2.2.jar) are built for Spark 2 / Scala 2.11. Since you are using Scala 2.13 and Spark 3.3, you want the elasticsearch-spark-30_2.13 artifact (available via Maven Central Repository Search). You can read a little more about this at Issue Using the Connector from PySpark in 7.17.3 - #3 by Keith_Massey.
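
For example, a launch command along these lines should pick up the right connector build (a sketch, assuming the Maven coordinates org.elasticsearch:elasticsearch-spark-30_2.13:8.2.2 and that the _2.13 suffix matches the Scala version your Spark distribution was actually built with):

.\spark-submit --master local --packages org.elasticsearch:elasticsearch-spark-30_2.13:8.2.2 'E:\ts_rfd\rfd_pych\spark_test\main.py'

The --packages flag resolves the jar and its dependencies from Maven Central at submit time; if you prefer to keep using --jars, download the matching elasticsearch-spark-30_2.13 jar and point --jars at that file instead of the elasticsearch-hadoop uber jar.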
