Hi Everyone,
I'm trying to write a PySpark DataFrame to an Elastic Cloud index.
Below is the code; however, PySpark is unable to find the org.elasticsearch.spark.sql data source.
Environment - Databricks
Databricks Runtime - 16.3
(
    df
    .write
    .mode("overwrite")
    .format("org.elasticsearch.spark.sql")                 # elasticsearch-spark (ES-Hadoop) data source
    .option("es.resource", "INDEX_NAME")                   # target index
    .option("es.write.operation", "index")                 # plain index writes
    .option("es.net.http.header.Authorization", "TOKEN")   # auth header (API key / bearer token)
    .option("es.nodes", conf['host'])                      # Elastic Cloud endpoint
    .option("es.port", conf['port'])
    .save()
)
Error:
org.apache.spark.SparkClassNotFoundException: [DATA_SOURCE_NOT_FOUND] Failed to find the data source: org.elasticsearch.spark.sql. Make sure the provider name is correct and the package is properly registered and compatible with your Spark version
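
From what I can tell, this data source comes from the elasticsearch-spark connector jar, which isn't something pip can install; it has to be on the cluster's classpath. Below is a minimal sketch of what I'd try, assuming the Spark 3.x / Scala 2.12 build of the connector (the Maven coordinate and version are my guesses, not verified against DBR 16.3):

# Sketch: pull the elasticsearch-spark connector via Maven coordinates.
# Assumption: org.elasticsearch:elasticsearch-spark-30_2.12:8.11.0 matches
# the cluster's Spark/Scala versions. On Databricks, installing it as a
# cluster library (Libraries > Install new > Maven) may be required instead,
# since spark.jars.packages is ignored once a session already exists.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .config("spark.jars.packages",
            "org.elasticsearch:elasticsearch-spark-30_2.12:8.11.0")
    .getOrCreate()
)

Is attaching the connector this way (or as a cluster library) the right fix, or is something else missing?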
Please share your suggestions.