Issue with Loading Data from Elasticsearch into Databricks

Hernando_Segovia · September 5, 2024, 9:02am

Hello,

I'm encountering an issue while trying to load data from Elasticsearch into Databricks. Below is the code I'm using and the error message I'm receiving.

Code:

es_read_conf = {
    "es.nodes": "your-cluster-url",
    "es.port": "443",
    "es.net.http.auth.header": "Authorization: ApiKey <your-api-key>",
    "es.resource": "b-s-data",
    "es.net.ssl": "true",
    "es.nodes.wan.only": "true"
}

df = spark.read.format("org.elasticsearch.spark.sql").options(**es_read_conf).load("beat-starnet-data")

display(df)

Error:

kotlin

Py4JJavaError: An error occurred while calling o492.load.
: org.elasticsearch.hadoop.EsHadoopIllegalArgumentException: Cannot detect ES version - typically this happens if the network/Elasticsearch cluster is not accessible or when targeting a WAN/Cloud instance without the proper setting 'es.nodes.wan.only'
    at org.elasticsearch.hadoop.rest.InitializationUtils.discoverClusterInfo(InitializationUtils.java:403)
    at org.elasticsearch.spark.sql.ElasticsearchRelation.cfg$lzycompute(DefaultSource.scala:234)
    ...
Caused by: org.elasticsearch.hadoop.rest.EsHadoopInvalidRequest: org.elasticsearch.hadoop.rest.EsHadoopRemoteException: security_exception: missing authentication credentials for REST request [/]

Details:

The Elasticsearch cluster is accessible, as verified by basic connectivity tests.
The cluster status is yellow, but there are no signs of connectivity issues.
I am using the correct version of the Elasticsearch-Hadoop connector for Elasticsearch 8.6.2.

Questions:

Main Error: The primary error is security_exception: missing authentication credentials for REST request [/. What could be causing this authentication issue?
Configuration: Are there additional settings or adjustments needed for proper integration with a cloud-based Elasticsearch cluster?

Any guidance or suggestions on resolving this issue would be greatly appreciated.

Thank you!

carly.richmond · September 5, 2024, 9:40am

From Elastic Search to Elasticsearch

carly.richmond · September 5, 2024, 9:47am

Hi @Hernando_Segovia,

Welcome! I'll be honest, I'm not too familiar with Databricks or Apache Spark. But looking at the error it looks to not be accessible from your code:

org.elasticsearch.hadoop.EsHadoopIllegalArgumentException: Cannot detect ES version - typically this happens if the network/Elasticsearch cluster is not accessible or when targeting a WAN/Cloud instance without the proper setting 'es.nodes.wan.only'

From the details you've provided can you:

Diagnose the reason for the yellow cluster status as per the tips in the documentation. Specifically if you can share the output of the cluster health status that would be useful.
Can you check the Elasticsearch URL, port and API key values in your code are correct, and that the key has sufficient permissions to read from the index you want to read from?
Are you using a WAN/Cloud instance as mentioned in the warning? Just checking since you have set es.nodes.wan.only to true in your code.

Let us know!

Keith_Massey · September 5, 2024, 2:41pm

Hi @Hernando_Segovia. I don't believe that es-hadoop supports es.net.http.auth.header (although I might be wrong). Is there anything in the executor logs for your spark job related to authentication?

system · October 3, 2024, 2:42pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Problems connecting to ES from Databricks using spark connector Elasticsearch es-hadoop	3	690	June 5, 2023
Elastic - Spark connector failing to read data Elasticsearch es-hadoop	8	1105	June 29, 2023
Save data from Databricks to ElasticSearch Cloud Elasticsearch	1	1900	February 24, 2020
Problems connecting to ES Cross cluster search cluster indexes from Databricks using spark connector Elasticsearch es-hadoop	4	405	July 27, 2023
Spark elasticcloud connection issue Elasticsearch	1	494	November 21, 2018

Issue with Loading Data from Elasticsearch into Databricks

Related topics