I am attempting to read/write with es-hadoop, but I am getting the following error in Databricks:
EsHadoopInvalidRequest: [HEAD] on [index_name] failed; server [https://serveraddress.com/es:443] returned [405|Method Not Allowed:]
Note that the port comes after the prefix, which contradicts what the documentation says for the prefix setting:
es.nodes.path.prefix (default empty)
Prefix to add to all requests made to Elasticsearch. Useful in environments where the cluster is proxied/routed under a certain path. For example, if the cluster is located at someaddress:someport/custom/path/prefix, one would set es.nodes.path.prefix to /custom/path/prefix.
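Spelling out my reading of the docs versus what the error message shows (using the host and index from the error above):

# What the docs imply the connector should build: scheme://host:port + prefix + path
expected = "https://serveraddress.com:443" + "/es" + "/index_name"
# What the 405 suggests it is actually building: prefix before the port
observed = "https://serveraddress.com" + "/es" + ":443" + "/index_name"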
My config in Databricks just looks like this:
df = (spark.read.format("org.elasticsearch.spark.sql")
    .option("es.nodes", es_url)  # es_url is the bare host, e.g. "serveraddress.com"
    .option("es.nodes.wan.only", "true")
    .option("es.nodes.path.prefix", "/es")
    .option("es.port", "443")
    .load("index_name"))
From the Databricks terminal, if I attempt to curl using
curl https://serveraddress.com:443/es/_search
I get the expected response. However, if I flip it and run
curl https://serveraddress.com/es:443/_search
it fails with the same 405 error. This also works successfully:
curl https://serveraddress.com/es/_search
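For completeness, the same checks can be reproduced from a notebook cell with Python's requests library (a sketch; I'm assuming requests is available on the cluster and that TLS verification succeeds against this endpoint):

import requests

# Mirrors the working curl: port before the prefix
print(requests.get("https://serveraddress.com:443/es/_search").status_code)

# Mirrors the failing form: prefix before the port, as the connector builds it
print(requests.get("https://serveraddress.com/es:443/_search").status_code)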
So if there is a way to omit the port entirely, that would be great, but es.port seems to default to 9200 and I haven't figured out how to prevent that.
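What I would ideally like to write is something like the sketch below, with no es.port at all. This is a guess, not something I have confirmed the connector supports; in particular, I don't know whether es.nodes accepts a full URL with a path:

df = (spark.read.format("org.elasticsearch.spark.sql")
    .option("es.nodes", "https://serveraddress.com/es")  # guessed: full URL, no explicit port
    .option("es.nodes.wan.only", "true")
    .load("index_name"))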
Any suggestions would be appreciated. Is this a bug that warrants an issue on the GitHub project?