ES Spark API writing to a Cluster failes when connecting to CLient Node

Ramdev_Wudali · November 16, 2015, 4:04pm

Hi:
I am trying to use the Spark API to write data from a file to an ES index.
My ES Cluster config :
3 data nodes (with HTTP disabled)
2 Client nodes (with no data stored on the nodes).

As I understand, the Spark API works only via the HTTP interface. So when I get my process to connect to the Client Nodes, I get an exception that indicates that process does not have access to any of the shards.

Can I use the Spark API in a scenario like mine where I do not have HTTP access to the data nodes.

Thanks

Ramdev

costin · November 16, 2015, 4:28pm

If you are using client nodes, you just need to configure ES-Hadoop accordingly - see es.nodes.client.only.

Ramdev_Wudali · November 16, 2015, 6:08pm

Hi Costin:
Thanks for the brilliant tip I had not looked at all the config
options available (my bad). That said, I was wondering if there was a
reason to go only http and not Transport client ?I mean why not enable the
transport client ?

Thanks much

Ramdev

costin · November 16, 2015, 6:28pm

Transport client is meant for internal use - it might be deprecated in the
future and it is tied to the version of ES being used. Further more it is
fairly big.
REST on the other hand is fully supported, easy to debug/monitor/route,
provides version isolation and in general, is fully available through
existing libraries thus ES-Hadoop itself is quite small.

Further more, having hundreds of clients connecting to ES through REST vs
transport is better (much more scalable) in particular from the perspective
of a node client.

Topic		Replies	Views
Getting a "No data nodes with HTTP-enabled available" error when writing from Spark to elasticsearch on Google Dataproc Elasticsearch es-hadoop	7	4184	October 6, 2017
Client-only routing specified but no client nodes with HTTP-enabled available Elasticsearch	2	1557	July 6, 2017
Can we create a node-client with ES/Hadoop or Transport client is the only way out? Elasticsearch es-hadoop	6	1248	July 6, 2017
Using Spark DataSource with ES Hadoop Elasticsearch es-hadoop	2	693	July 6, 2017
Facing EsHadoopIllegalStateException when reading from a ES cluster Elasticsearch es-hadoop	3	3889	July 6, 2017

ES Spark API writing to a Cluster failes when connecting to CLient Node

Related topics