Hi, I'm pretty new to Hadoop, and I'm following this tutorial to try to integrate elasticsearch to Hadoop. Everything went well util I tried to execute below statement:
Exception was thrown.. java.lang.NoClassDefFoundError: org/apache/commons/httpclient/URIException..
I followed this post Hive (HDP 2.3) and ES-Hadoop Integration Issue to manually add the httpclient jar file in, but the error persisted. How can I fix this issue?
Could you post which versions of the systems and artifacts you are using as well as the classpath from your executing hive tasks? Could you also post the commands you used to add the httpclient jar? Thanks!
Hi James,
I'm using Hortonworks Sandbox 2.4 for VMWare, and elasticsearch-hadoop 2.3.2
The elasticsearch version i'm using 2.3.3.
I copied all ES related JARs to the hive lib directory, used the following commands to add in the JARs
ADD JAR /usr/hdp/2.4.0.0-169/hive/lib/elasticsearch-hadoop-2.3.2.jar
ADD JAR /usr/hdp/2.4.0.0-169/hive/lib/httpclient-4.4.jar
If I do the LIST JARS command, I only see the above two JARs listed.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.