How to fix guava version conflicts with Hadoop YARN classpath?

royce-pandora · March 24, 2016, 6:44pm

Hi,

I've spent several hours trying to fix a problem with Elastic Search v2.2 Java lib and Cloudera Hadoop Cluster v5.5.1.

The problem:

Guava-18.0.jar is not being recognized by elastic search v2.2 when used by a Pig (v0.11.0) UDF. As a result, the stack trace below occurs.

I've read and implemented the To Shade or not To Shade article solution, however, I continue to get the exception error below.

How do I make use of guava-18.0.jar for elastic search v2.2 while other hadoop projects use older guava versions in the class path?

2016-03-24 17:52:17,450 ERROR [main] org.apache.hadoop.mapred.YarnChild: Error running child : java.lang.NoSuchMethodError: com.google.common.util.concurrent.MoreExecutors.directExecutor()Ljava/util/concurrent/Executor;
at org.elasticsearch.threadpool.ThreadPool.(ThreadPool.java:190)
at org.elasticsearch.client.transport.TransportClient$Builder.build(TransportClient.java:133)
at com.nextbigsound.find.FindConfig.getElasticSearchClient(FindConfig.java:176)
at com.nextbigsound.hadoop.analytics.find.FindTracksStoreFunction.prepareToWrite(FindTracksStoreFunction.java:252)
at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigOutputFormat$PigRecordWriter.(PigOutputFormat.java:125)
at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigOutputFormat.getRecordWriter(PigOutputFormat.java:86)
at org.apache.hadoop.mapred.ReduceTask$NewTrackingRecordWriter.(ReduceTask.java:540)
at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:614)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:389)
at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:163)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1671)
at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)

Amandeep_Singh · April 15, 2016, 11:44pm

Having exactly the same issue.

Amandeep_Singh · April 16, 2016, 12:08am

Just solved it, apparently the hadoop version I was running the bulk loader had an earlier version of guava jar in the path.
I was able to solve it by setting
mapreduce.job.user.classpath.first to true

Point being, make sure there is no other guava jar in the path than you supply

royce-pandora · April 19, 2016, 4:57pm

The way I solved this issue is by using the es-hadoop connector by elastic search. It avoids the guava version issue entirely and, in my case, replaces a custom elastic search implementation.

I'll write a blog post and share how I solved the issue using Pig and es-hadoop.

Sunil_G.C · June 20, 2016, 5:50am

How and where to set? Please tell in detail.

SaurabhM · September 1, 2016, 9:09pm

We also have the same issue any further details.

Sunil_G.C · September 2, 2016, 1:41am

Issue can be fix by relocating guava jar by maven and use guava 18.0 jar mention on pom.xml file.

nmalinip · September 2, 2016, 7:15pm

We are facing same issue....Can you provide more details on how to set . Here is my build.sbt

libraryDependencies ++= Seq(
"org.apache.spark" %% "spark-core" % sparkVersion % "provided",
"org.apache.spark" %% "spark-streaming" % sparkVersion % "provided",
"org.apache.spark" %% "spark-streaming-kafka" % sparkVersion,
"org.apache.spark" %% "spark-sql" % sparkVersion % "provided",
"org.elasticsearch" % "elasticsearch-hadoop" % "5.0.0-alpha4",
"org.elasticsearch" % "elasticsearch" % "2.3.4",
"org.elasticsearch.plugin" % "shield" % "2.3.4" from "https://maven.elasticsearch.org/releases/org/elasticsearch/plugin/shield/2.3.4/shield-2.3.4.jar",
"joda-time" % "joda-time" % "2.7",
"com.databricks" %% "spark-xml" % "0.3.3",
"com.sun.jersey" % "jersey-servlet" % "1.19",
"com.google.guava" % "guava" % "18.0",
"com.amazonaws" % "aws-java-sdk" % "1.11.26",
"com.typesafe" % "config" % "1.3.0",
"com.databricks" %% "spark-csv" % "1.4.0",
"org.apache.spark" %% "spark-mllib" % sparkVersion

)

resolvers ++= Seq(
"Akka Repository" at "http://repo.akka.io/releases/",
"scala-tools" at "https://oss.sonatype.org/content/groups/scala-tools"
//"elasticsearch-releases" at "https://maven.elasticsearch.org/releases"
)

Sunil_G.C · September 3, 2016, 5:43am

Please refer to this link here to relocate the jar.

Thanks,

Topic		Replies	Views
Spark & ElasticSearch clash (guava) Elasticsearch	1	1025	July 5, 2017
Guava version confliction Elasticsearch	3	536	July 5, 2017
ElasticSearch spark yarn -hadoop classpath Elasticsearch es-hadoop	1	807	December 9, 2016
TransportClient in 2.1.x Elasticsearch	10	3578	July 5, 2017
Exception when using Elasticsearch-spark and Elasticsearch-core together Elasticsearch es-hadoop	5	3636	July 6, 2017

How to fix guava version conflicts with Hadoop YARN classpath?

Related topics