I've set up Hadoop and Spark 1.6 via CDH and added the latest Zeppelin. In Zeppelin I've included the latest es-hadoop binding and am now trying to load some data from my ES cluster. es-hadoop retrieves the mapping and issues the query, but it deletes the scroll ID immediately afterwards without ever fetching any data. As a result, I end up with the schema but no data in Zeppelin.
I'm really out of ideas here, can anyone help?! Thank you!
Here are some of my queries:
var sql = new org.apache.spark.sql.SQLContext(sc)
"es.nodes" -> "my.host.name",
"es.read.field.include" -> "host")).registerTempTable("logs")
z.show(sql.sql("select count(host) from logs"))
Or even simpler:
This returns a nice table with a bunch of columns, but it is empty.
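For reference, here is a self-contained sketch of what the first query appears to be doing, written for a Zeppelin Spark paragraph. Parts of the pasted snippet did not survive, so the details are assumptions: the `esDF` call comes from es-hadoop's Spark SQL support and may not be the call actually used, and `my-index/my-type` is a placeholder resource, not the real index. `sc` and `z` are the SparkContext and ZeppelinContext that Zeppelin provides. The `count()` is there only to force a read outside of Zeppelin's table rendering, which helps tell "no rows came back from ES" apart from "Zeppelin shows nothing".

```scala
import org.apache.spark.sql.SQLContext
import org.elasticsearch.spark.sql._ // es-hadoop Spark SQL support; adds esDF to SQLContext

// sc is the SparkContext injected by Zeppelin
val sqlContext = new SQLContext(sc)

// Assumption: "my-index/my-type" is a placeholder, not the resource from the post
val logs = sqlContext.esDF("my-index/my-type", Map(
  "es.nodes" -> "my.host.name",
  "es.read.field.include" -> "host"))

// The schema is derived from the ES mapping, so it can print even if no rows arrive
logs.printSchema()

// Forces an actual scroll read, independent of how Zeppelin displays results
println(logs.count())

logs.registerTempTable("logs")
z.show(sqlContext.sql("select count(host) from logs"))
```

If `count()` already prints 0 here, the problem lies in the read from ES rather than in Zeppelin's display.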