I use es-hive to execute two big table join, hive pull all data from es, it's very slow. I use jstack to print the process information:
I found it's very slow to pull data from es through http, how about transportclient? Is the transportclient faster than restclient when executing scroll query?
How can I read from searchresponse when I use transportClient to execute scroll? I mean that searchresponse.gethits return the native type(such as object,int etc), but it need writeable type in mapreduce framework?
can anyone help.