@rameshkr1994
Thank you for your reply, I am very happy to receive a reply.
Please forgive me for my bad English.
My data is indeed stored in ES, but why 10 shard == 2 map ?
i try to use 20 shard index , but start up 2 map
I saw the official document writes ' In short, roughly speaking more input splits means more tasks that can read at the same time, different parts of the source. More shards means more buckets from which to read an index content (at the same time). '.
you :split means more task:- but me:split and shards are same concept in ES.
you: More shards means more buckets but me : buckets concept is totally diff with shards. you:i try to use 20 shard index , but start up 2 map but me:what is map here your are getting number of map 2 while running the query from Hive?
your ES Cluster decides the number of shards and number of cluster nodes.
finally : you are applying the concept of Hadoop with ES.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.