Hi, according to this post by Eric Gaumer from two years ago, increasing CPU utilization requires increasing the number of shards. Is it now possible to use all 32 cores on an AWS c3.8xlarge for indexing while keeping only one shard per node?
With one shard, I can get to about 33% CPU usage (~12 cores).
With three shards, I can get to about 60% CPU usage (~18 cores).
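For reference, the shard count being compared here is fixed when an index is created, through the `number_of_shards` index setting. Below is a minimal sketch of building the request body for such a test. The helper name `index_settings` and the choice of zero replicas are assumptions for illustration only, not the configuration used for the numbers above.

```python
import json


def index_settings(shards: int, replicas: int = 0) -> str:
    """Build the JSON body for an Elasticsearch create-index request.

    `shards` maps to index.number_of_shards, which cannot be changed
    in place after the index exists. `replicas` defaults to 0 here as
    an assumption, to keep replica indexing out of the measurement.
    """
    body = {
        "settings": {
            "index": {
                "number_of_shards": shards,
                "number_of_replicas": replicas,
            }
        }
    }
    return json.dumps(body, indent=2)


if __name__ == "__main__":
    # Body for the three-shard run; send it with PUT /<index-name>.
    print(index_settings(3))
```

Generating the one-shard and three-shard bodies from the same helper keeps every other setting identical between runs, so a change in CPU usage can be attributed to the shard count.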
@lwintergerst Please see my updated question. With one shard, I thought indexing was IO-bound. But when I increased the shard count to three, CPU usage rose significantly (almost doubled: 33% -> 60%). Other indicators (index rate, IOPS, cluster load) also went up considerably.