If indexing is I/O bound, you should investigate giving more memory to the filesystem cache (see above) or buying faster drives. In particular SSD drives are known to perform better than spinning disks....
I've been experimenting with indexing on SSD drives, but I'm not seeing a significant improvement in indexing speed. That leads me to suspect that my indexing is not I/O bound.
How can I tell for sure whether indexing is I/O bound?
Also, are there any recommended settings for indexing when backed by SSD drives?
On our regular HDD cluster, sar shows that during indexing CPU utilization averages 15% and the CPU is blocked on disk I/O (iowait) about 1.5% of the time. These measurements include search traffic.
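For anyone who wants to reproduce this kind of measurement without sar, here is a minimal sketch that samples the aggregate `cpu` line of `/proc/stat` twice and reports each counter's share of the elapsed CPU time. It assumes a Linux host; the function names (`parse_cpu_line`, `cpu_breakdown`, `sample`) and the 5 second interval are my own choices for illustration, not part of any tool. Note that iowait only counts time the CPU sat idle while I/O was outstanding, so it is a hint rather than proof either way.

```python
import time

# Order of the first eight counters on the aggregate "cpu" line of /proc/stat.
FIELDS = ("user", "nice", "system", "idle", "iowait", "irq", "softirq", "steal")


def parse_cpu_line(stat_text):
    """Return the aggregate CPU counters found in the text of /proc/stat."""
    for line in stat_text.splitlines():
        parts = line.split()
        if parts and parts[0] == "cpu":
            values = [int(v) for v in parts[1:1 + len(FIELDS)]]
            return dict(zip(FIELDS, values))
    raise ValueError("no aggregate 'cpu' line found")


def cpu_breakdown(before, after):
    """Convert two counter snapshots into percentages of elapsed CPU time."""
    deltas = {name: after[name] - before[name] for name in before}
    total = sum(deltas.values())
    if total <= 0:
        raise ValueError("snapshots are identical or out of order")
    return {name: 100.0 * delta / total for name, delta in deltas.items()}


def sample(interval=5.0, path="/proc/stat"):
    """Take two snapshots `interval` seconds apart (Linux only)."""
    with open(path) as f:
        first = parse_cpu_line(f.read())
    time.sleep(interval)
    with open(path) as f:
        second = parse_cpu_line(f.read())
    return cpu_breakdown(first, second)


if __name__ == "__main__":
    pct = sample()
    busy = pct["user"] + pct["nice"] + pct["system"]
    print(f"cpu busy: {busy:.1f}%  iowait: {pct['iowait']:.1f}%")
```

Running it on a node while indexing is in progress gives figures comparable to the sar averages quoted above; per-device utilization (for example from `iostat -x`) would still be needed to see whether a particular disk is saturated.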