Indexing Performance vs Document Size

  • ES: 2.3.2, Lucene 5.5.0
  • OS: Ubuntu 14.04.1
  • Java: 8u73
    java version "1.8.0_73"
    Java(TM) SE Runtime Environment (build 1.8.0_73-b02)
    Java HotSpot(TM) 64-Bit Server VM (build 25.73-b02, mixed mode)
  • _all is disabled
  • _source is disabled
  • Fields are explicitly mapped
  • Same configuration and hardware for both runs; the only change is document size (1K vs 30K)
  • CPU utilization is close to 100%
  • The difference in throughput between the two document sizes is surprising (3 MB/s vs 20 MB/s)
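
To compare the two runs per document rather than per byte, the throughput figures can be converted to documents per second. This is only a rough sketch: it assumes the figures are listed in the same order as the sizes (1K documents at 3 MB/s, 30K documents at 20 MB/s) and that 1 MB = 1024 KB. Neither is stated above, and docs_per_second is a hypothetical helper name.

```python
def docs_per_second(throughput_mb_s, doc_size_kb):
    """Convert an indexing throughput in MB/s to documents per second."""
    kb_per_mb = 1024  # assumption: binary units (1 MB = 1024 KB)
    return throughput_mb_s * kb_per_mb / doc_size_kb


# Assumed pairing, not confirmed by the report: 1K docs at 3 MB/s, 30K docs at 20 MB/s.
small = docs_per_second(3, 1)
large = docs_per_second(20, 30)

print(f"1K docs:  {small:.0f} docs/s")
print(f"30K docs: {large:.0f} docs/s")
print(f"ratio:    {small / large:.1f}x")
```

Under that pairing, the 1K run indexes several times more documents per second while moving far fewer bytes. That would be consistent with a fixed per-document cost, though these numbers alone do not show where the CPU time goes.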