We currently index our documents in bulk using the PHP client. On average, our indexing times (including the network round trip) are:
1,000 documents in 4 seconds
10,000 documents in 80 seconds
If we index in parallel, the indexing times are much longer, so that does not seem to be a solution.
Details:
Index: 1
Shard: 1
Node: 1
Memory: 1 GB
Storage: 16 GB
Documents: 2,000,000
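
To put the timings above in perspective, here is a quick calculation of the throughput they imply. It is only a rough sketch in Python: it assumes bulk requests are sent one after another at a constant rate and that all 2,000,000 documents have to be indexed, both of which are simplifications. The names `timings`, `docs_per_second` and `hours_for_full_index` are made up for this illustration.

```python
# Throughput implied by the two average timings reported above.
# Keys are documents per bulk request, values are seconds per request.
timings = {1_000: 4, 10_000: 80}

# Total document count from the details list (assumed to be the full workload).
TOTAL_DOCS = 2_000_000


def docs_per_second(batch_size: int, seconds: float) -> float:
    """Average indexing rate for a single bulk request."""
    return batch_size / seconds


def hours_for_full_index(rate: float, total_docs: int = TOTAL_DOCS) -> float:
    """Time to index everything at a constant rate, sequentially (assumption)."""
    return total_docs / rate / 3600


rates = {size: docs_per_second(size, secs) for size, secs in timings.items()}

for size, rate in rates.items():
    hours = hours_for_full_index(rate)
    print(f"{size} docs per request: {rate:.0f} docs/s, about {hours:.1f} h in total")
```

Under those assumptions this works out to about 250 docs/s for requests of 1,000 documents and 125 docs/s for requests of 10,000 documents, so the larger requests are slower per document, not faster.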
Our server is hosted by TransIP (Netherlands), which has a 100 Gb/s network connection. The Elasticsearch cluster is cloud-based (Ireland).
How can we speed up indexing our documents?