I plan to rely heavily on the NRT (near-real-time) feature for indexing
documents, adding roughly 1,000 documents at a time via bulk index.
I'd like to know:
1. Does bulk indexing performance depend on the size of the index
   (the number of documents already indexed)? If so, does it degrade
   linearly or sub-linearly?
2. Does indexing performance depend on the number of shards (nodes)?
3. Does indexing performance depend on the number of replicas?
4. Should I expect significant spikes in indexing latency (because of
   internal ES re-allocations, merges, etc.)? What is a "normal"
   standard deviation for the time between "start bulk index" and "can
   query the new docs in search"?
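
To make the last question concrete, this is roughly how I would measure
that delay. It is only a sketch in Python: `bulk_index` and
`is_searchable` are placeholder callables I made up (they would wrap
whatever client calls send the bulk request and run the search), not part
of any Elasticsearch API, and the polling interval and timeout are
arbitrary.

```python
import statistics
import time


def measure_visibility_latency(bulk_index, is_searchable,
                               poll_interval=0.05, timeout=30.0):
    """Seconds from starting a bulk request until its docs are searchable.

    bulk_index and is_searchable are hypothetical callables supplied by
    the caller; this function only does the timing and polling.
    """
    start = time.monotonic()
    bulk_index()
    # Poll the search side until the newly indexed documents show up.
    while not is_searchable():
        if time.monotonic() - start > timeout:
            raise TimeoutError("documents did not become searchable in time")
        time.sleep(poll_interval)
    return time.monotonic() - start


def summarize(latencies):
    """Mean, sample standard deviation, and worst case of the latencies."""
    return {
        "mean": statistics.mean(latencies),
        "stdev": statistics.stdev(latencies) if len(latencies) > 1 else 0.0,
        "max": max(latencies),
    }
```

Running `measure_visibility_latency` once per bulk request and passing the
collected values to `summarize` would give the mean, standard deviation,
and worst-case spike I am asking about. The measured value is only as
precise as the polling interval.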
Sorry for the basic questions; I do not know Lucene well. Thank you.