Elastic Team Member
Sep 2014
Hiya
OK I see where the confusion is coming in. I used the word asynchronously
in slightly different contexts there. I will try to reword in the
Definitive Guide.
Replication is sync by default, in other words: the primary waits for
indexing to happen on the replica before it returns to the user. That
said, lots of these processes happen at the same time, so sending the
document to the replica is asynchronous. It doesn't send a change then
wait for the response before sending the next one. This all happens in
parallel.
It depends. Each bulk is processed one-by-one on each shard, but parallel bulks are processed in parallel. This has nothing to do with async vs sync IO.
The translog is written using synchronous IO.
All files (including translogs) are written using synchronous IO.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.