What happens under the hood for a bulk index call in Elasticsearch?

I'm tuning my ingestion pipeline to maximize throughput. I've read through the recommendations in the reference guide on tuning for indexing speed (https://www.elastic.co/guide/en/elasticsearch/reference/master/tune-for-indexing-speed.html).

With all the different settings to experiment with, I find myself wondering what is going on behind the scenes during a bulk index call. What stages are there, how is work divided up, and what is parallelized?

Despite its age, the Definitive Guide might help a bit here. Take a look at https://www.elastic.co/guide/en/elasticsearch/guide/current/distrib-multi-doc.html#bulk-format
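To make the point of that section a bit more concrete, here is a rough Python sketch of why the bulk format is newline-delimited: the node receiving the request only needs to read each small action line to decide which shard an operation belongs to, and can pass the source line along untouched. This is an illustration, not Elasticsearch's code. The function names are made up, it only handles `index` actions, and `zlib.crc32` is a stand-in for Elasticsearch's own routing hash, so the shard numbers will not match a real cluster.

```python
import json
import zlib
from collections import defaultdict


def build_bulk_body(docs, index):
    """Build an NDJSON bulk body: one action line, then one source line, per doc."""
    lines = []
    for doc_id, source in docs:
        lines.append(json.dumps({"index": {"_index": index, "_id": doc_id}}))
        lines.append(json.dumps(source))
    # The bulk body must end with a newline.
    return "\n".join(lines) + "\n"


def group_by_shard(body, num_primary_shards):
    """Split a bulk body into per-shard groups by reading only the action lines.

    Assumption: crc32 stands in for the real routing hash, and every
    action is an `index` action followed by exactly one source line.
    """
    lines = body.splitlines()
    groups = defaultdict(list)
    for action_line, source_line in zip(lines[0::2], lines[1::2]):
        meta = json.loads(action_line)["index"]
        shard = zlib.crc32(meta["_id"].encode("utf-8")) % num_primary_shards
        # The source line is forwarded as-is; it is never parsed here.
        groups[shard].append((action_line, source_line))
    return dict(groups)


docs = [(str(i), {"field": i}) for i in range(6)]
body = build_bulk_body(docs, "my-index")
per_shard = group_by_shard(body, num_primary_shards=3)
```

Each per-shard group can then be sent to its primary shard independently, which is where the parallelism across shards comes from; within one shard the operations in a group are applied in order.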
