We are importing data into an Elasticsearch cluster, across a few indices, around
At the same time, we care about search on the existing indices. A few of them are small (~100 MB), and a few of them are big-
In order to optimize indexing, we:
- use the bulk API with an optimized bulk size;
- set refresh interval to
- set replication factor to
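
As a rough sketch, the steps above correspond to request bodies like the ones built below. The index name `my-index`, the document fields, and the setting values (`"-1"` for the refresh interval, `0` replicas) are illustrative placeholders only, not the values we actually use. The helper names are hypothetical as well.

```python
import json

# Hypothetical index name, used only for illustration.
INDEX = "my-index"


def settings_body(refresh_interval, replicas):
    """Build the JSON body for PUT /<index>/_settings."""
    return {
        "index": {
            "refresh_interval": refresh_interval,
            "number_of_replicas": replicas,
        }
    }


def bulk_body(index, docs):
    """Build the NDJSON body for POST /_bulk.

    Each document becomes two lines: an action line and a source line.
    """
    lines = []
    for doc in docs:
        lines.append(json.dumps({"index": {"_index": index}}))
        lines.append(json.dumps(doc))
    # The bulk API expects the body to end with a newline.
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Placeholder values: "-1" disables refresh, 0 means no replicas.
    print(json.dumps(settings_body("-1", 0)))
    print(bulk_body(INDEX, [{"field": "a"}, {"field": "b"}]), end="")
```

The bodies are only constructed here, not sent, so the sketch does not assume any particular client library.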
Now we are trying to understand how merge throttling can help. How are search and segment merging related if we search only against existing indices?
According to this article, we can disable merge throttling.
- Does that mean merges will "eat" disk I/O?
- Does that mean merges won't happen at all and we have to call _forcemerge manually after indexing is done? Should we be worried about max open file descriptors in that case?
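
For concreteness, the manual merge we have in mind would be a request along the lines sketched below. The index name and the `max_num_segments` value are placeholders, and the helper function is hypothetical. It only builds the request path for the force merge endpoint and does not send anything.

```python
from urllib.parse import urlencode


def forcemerge_path(index, max_num_segments=None):
    """Build the path for POST /<index>/_forcemerge.

    max_num_segments is optional; when given, it is added as a query parameter.
    """
    path = f"/{index}/_forcemerge"
    if max_num_segments is not None:
        path += "?" + urlencode({"max_num_segments": max_num_segments})
    return path


if __name__ == "__main__":
    # Placeholder index name and segment count, for illustration only.
    print("POST " + forcemerge_path("my-index", max_num_segments=1))
```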
Very confused here, any help is highly appreciated.