Getting throttled by _msearch

doron · January 20, 2020, 2:30pm

My scenario is as follows: I want to run a very large number of queries and I want to fully utilize my cluster (40 data nodes X 16 CPUs).

I am batching my queries (400 per batch) and sending them via _msearch, however it seems I'm getting throttled. The cluster CPUs hardly get utilized, and I simply cannot get past ~10 seconds for 400 queries, no matter how I play with the max_concurrent_searches and max_concurrent_shard_requests parameters. The took values per each query simply increase as I increase the concurrency, but the total time remains the same.

Any idea what I'm doing wrong here? I am using ES 6.5.3.

Thanks.

Christian_Dahlqvist · January 20, 2020, 2:55pm

How large is your data set? Can it all fit in the operating system page cache? If not, it is quite possible that you are limited by disk I/O and not CPU.

doron · January 21, 2020, 5:17am

The dataset doesn't fit in memory, however I don't think I'm I/O limited here. I've been I/O limited in ES before, and usually the nodes' load factor shoot through the roof. In this case it is staying pretty idle. In addition, the very same cluster is able to reach better throughput in production, when using the regular search endpoint, so it feels like this is related to _msearch.

system · February 18, 2020, 5:17am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How to speed up msearch queries? Elasticsearch	6	500	March 25, 2020
2.3.2 Java client : is there any way to throttle an _msearch request? Elasticsearch	1	394	February 20, 2018
ES not returning results when doing multi-thread msearch Elasticsearch	11	641	August 10, 2018
Slow search response time (low CPU utilization) Elasticsearch	7	3398	July 31, 2019
Performance issues with search Elasticsearch	1	292	September 9, 2019

Getting throttled by _msearch

Related topics