Multi process search performance

Sunho_Lee · April 8, 2020, 10:52am

Hi everybody.
I have strange results that multi-process search too slow.
I have tested that 35 billion documents in 1 index vs 3.5 billion documents in 10 index. I used 7.6 version elasticsearch. and 2 data nodes (52 core, 62GB memory).

search query to 35 billion document index(50 shards)
search query to 3.5 billion documents in 10 index(5 shards per index) by a comma-separated like ["index_A", "index_B", ... ]
search query to 3.5 billion documents in 10 index(5 shards per index) by python client with pool.starmap() using 10 processes. each process query to each index

I expected the 3rd result to be ten times better than the 1st result. because of the smaller is faster. but the three results almost the same. I can't understand that result.

Can you explain why this happens?
I profiled the API. 'build_scorer' used almost elapsed time. as the number of processes increased, the 'build_scorer' also increased at 3rd experiment. why?

system · May 6, 2020, 10:52am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
ES not returning results when doing multi-thread msearch Elasticsearch	11	760	August 10, 2018
Performance 1 shard / 1 node vs 5 shards / 5 nodes Elasticsearch	5	411	July 6, 2017
Elasticsearch performs slowly when data size increased Elasticsearch	3	942	March 21, 2017
After 10million docs indexed the insert /search operation become slower and slower Elasticsearch	1	332	July 6, 2017
Concurrent Search in elasticsearch Elasticsearch	7	2221	July 5, 2017

Multi process search performance

Related topics