Significance of two phases - "query then fetch" with default number of shards as 1

nages · May 21, 2020, 6:22pm

Is there any significance for search to be executed in two phases - "query then fetch" when the default number of shards as 1 ( starting from 7.x ) ? leaving the cases of considering replicas

Christian_Dahlqvist · May 21, 2020, 7:01pm

That is probably the only scenario where executing the query in two phases may not bring a lot of added benefits. I believe this is a quite rare scenario and likely one that generally performs quite well anyway, so I don't think it adds much overhead either. Optimizing this would thesefore likely bring very little benefit, but make the code more complex and difficult to maintain, which IMHO seems like a bad tradeoff.

nages · May 22, 2020, 2:03am

well. if we see 7.x, default number of shards is '1' , this decision was taken considering majority deployments were small / deployments with defaults. On the same note, does not it make sense to disable the strategy of doing aggregation in multiple places ( at least , once in data node and once in coordinating node) in the default scenarios

Christian_Dahlqvist · May 22, 2020, 2:59am

I do not think you would gain much as the same amount of work still need to be done, so it would add complexity for virtually no gain. If you have a small cluster it is also likely that the node serving the request also holds the shard which means there is not even a network hop to avoid in most cases, especially if you also use suitable preference setting.

I do not think this quite theoretical discussion around a very rare case which is usually very fast anyway is very useful so will leave it.

system · June 19, 2020, 2:59am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Question about some details for Query Phase Elasticsearch	4	444	May 7, 2020
IS "Fetch" phase really needed during seach Elasticsearch	8	1103	June 17, 2020
Why default number_of_shards is 1 in future release Elasticsearch	3	3163	December 6, 2018
ElasticSearch number of shards queried Elasticsearch	14	905	May 14, 2019
Optimal shards: 1 or number of nodes? Considerations Elasticsearch	10	5219	August 29, 2018

Significance of two phases - "query then fetch" with default number of shards as 1

Related topics