Why does using ?size=1000000 increase query latency 100X when totalHits is only 10

Nikita_Tovstoles · October 15, 2015, 7:44pm

I noticed that if I specify a very high size query param for the query below, the response latency increases 100x (from 2ms to 200-500ms) when running locally. The total doc count is ~1000, and the total hits for this particular query is only 10. Why the increase?

Thank you,

-nikita

{
  "query": {
    "bool": {
      "should": {
        "nested": {
          "query": {
            "term": {
              "brand.id": 551
            }
          },
          "path": "brand"
        }
      }
    }
  },
  "_source": false
}

nik9000 · October 15, 2015, 8:01pm

The array that collects the hits during the search phase is allocated up front, based on the requested size rather than on the number of hits. There is a workaround for this that only allocates min(total_docs_on_shard, requested_docs) up front, but it's still dangerous to do that, because if you did have a ton of documents then it'd be slow again.
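To make the point above concrete, here is a minimal Python sketch of a hit collector that sizes its buffer up front. It is only an illustration of the idea, not Elasticsearch's or Lucene's actual code: the function `collect_hits` and its parameters `requested_size` and `total_docs_on_shard` are hypothetical names made up for this example.

```python
def collect_hits(matching_doc_ids, requested_size, total_docs_on_shard=None):
    """Hypothetical collector: the buffer is allocated before any hit is seen."""
    if total_docs_on_shard is None:
        # Naive behaviour: allocate exactly what the request asked for.
        capacity = requested_size
    else:
        # Workaround: never allocate more slots than there are documents.
        capacity = min(total_docs_on_shard, requested_size)

    buffer = [None] * capacity  # cost grows with capacity, not with hit count
    count = 0
    for doc_id in matching_doc_ids:
        if count == capacity:
            break
        buffer[count] = doc_id
        count += 1
    return buffer[:count], capacity


ten_hits = list(range(10))

# size=1000000 with only 10 hits still allocates a million slots.
hits, capacity = collect_hits(ten_hits, requested_size=1_000_000)

# Capping by the shard's doc count (~1000 here) allocates 1000 slots instead.
capped_hits, capped_capacity = collect_hits(
    ten_hits, requested_size=1_000_000, total_docs_on_shard=1000
)
```

In both calls the same 10 hits come back; only the up-front allocation differs. The cap stops helping once the shard itself holds millions of documents, which is the danger mentioned above.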

Nikita_Tovstoles · October 15, 2015, 10:17pm

Thanks, Nik; makes sense.
