Slow results retrieval

TDZ · November 17, 2018, 1:24pm

So I have 5 billion docs on 5 shards in 1 index, all on one machine; each shard has 1 segment.
ES is running on 4 CPUs with 26GB RAM and 18GB of heap for ES.
Each doc has 4 ints and 2 floats.
I'm running a query that uses a range query on one field and a terms query on another; both are in the filter part of a bool query.
Then I try to retrieve 500k results using the scroll API with 5 slices (I have 5 threads running at the same time, one per slice). I'm fetching only one int field using _source for each result, with a 9K page size, and I'm using the transport client for Java.
It's taking me ~50 seconds... Does that make sense, or am I doing something wrong?
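
For readers trying to reproduce the setup, the request described above might look roughly like the sketch below. It only builds the JSON search bodies, one per slice, and does not talk to a cluster. The field names (`int_field_a`, `int_field_b`, `float_field_x`) and the filter values are hypothetical, since the post does not give the mapping. The `_doc` sort is an addition that is not mentioned in the post; it is the sort order the scroll documentation describes as cheapest.

```python
import json


def sliced_scroll_body(slice_id, max_slices, page_size=9000):
    """Build the search body for one slice of a sliced scroll.

    All field names and filter values are placeholders, not taken from the post.
    """
    return {
        "slice": {"id": slice_id, "max": max_slices},
        "size": page_size,                # the 9K page size from the post
        "_source": ["int_field_a"],       # fetch a single int field only
        "sort": ["_doc"],                 # assumption: not stated in the post
        "query": {
            "bool": {
                "filter": [
                    {"range": {"float_field_x": {"gte": 0.0, "lt": 1.0}}},
                    {"terms": {"int_field_b": [1, 2, 3]}},
                ]
            }
        },
    }


# One body per worker thread; each would be sent as the initial scroll search.
bodies = [sliced_scroll_body(i, 5) for i in range(5)]
print(json.dumps(bodies[0], indent=2))
```

Each body would be posted to the index's `_search` endpoint with a `scroll` timeout, and each thread would then page through its own scroll ID.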

Christian_Dahlqvist · November 18, 2018, 11:37am

It is generally recommended to give no more than 50% of available RAM to heap. Elasticsearch requires off-heap memory for optimal performance.
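
As a rough illustration of the 50% guideline applied to the 26GB machine from the first post, here is a minimal sketch. The 31GB cap is an assumption added here, reflecting the common advice to keep the heap below the compressed object pointer threshold; it is not part of the reply above.

```python
def suggested_heap_gb(total_ram_gb, cap_gb=31):
    """Half of RAM, capped at cap_gb (the cap is an assumption, see lead-in)."""
    return min(total_ram_gb / 2, cap_gb)


heap = suggested_heap_gb(26)
# JVM options that would correspond to this heap size
print(f"-Xms{int(heap)}g -Xmx{int(heap)}g")  # prints "-Xms13g -Xmx13g"
```

With 18GB of heap on a 26GB machine, only about 8GB is left for the OS and the filesystem cache; at 13GB the split is even.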

What if you instead fetch the full document so Elasticsearch does not need to parse it?

TDZ · November 19, 2018, 8:13pm

Done both, didn't change anything.

Christian_Dahlqvist · November 19, 2018, 8:49pm

What do disk I/O and iowait look like during retrieval?
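
One way to get an iowait number, assuming the node runs on Linux, is to compare two snapshots of the aggregate `cpu` line in `/proc/stat` taken before and after the retrieval. The sketch below is only an illustration; the sample counter values are made up, and tools such as `iostat` or `top` report the same figure.

```python
def read_cpu_line(path="/proc/stat"):
    """Return the aggregate 'cpu' line (Linux only)."""
    with open(path) as f:
        return f.readline()


def iowait_fraction(before, after):
    """Share of CPU time spent in iowait between two 'cpu' lines.

    Uses the first eight counters: user, nice, system, idle, iowait,
    irq, softirq, steal. Guest counters are skipped because they are
    already included in user and nice.
    """
    b = [int(x) for x in before.split()[1:9]]
    a = [int(x) for x in after.split()[1:9]]
    deltas = [y - x for x, y in zip(b, a)]
    total = sum(deltas)
    return deltas[4] / total if total else 0.0


# Made-up snapshots for illustration only
before = "cpu  1000 0 500 8000 100 0 50 0 0 0"
after = "cpu  1400 0 700 8800 300 0 50 0 0 0"
print(iowait_fraction(before, after))  # 0.125
```

In practice one would call `read_cpu_line()` just before starting the scroll and again right after it finishes.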

TDZ · November 19, 2018, 9:21pm

Don't know about iowait, but I assume it's not a problem since I'm on an SSD. I/O is 300 IOPS and ~38MB/s in spikes; that's the highest spike.
But I think that's beside the point. I want to use the above-mentioned search for real-time use, so I wonder what a reasonable expectation is. For example, can ES provide 100k results in 2-3 seconds? What about 500k?
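
To put the numbers from this thread side by side, here is a back-of-the-envelope calculation. It assumes the five slices are evenly balanced and run fully in parallel, which the thread does not confirm.

```python
import math

total_docs = 500_000
slices = 5
page_size = 9_000
elapsed_s = 50

# Observed throughput across all slices
observed_docs_per_s = total_docs / elapsed_s                       # 10000.0

# Round trips per slice, assuming evenly sized slices
pages_per_slice = math.ceil(total_docs / slices / page_size)       # 12
seconds_per_page = elapsed_s / pages_per_slice                     # about 4.17

# Throughput needed for the targets asked about (3 s and 2 s bounds)
needed_100k = (100_000 / 3, 100_000 / 2)   # about 33,333 to 50,000 docs/s
needed_500k = (500_000 / 3, 500_000 / 2)   # about 166,667 to 250,000 docs/s

print(observed_docs_per_s, pages_per_slice, round(seconds_per_page, 2))
print([round(x) for x in needed_100k], [round(x) for x in needed_500k])
```

Under those assumptions, each 9K page takes roughly 4 seconds, and the 100k target would need about 3 to 5 times the observed throughput, while the 500k target would need about 17 to 25 times.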
