Incomplete results for scan / scroll searches

Jan_Fiedler · November 7, 2011, 9:23am

I have a little (Java) based test playing with the scan / scroll API. It is
basically working but I am always missing a single element in the last
scroll response. This is what I am doing conceptually:

Index 1000 documents and run a index refresh (index has default of 5
shards)
Run 'matchAll' search with search type = 'scan', size=100, timeout of
30 seconds
Run a loop of scroll request with the previous scroll id until I get
no more hits

I found that the number of total hits is correctly reported as 1000. The
first scroll response carries the expected 500 hits (number of shards *
100). However the second (last) scroll response only has 499 hits. It seems
that the last document is missing. Has someone observed similar issues ?

Jan_Fiedler · November 7, 2011, 11:05am

Update

The problem was most likely on my side. I expected each scroll to have 500
elements. What I actually get for the 1000 elements is 3 responses: 1 = 500
hits, 2 = 499 hits, 3 = 1 hit. I get all 1000 elements so there seems to be
no bug. I still find the sizes a little strange ... maybe someone can shed
some light on this.

Clinton_Gormley · November 7, 2011, 12:03pm

Hi Jan

The problem was most likely on my side. I expected each scroll to have
500 elements. What I actually get for the 1000 elements is 3
responses: 1 = 500 hits, 2 = 499 hits, 3 = 1 hit. I get all 1000
elements so there seems to be no bug. I still find the sizes a little
strange ... maybe someone can shed some light on this.

If you request $size results when scanning, it gives you a maximum of
$size results from each shard.

So if $size == 10, and you have 5 shards, you could get a maximum of 50
results. The actual number will vary, depending on which shards contain
the documents.

For example, if you have a total of 25 documents, on two shards, but 20
of them are on shard 1, then you would get:

first request: 15 results:
- 10 from shard 1
- 5 from shard 2
second request: 10 results
- 10 from shard 1

clint

Topic		Replies	Views
Scroll and Scan Elasticsearch	4	445	July 6, 2017
Unexpected number of hits with scroll search Elasticsearch	3	496	July 5, 2017
Scan with fields and size parameter not returning expected result Elasticsearch	2	2028	July 5, 2017
Issues with scan and scroll as well as count API Elasticsearch	5	1880	July 5, 2017
How does size in a scan and scroll work? Elasticsearch	1	987	October 23, 2018

Incomplete results for scan / scroll searches

Related topics