Elasticsearch Result Size

Sivaprakash_Shanmuga · July 23, 2015, 5:52pm

Hi

I'm using Python Library to integrate with Elasticsearch

es = Elasticsearch([{'host': 'localhost', 'port': 9200}])
res = es.search(index="college", doc_type='people', body=doc)

It returns only 10 records where as I have 30K+ matching records

es = Elasticsearch([{'host': 'localhost', 'port': 9200}])
res = es.search(index="college", doc_type='people', body=doc, size = 30450)

This retrieves all records. But I want to retrieve all the matching records without specifying size parameter (I might know it upfront). How to get it?

pemontto · July 23, 2015, 10:39pm

You want to do a scan and scroll search. I've just had to do the same thing in python, it might look something like this for you

es = Elasticsearch([{'host': 'localhost', 'port': 9200}])
res = es.search(index="college", doc_type='people', body=doc, scroll='60s', search_type='scan')

results = []
scroll_size = res['hits']['total']

while (scroll_size > 0):
    try:
        scroll_id = res['_scroll_id']
        res = es.scroll(scroll_id=scroll_id, scroll='60s')
        results += res['hits']['hits']
        scroll_size = len(res['hits']['hits'])
    except: 
        break

You could also look at using the scan helper

Topic		Replies	Views
How to retrieve all records from index Elasticsearch	5	3392	December 13, 2018
Increase size limit - Python ElasticSearch Elasticsearch language-clients	8	991	March 22, 2021
Python ElasticSearch - How to increase query size limit Elasticsearch language-clients	5	6554	March 20, 2021
Need help with scan/scroll using elasticsearch-py client Elasticsearch	2	11289	April 11, 2017
Query result differ / Scan and scroll result in very low performance using Python API Elasticsearch	2	2294	September 15, 2017

Elasticsearch Result Size

Related topics