ES Python API behaves strangely when used for search right after ingestion

mehdi-gital · May 10, 2019, 4:01pm

I run a search right after ingesting documents through my Python client and get an empty response, even though I can see in Kibana that everything was ingested correctly. Here's a simple demo:

from elasticsearch import Elasticsearch, helpers

es_host = 'localhost'
es_port = '9200'

es = Elasticsearch([{'host': es_host, 'port': es_port}])

bulk_data = []

for i in range(5):
    obj = {}

    obj['_index'] = 'test_ind'
    obj['_type'] = 'my_type'
    obj['a'] = i

    bulk_data.append(obj)

bulk_data = iter(bulk_data)
helpers.bulk(es, bulk_data)

body = {
  "query": {
    "match_all": {}
  }
}

print es.search('test_ind', body)

The above code does the ingestion, but the search returns nothing. However, when I run the search from a separate script, it works as expected.

from elasticsearch import Elasticsearch

es_host = 'localhost'
es_port = '9200'

es = Elasticsearch([{'host': es_host, 'port': es_port}])

body = {
  "query": {
    "match_all": {}
  }
}

print es.search('test_ind', body=body)

Is there a conceptual reason as to why the first approach won't work or is this just a newbie goof on my part?

PS. Not sure if this matters, but I'm using Python 2.7 and Elasticsearch 5.5 (I know!!).

loren · May 10, 2019, 7:47pm

After you do helpers.bulk(es, bulk_data), you need to either wait a full second (the default refresh_interval setting), send an explicit refresh with the bulk request, or use the Refresh API before those docs will be returned from a search run so soon after you index them.
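
To make the reply above concrete, here is a minimal sketch of the Refresh API option: refresh the index between the bulk request and the search. The helper names make_actions and index_and_search are made up for illustration. es is assumed to be an Elasticsearch client and bulk is assumed to be helpers.bulk; both are passed in as arguments so the function has no hard dependency on a running cluster.

```python
def make_actions(index_name, doc_type, count):
    """Build bulk actions shaped like the ones in the question."""
    return [
        {"_index": index_name, "_type": doc_type, "a": i}
        for i in range(count)
    ]


def index_and_search(es, bulk, index_name, actions):
    """Bulk-index, force a refresh, then run a match_all search.

    es   -- assumed to be an elasticsearch.Elasticsearch client
    bulk -- assumed to be elasticsearch.helpers.bulk
    """
    bulk(es, actions)
    # Make the newly indexed documents visible to search right away
    # instead of waiting for the periodic refresh.
    es.indices.refresh(index=index_name)
    # Keyword arguments avoid any ambiguity about positional order.
    return es.search(index=index_name, body={"query": {"match_all": {}}})
```

With the client from the question this would be called as index_and_search(es, helpers.bulk, 'test_ind', make_actions('test_ind', 'my_type', 5)). As far as I know, helpers.bulk also forwards extra keyword arguments to the underlying bulk call, so helpers.bulk(es, actions, refresh=True) should cover the "explicit refresh with the bulk request" option, but treat that as an assumption to check against your client version.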

mehdi-gital · May 13, 2019, 2:18pm

Hey thanks for your reply!!!
I had never looked into this but now it makes a lot of sense.
I tried to follow your pointers and added

time.sleep(2)
es.indices.refresh('test_ind')
time.sleep(2)

after the ingestion step, but it's not making a difference. I'll figure it out by trial and error though.
I remember I had similar issues with Rollover too ...
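
If a fixed sleep plus a refresh still returns nothing, one way to narrow it down is to check how many documents the bulk call reported as indexed and then poll the document count instead of sleeping blindly. The sketch below is only an illustration: wait_until_searchable and its timeout and interval parameters are hypothetical names, and es is assumed to be a client exposing indices.refresh and count the way the official Python client does.

```python
import time


def wait_until_searchable(es, index_name, expected, timeout=10.0, interval=0.5):
    """Refresh and poll the index until `expected` docs are countable.

    Returns the count seen, or raises RuntimeError once `timeout`
    seconds have passed without reaching `expected`.
    """
    deadline = time.time() + timeout
    while True:
        # Ask for a refresh so recently indexed docs become visible.
        es.indices.refresh(index=index_name)
        seen = es.count(index=index_name)["count"]
        if seen >= expected:
            return seen
        if time.time() >= deadline:
            raise RuntimeError(
                "only %d of %d docs visible in %s after %.1fs"
                % (seen, expected, index_name, timeout)
            )
        time.sleep(interval)
```

A possible use is ok, errors = helpers.bulk(es, bulk_data) followed by wait_until_searchable(es, 'test_ind', ok). If the count does reach the expected number and the search is still empty, the refresh is probably not the issue and the search call itself deserves a look. In particular, the positional order of the search arguments is not the same across client versions, so es.search('test_ind', body) may not be treated as a body in every release, whereas body=body is unambiguous. I have not verified that against 5.5.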

