ES loses a portion of the data when I import JSON via the Python 3 helpers.bulk, specifying _id explicitly

Elasticsearch version: 2.3.3
This is my JSON file (one document per line):
{"_source": {"bd_id": 12345, "date_type": "BEFORE_SEVEN_DAY"}, "_score": 7.5887613, "_id": "153380527BEFORE_SEVEN_DAY", "_index": "ex_data_1", "_type": "ex_shop"}
{"_source": {"bd_id": 1888, "date_type": "BEFORE_SEVEN_DAY"}, "_score": 7.5887613, "_id": "151008189BEFORE_SEVEN_DAY", "_index": "ex_data_1", "_type": "ex_shop"}

And my code:

# open json file
print('import data begin...')
ip_begin = time.time()
with open(self.re_index + '_data.json', 'r', encoding='utf-8') as e:
    actions = deque()
    for line in e:
        doc = json.loads(line)  # parse each line once instead of four times
        actions.append({
            '_op_type': 'index',
            '_index': doc['_index'],
            '_type': doc['_type'],
            '_id': doc['_id'],
            '_source': doc['_source'],
        })
for success, info in elasticsearch.helpers.parallel_bulk(es, actions, thread_count=50):
    if not success:
        print('Doc failed', info)
print('import data end...\n\t total consuming time: ' + str(time.time() - ip_begin) + 's')

Here is the output of a run:
import data begin...
import data end...
total consuming time:19.1480000019073486s

What is wrong with the code, or with helpers.bulk?
I need help!
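One likely culprit worth ruling out: since the docs are indexed with an explicit _id, any duplicate _id values in the dump silently overwrite each other, so the final document count comes out lower than the line count. Below is a minimal sketch (the file layout from above is assumed, the function name is hypothetical) that counts duplicate _id values in the source file:

```python
import json
from collections import Counter

def count_duplicate_ids(path):
    """Return {_id: occurrences} for every _id that appears more than once.

    Two 'index' operations with the same _id leave only one document in ES,
    so duplicates in the source file show up as "lost" data after import.
    """
    ids = Counter()
    with open(path, encoding='utf-8') as f:
        for line in f:
            line = line.strip()
            if line:  # skip blank lines in the dump
                ids[json.loads(line)['_id']] += 1
    return {doc_id: n for doc_id, n in ids.items() if n > 1}
```

If this returns a non-empty dict, the "missing" documents were never lost by Elasticsearch; they were overwritten by later lines carrying the same _id.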

Any ideas? Still stuck on this.

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.