'type': 'illegal_argument_exception', 'reason': 'cannot parse empty date'}

Saurabh_Sharma_IIT · August 7, 2019, 1:00pm

Hi, I am using bulk api to inject pandas dataframe into elasticsearch index. I first converted dataframe to dict and then used bulk.
In my .csv data file , I have a column name "start_date", I have converted it into datetime using to_datetime and it has some empty rows, when I used bulk. I got this error:

'type': 'illegal_argument_exception', 'reason': 'cannot parse empty date'

After that I converted my empty rows i.e '' to pd.NaT using replace function, but still I am getting the same error.
Please help how to resolve this issue, and also, Is NaT while injecting into elasticsearch has some issue to be taken care of?
Thanks

ftr · August 7, 2019, 7:12pm

It is unclear what the question is here: If you try to insert the empty string (or pd.NaT for that matter) in a date field, then it will complain that the input is not a date - which is indeed true.

The question for you is: What do you want elasticsearch to do for the lines where you have no data for "start_date"? One possible answer could be to just not have the field for the relevant documents - if that is what you want, then you must delete the "start_date" key from your dict for these documents: While elasticsearch will croak at attempting to index the dict {"other_data" : "blabla", "start_date": ""} (for example), it will be quite happy with the dict {"other_data" : "blabla"}, even if other documents do have the "start_date" field...

But, ultimately, the answer depends on what you want from a solution.

Saurabh_Sharma_IIT · August 8, 2019, 6:17am

Hi @ftr , thanks for your response. I am assuming that in missing date rows if I put pd.NaT, so then when I inject into elasticsearch I should not get any error. Is the assumption correct?

ftr · August 8, 2019, 8:10am

No. You must make certain that, for every row in your input data, one or the other of the following statements is true:

There is no key named "start_date" in the dictionary representing the row

or

The value for the key "start_date" is a valid date, as defined by your mapping (defaults to, IIRC, ISO8601 or milliseconds since the epoch)

If any row in your indata does not fulfill one of these, your ingestion will fail.

Saurabh_Sharma_IIT · August 8, 2019, 8:49am

Hi @ftr thanks for response. I understood what you have explained, and I made changes in my python code accordingly and it's working.
Thanks
Saurabh

system · September 5, 2019, 8:49am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Best way to manage missing data of date type while using bulk to inject data into elasticsearch Elasticsearch	4	3552	September 8, 2019
Python bulk insert customize date parser Elasticsearch language-clients	5	1072	February 11, 2022
Regarding illegal_argument_exception and empty value Logstash	6	1040	July 19, 2019
Resulting Error while indexing date values in elasticsearch? Elasticsearch	2	842	December 27, 2016
"type"=>"illegal_argument_exception", "reason"=>"cannot parse empty date"} in logstash logs Elasticsearch	3	412	October 21, 2022

'type': 'illegal_argument_exception', 'reason': 'cannot parse empty date'}

Related topics