Uploading csv file: Failed to parse date field dd/MM/yyyy HH:mm

Hi,
I'm uploading a CSV file (with ; as the separator). I identified one of the columns as Machine Time. The format in Excel is "customized", as seen in the image below:

To parse the date as a date type on Elastic I used the date filter:

     date {
       match  => ["Machine Time", "dd/MM/yyyy HH:mm", "ISO8601", "dd/MM/yyyy HH:mm:ss"]
       target => "Machine Time"
     }

The strange thing is that out of, for example, 6000 lines, 5995 load correctly and the remaining 5 do not, even though the format is the same for all of them.

In the Logstash logs I see the following error: Preview of field's value: '28/03/2021 02:05'
This is one of the 5 dates that fail to upload.

Can anyone help me? What can I do to fix this?
Thanks.

Marco

Looks like there's a space after the day. To catch that, add a pattern with a space to your match as well.

date {
 match => ["Machine Time", "dd/MM/yyyy HH:mm",  "dd /MM/yyyy HH:mm",  "ISO8601", "dd/MM/yyyy HH:mm:ss"]
 target => "Machine Time"
}
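
Alternatively, you could strip stray whitespace from the field before the date filter ever sees it. A minimal sketch using the mutate filter's strip option (field name taken from your config); note that strip only removes leading and trailing whitespace, so it won't catch a space in the middle of the value:

mutate {
  # Trim leading/trailing whitespace from the field before date parsing.
  strip => ["Machine Time"]
}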

Hi aaron,
I don't know why I copied the string like that, but there is no space. I've edited the post now.

[image]

Marco

The date alone parses correctly, so I wouldn't think this has anything to do with the date itself. Is there more logic in your configuration that could be causing it?

Are you able to post your .conf?

input {
  file {
    path => "PATH/file_name.csv"
    start_position => "beginning"
    sincedb_path => "PATH"
  }
}

filter {
  csv {
    columns => ["***", "Machine Time", "***", "***", ..., other 140 column names]
    separator => ";"
    convert => { "***" => "integer" }   # this for each numeric column, almost 40 columns
  }
  date {
    match => ["Machine Time", "dd/MM/yyyy HH:mm", "ISO8601", "dd/MM/yyyy HH:mm:ss"]
    target => "Machine Time"
  }
}

output {
  stdout { codec => rubydebug }
  elasticsearch {
    hosts => ["localhost:9200"]
    index => "index_name"
    user => "***"
    password => "***"
  }
}

Nothing looks out of place. Are you able to isolate the same 5 records each time?

Have you looked at the file with a text editor and not within Excel to verify no extra/special characters that could be causing it?
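
One way to catch the failing rows automatically: when match fails, the date filter tags the event with _dateparsefailure by default, so you can route those events somewhere visible. A minimal sketch (host and index carried over from your config):

output {
  if "_dateparsefailure" in [tags] {
    # Rows whose Machine Time could not be parsed land here for inspection.
    stdout { codec => rubydebug }
  } else {
    elasticsearch {
      hosts => ["localhost:9200"]
      index => "index_name"
    }
  }
}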

To give you a bit of context: I'm running a test with a CSV populated with data coming from a MySQL database. In the future I will probably connect directly to the database, but for now I want to use this CSV and populate it manually. It is therefore a continuous flow of data; I cannot afford to have this problem repeat itself. If it happens with the first 6000 lines, it will probably happen with new data as well. So the solution is not to isolate these 5 lines but to solve the underlying problem, because otherwise it will recur with new data without my understanding the real cause.

Anyway, in Notepad I see this (so no extra characters):

[image]

I'm stumped by what I am seeing. Are you able to share the CSV so I can try to replicate the results?

How do I share a CSV file with you?

https://pastebin.com/ or https://gist.github.com/ would probably be the easiest.

I had to omit some string-format columns because they contain sensitive data.

I could not reproduce it with what was given. :man_shrugging:

The CSV has 13102 rows and it ingested them all with a correct date conversion.

Can you post the full date parse error in the log?

{
  "count" : 13102,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  }
}
Machine Time" : "2021-04-19T13:56:00.000Z",

What time zone are you in? Did 02:05 exist in that time zone, or did the time go from 01:59:59 to 03:00:00 and skip over the 02 hour?
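
You can test that theory in isolation. A minimal sketch using the generator input, assuming a zone like Europe/Rome that springs forward on 28 March 2021:

input {
  generator {
    lines => ["28/03/2021 02:05"]   # the suspect timestamp
    count => 1
  }
}
filter {
  date {
    match    => ["message", "dd/MM/yyyy HH:mm"]
    timezone => "Europe/Rome"   # 02:05 local time did not exist on this date here
  }
}
output {
  stdout { codec => rubydebug }   # expect the event to come out tagged _dateparsefailure
}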

How is that possible? The file I shared with you has 6551 rows, not 13102. How did you get the output you showed me? I'm afraid the file was truncated or modified in some way when I uploaded it to GitHub Gist. How many rows do you see?

I am in Italy. How do I answer your question? Where exactly do I check that?

In my advanced settings, I see that the timezone is set to follow the browser.

The file at failed parse error file · GitHub has 3513 lines. Not sure how I got that many results before, since I downloaded the file and that's what was there. This time I just copy/pasted the rows.

But I reran that one and got the same results.

{
  "count" : 3513,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "skipped" : 0,
    "failed" : 0
  }
}

Then you are on CEST, which starts on the last Sunday in March. 2 AM on March 28th did not happen; the clock skipped from 01:59:59 to 03:00:00, so a date filter cannot parse it. More commentary here.
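
If those timestamps come from a machine clock that does not observe DST, one workaround is to parse with a fixed-offset timezone so that no local time can fall into the DST gap. A sketch, assuming the source clock runs at UTC+1 year-round (verify against your machine):

date {
  match    => ["Machine Time", "dd/MM/yyyy HH:mm", "ISO8601", "dd/MM/yyyy HH:mm:ss"]
  target   => "Machine Time"
  timezone => "+01:00"   # fixed offset: the 02:00-03:00 spring-forward gap cannot occur
}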


You had no parse errors because the lines that fail to load were truncated when I uploaded the file to GitHub.

I almost took the computer and threw it out the window... :rofl: I've been trying to figure out this problem for three weeks.