Problem with the date-filter (timezone)

Hi,
I'm new to ELK, so it may be a layer 8 problem, but I'm not able to fix it, so I hope somebody here can help me.
Currently the date filter in my Logstash config is not doing what I expect. I import CSV files with some date fields in them. These have no timezone, so I added the date filter like this:

date {
        locale => "de"
        match => ["Start", "dd.MM.yyyy HH:mm:ss"]
        timezone => "Europe/Berlin"
}

This is what the field looks like in the CSV file:
"02.05.2017 07:46:49"

After running Logstash, this is what I get in Elasticsearch:

"Start": [
      1493711209000
    ],

which in "human language" is "Tue, 02 May 2017 07:46:49 GMT". But after running through my date filter, it should be "Tue, 02 May 2017 07:46:49 GMT+2:00", or am I wrong?
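
If I calculate correctly: 1493711209000 ms is 2017-05-02T07:46:49 UTC, but "02.05.2017 07:46:49" interpreted as Europe/Berlin local time (UTC+2 in May) would be 2017-05-02T05:46:49 UTC, i.e. 1493704009000 ms. So it looks to me as if the timezone is not being applied to this field.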

Help will be appreciated.

Unless configured otherwise with the target option, the date filter writes the resulting timestamp to the @timestamp field.
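
For example, something like this (a minimal sketch based on your existing filter, not tested against your data):

date {
        locale => "de"
        match => ["Start", "dd.MM.yyyy HH:mm:ss"]
        timezone => "Europe/Berlin"
        target => "Start"   # write the parsed date back into Start instead of @timestamp
}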

Yes, that's what I found out five minutes ago. But if I add
target => "Start"

I don't get any data into Elasticsearch anymore.
This is what I found in the elasticsearch output:

Caused by: java.lang.IllegalArgumentException: Invalid format: "2017-05-01T06:34:52.000Z" is malformed at "17-05-01T06:34:52.000Z"

That's probably because the Start field has been mapped in a particular way, and the data you're now sending suddenly doesn't match that mapping. Can you just delete the index and start over?
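
For example, in the Kibana Console (your-index-name is just a placeholder, since I don't know your actual index name):

DELETE your-index-name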

I did that, but I still get the same error. Do I have to delete the index patterns as well?

Index patterns in Kibana? No, you can leave them. When you get this error, what do the mappings of the index look like? Specifically the Start field. Use ES's get mapping API.
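
For example (again, your-index-name is a placeholder):

GET your-index-name/_mapping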

Mapping of Start:

          "Start" : {
            "type" : "date",
            "format" : "dd.MM.YYYY HH:mm:ss"
          },

This is my Logstash configuration, in case it helps:

input {
        file {
                path => ".../*.csv"
                start_position => "beginning"
                sincedb_path => "/dev/null"
        }
}


filter {
        csv {
                separator => ";"
                columns => ["UID","Start","Ende","IP","Land","Client", "Verbindungsdauer"]
        }
        geoip {
                source => "IP"
                database => ".../GeoLite2-City.mmdb"
        }
        grok {
                match => [ "path",".../...-%{GREEDYDATA:radiocountry}_%{GREEDYDATA:log_month}\.csv"  ]

        }
        fingerprint {
                source => ["message"]
                target => "fingerprint"
                key => "***"
                method => "SHA1"
                concatenate_sources => true
        }
        date {
                locale => "de"
                match => ["Start", "dd.MM.yyyy HH:mm:ss"]
                timezone => "Europe/Berlin"
                target => "Start"
        }
}


output {
        elasticsearch {
                hosts => "http://localhost:9200"
                index => "...-%{radiocountry}"
                document_id => "%{fingerprint}"
        }
        stdout {}
}

Thank you very much for helping.

Okay, so something resulted in that mapping of the Start field. Logstash doesn't do this out of the box, and if your date filter works, that's not what the Start field of your events will look like. Check again that you really have deleted the index so you're starting fresh, and that the date filter always works.
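
To double-check that nothing is left over, you can list all indices, for example:

GET _cat/indices?v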

I deleted all indices and checked that the mapping was gone. After starting Logstash, I get the same error as before.

What if you create the index by hand? What do the mappings look like for the newly created empty index? There shouldn't be any Start mapping. What if you insert a document with a Start field by hand?

Creating an empty index, just like "PUT testindex"?

Yes, exactly.

OK, I've never done this by hand, so I hope this is right.

GET testindex:

{
  "testindex": {
    "aliases": {},
    "mappings": {},
    "settings": {
      "index": {
        "creation_date": "1494404299923",
        "number_of_shards": "5",
        "number_of_replicas": "1",
        "uuid": "ZWr-zuBARv6tmgPmNJHKgA",
        "version": {
          "created": "5020199"
        },
        "provided_name": "testindex"
      }
    }
  }
}

Putting a new document in it:

PUT testindex/test/1
{
    "Start" : "02.05.2017 07:46:49"
}

Result:

{
  "_index": "textindex",
  "_type": "test",
  "_id": "1",
  "_version": 1,
  "result": "created",
  "_shards": {
    "total": 2,
    "successful": 1,
    "failed": 0
  },
  "created": true
}

No, don't insert "02.05.2017 07:46:49" in the Start field. That'll cause the automapper to pick the wrong format. Delete the index again. Recreate it. Check the mappings. Insert a document with an ISO8601 date (like "2017-05-01T06:34:52.000Z") in the Start field. Does that work?
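
Something like this in the Console (just a sketch, reusing the testindex name from your earlier test):

DELETE testindex
PUT testindex
GET testindex/_mapping
PUT testindex/test/1
{
    "Start" : "2017-05-01T06:34:52.000Z"
}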

It does.
This is what the mapping looks like after inserting the document:

"testindex": {
    "aliases": {},
    "mappings": {
      "test": {
        "properties": {
          "Start": {
            "type": "date"
          }
        }
      }
    }

Okay, good. Then the problem must be that the first document you insert into Elasticsearch still has the non-ISO8601 date format, forcing ES's automapper to map the field differently. When a subsequent document with an ISO8601 date arrives, it doesn't match the mapping of the field.

All date fields in the documents I have in the CSV files have the same format ("02.05.2017 07:46:49").
Could the problem be caused by the CSV header line?

Edit: Apparently not. I deleted the header line and get the same result. Here's an example line from the CSV file I'm using; the second field is the Start field:
"12345";"30.04.2017 17:28:09";"01.05.2017 03:25:36";"111.111.111.110";"de";"NSPlayer/12.0.7601.17514";"35847"

But if I understand correctly, the problem should then also occur when I don't use the date filter, shouldn't it? That's not the case.
