Parse not valid Json logs

I have some logs coming from my S3 bucket. The json in it is not valid since it has some non-standard " and { characters like below:

{“app_id":"fo","geid":"031fc83e0fea5c1e808857bd2db007d5","etid":"generic","ev":1,"date":"2016-02-01 00:00:00","mt":1454281200.04,"ip":"66.249.78.209","foui":null,"session_id":null,"member_id":-1,"is_buyer":false,"url":"https://de.fotolia.com/search?k="everglades+national+park"&offset=400","action":"displayed_contents"}

I tried with jsonlint.com to find it out. How should I deal with it?

Perhaps you can use a mutate filter and its gsub option to clean up this mess?

1 Like