How to parse this json

Sketchy · August 10, 2019, 12:50am

I am new to logstash and have been trying to parse the below json with no luck.

{
"stat": "user",
"Ref": "USER:[5000,John Smith,3A37332D2659554F9CCE0CD754185269]",
"statistics": [
    "0",
    "",
    "",
    "0",
    0,
    0,
    1,
    "",
    0,
    "",
    "0",
    1,
    "IDLE",
    "5017",
    1505
]

}

The main thing I am trying to do is put each statistic into its own field, each line under statistic represents a metric from the source system.

example output:

  statistic.Status => "IDLE"
  statistic.Duration => "1505"

Can anyone point me in the right direction to get started with this?

admlko · August 10, 2019, 5:09am

I think you need to convert the statistics array into hash (“key” => “value”) and for that you need a second array containing hash keys (column names).

First, you can use json_lines (or json) codec as input in order to parse the incoming event(s) as json, then in filter section you can use ruby filter to join the two arrays into a hash.
Something along these lines:

Keys = [“Status”, “Duration”, ....]
Event.set( Joined, Hash[keys.zip(event.get(“statistics”).map {|i| i})] )

It is probably better to inject the column names (keys array) early on with add_field etc. At least it is much cleaner and easier to maintain. Then you can just read it in ruby filter (event.get()).

PS. Sorry for the auto-capitalization of the variable names. I’m writing this on my phone.

Sketchy · August 11, 2019, 2:26am

Thanks for the reply but I thought it would be a much easier task to process that data in the statistics field. My initial idea was I thought I would be able to use the CSV filter on the statistics field but that doesn't seem to work. Same with gsub, if I try to do anything to the statistics field I get the error:

 gsub mutation is only applicable for strings and arrays of strings, skipping {:field=>"statistics"

I'm failing to understand why its so difficult to work with this field, I'm new to logstash but I have been able to get it to do amazing things on logs that are much more complicated than this.

The ruby debug output for the statistics field looks like this, surely there is a simple way I can create a field for each number.

"statistics" => [
    [ 0] "0",
    [ 1] "",
    [ 2] "",
    [ 3] "0",
    [ 4] 0,
    [ 5] 0,
    [ 6] 0,
    [ 7] "",
    [ 8] 0,
    [ 9] "",
    [10] "0",
    [11] 0,
    [12] "IDLE",
    [13] "5017",
    [14] 1051

Badger · August 11, 2019, 12:43pm

    mutate { join => { "statistics" => "," } }
    csv { source => "statistics" autogenerate_column_names => true }

Obviously you do not actually want autogenerate_column_names, you would use the columns option.

Sketchy · August 11, 2019, 2:17pm

That is exactly what I wanted I knew it would be this simple, thanks so much. I must have tried every possible way except this, so simple in the end.

Badger · August 11, 2019, 2:23pm

My initial reaction was that @admlko was right and this required a ruby filter. It was only when you said you wanted to use a csv filter that I realized that could be done.

Sketchy · August 11, 2019, 2:50pm

Well thanks to the both of you anyway

system · September 8, 2019, 2:50pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How can I parse nested JSON strings to JSON objects? Logstash	7	1536	November 28, 2021
Logstash ruby filter doesn't parse json and return whole message field in Kibana Logstash	4	2398	February 9, 2019
JSON to fields in Logstash Logstash	7	484	December 9, 2020
Logstash Parse stingyfied json to seperate json fieldsl Logstash	1	287	April 3, 2023
Logstash: parsing fields from json in array of another json Logstash	3	1269	October 24, 2018

How to parse this json

Related topics