Hi all,
I'm currently working on parsing a really nasty XML stream: events come in single-file form, can reach up to 5000 lines and, most importantly, include arrays of arrays which hold exactly the data I need to parse and visualize.
After some tinkering I came to the solution of using multiple split filters: one for the main array and others for the sub-arrays. This solution, however, leaves me out in the cold if a sub-array includes fields I have not accounted for (a case I fear will come up sooner or later).
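For reference, roughly what I have now; "records" and "values" are placeholders for my real field names:

filter {
  xml {
    source => "message"
    target => "parsed"
  }
  # one event per element of the main array
  split {
    field => "[parsed][records]"
  }
  # one event per element of a known sub-array;
  # a sub-array the config doesn't name simply never gets split
  split {
    field => "[parsed][records][values]"
  }
}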
Is there any way to make the split filter recursive so that it can manage any depth of sub-arrays?
I understand this would be very load intensive, but given the scope of the project, that is already expected.
Thanks for the replies,
I suppose that the "correct" way to do this is through xpath (which, to be honest, I don't want to dive into unless someone can guarantee beforehand that it can handle nested arrays), so I think for the moment I will stick to the multiple splits. It may not be elegant, but it works.
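If I ever do go the xpath route, from the xml filter docs I gather it would look something like this (the path here is made up for illustration):

xml {
  source => "message"
  store_xml => false
  # each xpath expression maps to a destination field;
  # results come back as arrays of strings
  xpath => {
    "/root/record/values/value/text()" => "value_text"
  }
}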
After some testing it seems I found the error: the XML has an array node with multiple attributes on that very node and on its children, so with the default xml filter configuration Logstash was mixing up the values, giving me a lot of headaches.
The situation improved a lot after changing these xml filter parameters:
force_array => false
force_content => true
All while keeping the split for the array node.
Now I only get a cosmetic split failure when the array node has a single value (it becomes a hash, so it's not splittable), but I can live with that by removing the tag. The final filter block is sketched below.
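Putting it together, the filter block now looks roughly like this (field paths are placeholders for my real ones, and if I'm reading the split filter right, the tag it adds on a non-splittable field is _split_type_failure):

filter {
  xml {
    source => "message"
    target => "parsed"
    force_array => false
    force_content => true
  }
  split {
    field => "[parsed][records][record]"
  }
  # with force_array => false a single-element array collapses to a hash,
  # so the split tags the event instead of splitting it; drop that tag
  # (removing a tag that isn't present is a harmless no-op)
  mutate {
    remove_tag => ["_split_type_failure"]
  }
}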