ES 32kb Field Limit - Logstash Ruby Plugin help

EmFalcon · April 30, 2019, 9:08pm

Using Elastic Stack 6.3. Workflow : Filebeat (input logfile) -> Logstash -> ES

A specific log file we have generates a individual message that exceeds 32kb which from what I am reading the limit of lucene for index and searching.

Is it possible to use the ruby filter plugin for logstash to split the field and send to 2 different fields based on size or length (of say 8100 chars)? My ruby skills are non-existent and if anyone can help me it would be greatly appreciated but below is what I THINK is possible. If there is a better way I am all ears.

Help!

filter
{
if ([entity_type] == "type_log") {
grok { id => "filter_grok_type_log"
match => { "message" => ("%{GREEDYDATA:message}") # ignore that this isn't exactly my filter
}
ruby {
code => message = event["message"].split(0..8100)
message2 = event["message"].split(8101.. ??) # ?? should be end of message field
}
mutate {
replace => { "message", "%[message]" }
add_field => { "message2", "%[message2]" }
}
}
}

Badger · April 30, 2019, 9:41pm

The following will chop up a string into 150 character chunks...

    mutate { add_field => { "someField" => "Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor
incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex
ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum." } }
    ruby {
        code => '
            part = 1
            s = event.get("someField")
            while s != ""
                event.set("part#{part}", s[0..150])
                s[0..150] = ""
                part += 1
            end
        '
    }

EmFalcon · May 1, 2019, 12:15pm

Badger thank you but then my question becomes how do I take that and assign new fields to each chunk? Sorry if I seem obtuse, I feel that way on this problem.

Badger · May 1, 2019, 1:08pm

The filter does that.

     "part1" => "Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor\nincididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, qu",
     "part2" => "is nostrud exercitation ullamco laboris nisi ut aliquip ex\nea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum ",
     "part3" => "dolore eu fugiat nulla pariatur.\nExcepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.",

EmFalcon · May 3, 2019, 7:07pm

Thank you badger this works well enough that I can run with it.

EmFalcon · May 3, 2019, 9:27pm

Note that I actually have adjusted the code to read as such now.

    ruby {
            code => '
            if event.get("message").length > #value then
            part = 1
            s = event.get("message")
            while s != ""
            event.set("message#{part}", s[0..#value])
            s[0..#value] = ""
            part += 1
            event.set("message", "Split fields")
            end
            end
            '
            }

system · May 31, 2019, 9:27pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Any way to limit field length? Logstash	5	9429	July 6, 2017
How to split a single line message, into parts Logstash	9	1020	July 15, 2022
Not able to process large lines of log data into elasticsearch Elasticsearch	5	3142	December 21, 2016
Checking Field Length Logstash	8	8572	November 4, 2022
One time batch processing big number of files Logstash	13	3909	July 6, 2017

ES 32kb Field Limit - Logstash Ruby Plugin help

Related topics