My case: I have a separate pipeline for each file pattern. For the "NUMBERs*" files it does not work properly when putting data into Elasticsearch. On the input side the files are split into 50,000-line chunks, with a name that maps to the pipeline name. If I add files from different days, Logstash packs them into one bulk request and sends them with the wrong date. Can you point out where the error is?
Every file has a date in its snapshot line, so Logstash should close one file and pick up the next new one, but it does not work as expected. I have also tried the option "auto_flush_interval => 4"; it did not help.
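For reference, this is roughly how I added auto_flush_interval to the multiline codec (a sketch of the variant I tried; the other options were unchanged):

codec => multiline {
  pattern => "^#"
  negate => true
  what => previous
  multiline_tag => ""
  auto_flush_interval => 4   # flush a pending multiline event after 4 seconds; did not help
}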
Example file names:
NUMBERs_AutoExport_-a_20221029044502.txt
NUMBERs_AutoExport_-a_20221023044502.txt

Sample content of one file (the snapshot line carries the date):
# snapshot,68843601,20221023044502
# NUMBERs
a,b,c
d,e,f
# Type2
foo,1,2,3
bar,4,5,6
# DN Blocks
224135896,224135897,,,,,,,,,,,
224135896,224135897,,,,,,,,,,,
input {
  file {
    path => "/opt/data/input/NUMBERs_*.txt"
    sincedb_path => "/dev/null"
    start_position => "beginning"
    codec => multiline {
      pattern => "^#"
      negate => true
      what => previous
      multiline_tag => ""
    }
  }
}
filter {
  mutate { remove_field => [ "[event]", "log" ] }
  if [message] =~ /# snapshot/ {
    dissect {
      mapping => {
        "[message]" => "# %{activity},%{val},%{time}"
      }
      remove_field => [ "[message]" ]
    }
    date {
      match => [ "time", "yyyyMMddHHmmss" ]
      timezone => "Europe/Paris"
      target => "@timestamp"
    }
    # remember the snapshot time so the data rows of this file can reuse it
    ruby { code => '@@metadata = event.get("@timestamp")' }
    drop {}
  } else if "# NUMBERs" in [message] {
    mutate { add_field => { "eventType" => "NUMBERs" } }
    split { field => "message" }
    if [message] !~ /^#/ {
      csv { columns => [ "c1", "c2", "c3" ] }
    }
    # stamp each data row with the snapshot time remembered above
    ruby { code => 'event.set("@timestamp", @@metadata)' }
  } else if "# Type2" in [message] {
    mutate { add_field => { "eventType" => "Type2" } }
    split { field => "message" }
  } else {
    mutate { add_field => { "eventType" => "Unrecognized" } }
  }
}
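The output side is a standard elasticsearch output; a minimal sketch (host and index here are placeholders, not my real values):

output {
  elasticsearch {
    hosts => ["http://localhost:9200"]   # placeholder host
    index => "numbers-%{+YYYY.MM.dd}"    # placeholder index pattern
  }
}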
logstash.yml:
log.level: info
config.reload.automatic: true
config.reload.interval: 30s
pipeline.ecs_compatibility: disabled
pipeline.workers: 48
pipeline.batch.size: 2000
pipeline.batch.delay: 50
pipeline.ordered: auto
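And the pipelines are declared per file pattern in pipelines.yml, roughly like this (ids and paths are illustrative, not my real ones):

- pipeline.id: numbers
  path.config: "/etc/logstash/conf.d/numbers.conf"   # the pipeline shown above
- pipeline.id: other_files
  path.config: "/etc/logstash/conf.d/other_files.conf"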