Hi All,
I'm using Logstash 2.3.0 to receive and process Netflow data. Average indexing is about 50K events per minute, but as I put the mutate
block to rename fields, throughput drops to 15K per minute.
## Rename all netflow fields
mutate {
rename => { "IN_BYTES" => "tx-bytes" }
rename => { "IN_PKTS" => "tx-pkts" }
rename => { "IN_SRC_MAC" => "src-mac" }
rename => { "IPV4_DST_ADDR" => "dst-ip" }
rename => { "IPV4_SRC_ADDR" => "src-ip" }
rename => { "L4_DST_PORT" => "dst-port" }
rename => { "L4_SRC_PORT" => "src-port" }
rename => { "L7_PROTO_NAME" => "protocol"}
rename => { "OUT_BYTES" => "rx-bytes" }
rename => { "OUT_DST_MAC" => "dst-mac" }
rename => { "OUT_PKTS" => "rx-pkts" }
rename => { "PROTOCOL_MAP" => "protocol-map" }
rename => { "TOTAL_FLOWS_EXP" => "total-flows-exp" }
}
Above is the mutate block. Logstash is running on a box with 2x 12 core Xeon X5650 2.67GHz and 192GB of ram with lots of free CPU and memory. Logstash starts with 24 threads.
Is there a better way the rename multiple fields in Logstash and does not hurt throughput because I cannot change field names from the Netflow source?
Thanks,