max_bytes works fine on input messages, but after the multiline filter is applied, every newline byte in the original message gets replaced by two printable bytes, \ and n, so the message we send on is slightly larger than expected. Is it possible to apply max_bytes after the multiline filter, so the result message has exactly that size?
The reader in Filebeat applies the max_bytes setting after multiline. The multiline reader normalizes multiline events by joining the individual lines with a single newline character, \n.
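For reference, here is a sketch of the kind of input configuration being discussed; the paths and the multiline pattern are illustrative placeholders, not taken from the thread:

```yaml
filebeat.inputs:
  - type: log
    paths:
      - /var/log/app/*.log        # example path, adjust to your setup
    # max_bytes limits the event *after* multiline has joined the lines
    max_bytes: 6144
    # example pattern: lines not starting with '[' are continuations
    multiline.pattern: '^\['
    multiline.negate: true
    multiline.match: after
```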
Ok, but that doesn't change anything. The point is that the message arriving in Logstash/ES from Filebeat should be no larger than exactly the size set in max_bytes. As it is, I can't rely on that size and have to reduce it a little.
Hm... maybe I didn't fully understand your issue. max_bytes is already applied after multiline processing. The message content is not bigger than max_bytes when it is passed to the output. The only reasons the output can become bigger are characters that require a special (multi-byte) encoding, plus the additional metadata Beats sends in a JSON document along with the actual content. The full event will be bigger than the original log line.
It is exactly the \n characters: 2 bytes as printable characters instead of 1 byte in the source message. But if I set max_bytes: 6144, I want at most a 6144-byte message in Logstash. Not 6145, not 6146, etc. (it depends on the number of newlines). I send a 10K-byte message to Filebeat and get 6156 bytes in Logstash. If I replace every \n with a single \, I get exactly 6144 bytes.
It is the JSON encoding escaping special characters. This happens at the network level only. Logstash decodes the message, resolving these escape sequences -> after decoding, \n takes only 1 byte again.
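The byte-count difference can be reproduced with any JSON library; this small Python sketch (not part of Filebeat itself) shows the newline growing to two bytes on the wire and shrinking back to one after decoding:

```python
import json

# A multiline event joined with a single real newline, as Filebeat produces it
msg = "line one\nline two"
print(len(msg.encode("utf-8")))      # 17 bytes before JSON encoding

# JSON escapes the newline as the two printable characters '\' and 'n'
encoded = json.dumps(msg)
print(len(encoded))                  # 20 bytes: 17 + 1 extra for \n + 2 quotes

# Logstash-style decoding restores the real newline, so the size matches again
decoded = json.loads(encoded)
print(len(decoded.encode("utf-8")))  # back to 17 bytes
```

So max_bytes is honored on the decoded message; only the escaped on-the-wire representation is larger.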