Filebeat Implementation Details

Hi, I'm using Filebeat on a ton of servers in my production environment and have some questions.

Does Filebeat compress logs while they're being streamed? We're trying to limit bandwidth usage between our servers and want to figure out the best way to compress logs in transit. Is it possible to stream gzipped logs? If not, does anyone have a tool that compresses on the fly?

When load balancing from Filebeat to Logstash, I can't see anything indicating that the order of log lines is preserved. Is it impossible to load balance while preserving the order of logs sent to ES?

Thank you

Does Filebeat compress logs while they're being streamed?

I don't think so.

When load balancing from Filebeat to Logstash, I can't see anything indicating that the order of log lines is preserved. Is it impossible to load balance while preserving the order of logs sent to ES?

ES itself doesn't maintain the order of inserted documents, so unless you have a field with a monotonically increasing integer (like the log file's line number or file offset) there is no ordering apart from the implied order given by the timestamp.
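For example, read order can be reconstructed after the fact from the timestamp plus such a field. Here's a minimal Go sketch of that idea (the `Event` struct and its fields are illustrative stand-ins, not Filebeat's actual types; the sort runs client-side after fetching the documents):

```go
package main

import (
	"fmt"
	"sort"
	"time"
)

// Event is an illustrative stand-in for an indexed log event.
type Event struct {
	Timestamp time.Time // when the line was read
	Offset    int64     // byte offset of the line within the source file
	Line      string
}

func main() {
	t := time.Now()
	// Events arrive out of order, e.g. after load-balanced delivery.
	events := []Event{
		{t, 120, "third line"},
		{t.Add(-time.Second), 0, "first line"},
		{t.Add(-time.Second), 60, "second line"},
	}

	// Sort by timestamp, breaking ties with the monotonically
	// increasing file offset to recover the original read order.
	sort.Slice(events, func(i, j int) bool {
		if !events[i].Timestamp.Equal(events[j].Timestamp) {
			return events[i].Timestamp.Before(events[j].Timestamp)
		}
		return events[i].Offset < events[j].Offset
	})

	for _, e := range events {
		fmt.Println(e.Offset, e.Line)
	}
}
```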

When using the Logstash output, the events are always compressed with zlib (at compression level 3). There is no compression when sending straight to Elasticsearch.

Is there any way to modify the compression level so that we can experiment with it?

Unfortunately, not at this time. The value 3 is hard-coded in the Logstash output; see the sketch below. You could open an enhancement request in the beats repo for this issue, or better yet, open a PR. :slight_smile:
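This is not the actual beats source, just a minimal Go sketch of what the hard-coding amounts to, assuming the standard `compress/zlib` package: the writer is created with a fixed level instead of one taken from the output config (the constant name here is made up):

```go
package main

import (
	"bytes"
	"compress/zlib"
	"fmt"
	"strings"
)

// Hypothetical constant mirroring the hard-coded value in the
// Logstash output; it is not read from any configuration.
const compressionLevel = 3

func main() {
	// A fake batch of log lines standing in for a publish window.
	payload := []byte(strings.Repeat("INFO something happened on the server\n", 200))

	var buf bytes.Buffer
	w, err := zlib.NewWriterLevel(&buf, compressionLevel)
	if err != nil {
		panic(err)
	}
	if _, err := w.Write(payload); err != nil {
		panic(err)
	}
	w.Close()

	fmt.Printf("raw: %d bytes, compressed: %d bytes\n", len(payload), buf.Len())
}
```

Running it prints the raw vs. compressed sizes, which is also a quick way to estimate the bandwidth saving on your own log samples.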

hi,

  1. As already mentioned, when publishing to Logstash, data is compressed with zlib at compression level 3. Configuring the compression level is not supported yet, but I'm planning to add it soon.

  2. In general, when using load balancing we cannot guarantee any timely indexing order, for reasons including these:

  • after sending to two different Logstash instances, one instance might outpace the other
  • if one Logstash instance fails during load balancing, lines have to be resent to another instance, which might already have processed subsequent lines

This does not mean your data are all out of order. The timestamp sent by Filebeat is the time the line was read; if you use grok to parse the timestamp out of the log lines themselves, you get more exact timestamps. We also ship an offset (the file offset in bytes), which gives you some ordering information.