Hi, I'm new to Elastic and very interested in this pipeline:
Data Sources --> Logstash --> Kafka --> Logstash --> Elasticsearch, where the first LS specifies gzip data compression with the Kafka output plugin and the second LS enriches data with filter plugins.
I assume a gzip codec plugin is required on the second LS in order to process the data. Does that mean decompression happens on the second LS, or on the final ES? Also, where does the compression actually happen: on the first LS or in Kafka?
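For reference, here is roughly the shape of the first Logstash's output I have in mind (the broker address and topic name are just placeholders):

```
output {
  kafka {
    bootstrap_servers => "kafka:9092"   # placeholder broker address
    topic_id => "raw-events"            # placeholder topic name
    codec => json
    compression_type => "gzip"          # gzip compression on the Kafka output plugin
  }
}
```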
No, it is not. I have Logstash reading from Kafka, discarding 99% of the data, and writing with gzip compression to another Kafka instance. Another Logstash instance reads that topic, and it does not specify compression on the input.
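As a rough sketch of my setup (the broker addresses, topic names, and the drop condition are made up):

```
input {
  kafka {
    bootstrap_servers => "source-kafka:9092"   # made-up broker address
    topics => ["raw-events"]                   # made-up topic name
    codec => json
  }
}
filter {
  # Discard the events I do not care about (~99% of the volume)
  if [level] != "ERROR" {                      # made-up selection condition
    drop { }
  }
}
output {
  kafka {
    bootstrap_servers => "dest-kafka:9092"     # made-up second cluster
    topic_id => "filtered-events"
    codec => json
    compression_type => "gzip"                 # the producer compresses the batches
  }
}
```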
Thanks, Badger. It looks like you also have Logstash -> Kafka -> Logstash.
I want to do some data processing on the second Logstash, and in that case I assume a gzip codec is required. Do you have the same kind of processing running on your second Logstash?
Do you happen to know where the compression happens: on the first Logstash or on Kafka?
You do not require a gzip codec on the second Logstash instance. The Kafka message header indicates whether the message is compressed, so the input plugin will know whether to decompress.
I believe the Kafka producer (i.e., Logstash) is expected to do the compression, but I am not certain.
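For illustration, the second Logstash could look something like this; note there is no gzip codec anywhere on the input (the consumer group, enrichment filter, and index name are just placeholders):

```
input {
  kafka {
    bootstrap_servers => "dest-kafka:9092"
    topics => ["filtered-events"]
    group_id => "logstash-enrich"   # placeholder consumer group
    codec => json                   # matches the codec used by the producer; no gzip needed
  }
}
filter {
  # Placeholder enrichment; your real filter plugins go here
  mutate { add_field => { "pipeline" => "enrich" } }
}
output {
  elasticsearch {
    hosts => ["http://localhost:9200"]
    index => "events-%{+YYYY.MM.dd}"
  }
}
```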