Logstash consumers not splitting kafka partitions to consume from

oats_gal · February 7, 2019, 2:21pm

The Elastic stack that I support was recently upgraded from using logstash 2.4 to 6.4. I've been troubleshooting a decrease in efficiency and found an interesting difference in how the versions are handling reading off of Kafka partitions. When running Logstash 2.4, consumers would equally split the partitions available for a particular Kafka topic and only process logs off of those assigned to it. Since upgrading these same consumers to running Logstash 6.4, this is no longer happening. I can now see consumers processing logs off of almost all partitions, instead of splitting them up among themselves. This may not fully explain the performance loss that we're witnessing, but I think it could be contributing.

I haven't been able to find any new settings for the Kafka input plugin that would explain this change in behavior. I am using the same plugin configuration for both consumer versions:

input {
    kafka {
        bootstrap_servers => "${KAFKA_SERVER}:9092"
        group_id => "test-LogStashConsumerGroup"
        topics => ["test-logstash-topic"]
        max_partition_fetch_bytes => "10485760"
        codec => "json"
        consumer_threads => 1
        decorate_events => true
        client_id => "${HOSTNAME}"

    }
}

So I guess my question is, is it possible for the newer Kafka input plugin bundled with Logstash 6.x to process kafka partitions more similarly to the older versions?

oats_gal · February 7, 2019, 6:55pm

I found a setting that looked promising, partition.assignment.strategy, as you can set it to org.apache.kafka.clients.consumer.RoundRobinAssignor, it looks to default to org.apache.kafka.clients.consumer.RangeAssignor in newer versions. This setting looks like it will do what I want, but I am getting a peculiar error whenever adding it to Logstash, on startup I receive this message:

Exception in thread "Ruby-0-Thread-69: :1" org.apache.kafka.common.errors.InconsistentGroupProtocolException: The group member's supported protocols are incompatible with those of existing members or first group member tried to join with empty protocol type or empty protocol list.

I thought perhaps this was due to me reusing the same consumer group, however I still get this same error whenever I try to start Logstash with a brand new group as well.

Any ideas? I'm really banging my head on my desk with this one.

oats_gal · February 11, 2019, 9:26pm

It seems that the simplest solutions are often overlooked. The fact that I was performing a rolling upgrade of these Logstash containers was what was resulting in the error above. I completely deleted the stack and rebuilt it using the updated version, the one that had the RoundRobinAssignor flag. The stack started up and the Logstash consumers are now splitting up Kafka partitions in a more efficient manor. Whew!

Hopefully this helps someone else who runs into a similar issue. Thank you all.

system · March 11, 2019, 9:26pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Logstash kafka input consumer group not sharing partitions Logstash	3	1324	April 12, 2017
Kafka.consumer.RangeAssignor: No broker partitions consumed by consumer thread logstash_logstash-indexer Logstash	10	8663	July 6, 2017
Logstash kafka input consumer group not finding all partitions Logstash docker	1	339	August 1, 2021
Kafka consumer issue Logstash	3	1859	July 6, 2017
Logstash read from kafka in a round-robin way Logstash	1	643	July 6, 2017

Logstash consumers not splitting kafka partitions to consume from

Related topics