I'm seeing the following INFO message, the connection to Kafka is lost and I'm unable to connect to it.
2018-03-14T14:13:02,743][INFO ][logstash.outputs.kafka ] Sending batch to Kafka failed. Will retry after a delay. {:batch_size=>1, :failures=>1, :sleep=>0.01}
I've validate that Kafka is up and running.
listeners=PLAINTEXT://IP:9092
this line already exist in Kafka. I could telnet to the Kafka server even though LS is unable to output to it.
Does the connection ever resume? Does it require a restart of Logstash?
The timing is suspicious; could a scheduled task be restarting the Kafka process? In a previous job, we had a finnicky Kafka installation that someone "fixed" by killing it daily in cron.
If I reboot the box, it will work, but a few hours later, the error is back. But if I restart it using "initctl" it will work fine for a while. What the different in the two starting method?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.