Hello,
I am sending logs from filebeat to Logstash and then from logstash to ElasticSearch.
The problem is Logstash is consuming too much bandwidth.I have attached network usage graph of logstash server.
(green is incoming from filebeat)
In the starting I am just writing to file on Logstash server instead of sending to ElasticSearch.You can see the incoming rate.Then when I turn on ElasticSearch output.The incoming rate decreases and also outgoing is 10 times more data.
Can anyone help please.What configurations changes or hardware improvements should I make?
Currently I have 1 ingest node,1 master node and 2 data nodes.
I added persistent queues in logstash of 30 gb.They gets filled up quickly.
So the reason is mostly back pressure by ElasticSearch.
Do you have any suggestion how do I handle this?
The graph is MiB per minute
Green is receiving and yellow is transmitting.I don't understand why it is exactly asymmetric.I have done speed test on this machine and it can give speed upto 500 Mb/s download and 500 Mb/s upload speed.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.