Bottleneck Data Pipeline

jogoinar10 · December 13, 2018, 2:31pm

I have a CSV that receives more than 300k rows per hour. I use Filebeat to ship the data into Elasticsearch.
My problem is that the data is sent very slowly, only about 2k rows every 3-5 minutes, and sometimes it stops for a while.

Are there any config hacks which will make the data shipping faster?

TIA

Christian_Dahlqvist · December 13, 2018, 2:55pm

Where are you sending the data?

rugenl · December 13, 2018, 3:08pm

Also, which OS is Filebeat running on, and is the CSV on a local or a shared disk?

jogoinar10 · December 14, 2018, 2:12am

The data goes from Filebeat -> Logstash -> Elasticsearch.

jogoinar10 · December 14, 2018, 2:13am

I'm using Filebeat on Windows. The CSV is stored on a local disk.

rugenl · December 14, 2018, 5:09pm

Well, I guess the next step is to see whether the delay is in harvesting or publishing. Have you checked the Filebeat and Logstash logs? Do you have other Beats that are sending OK?

jogoinar10 · December 15, 2018, 6:05am

Yes. There are no errors in the Filebeat logs.

Christian_Dahlqvist · December 15, 2018, 12:00pm

What is the specification of your Elasticsearch cluster? What kind of hardware and storage are you using?

If you want to test if Elasticsearch is limiting throughput, you can e.g. temporarily replace the Elasticsearch output with a file output and see if that changes the throughput of data collected.
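
To compare throughput before and after such a swap, it may help to sample how fast the file written by the file output grows. The following is only a minimal sketch: it assumes one event per line, and the path `throughput-test.log` is a hypothetical placeholder, not something taken from this thread.

```python
import time
from pathlib import Path


def count_lines(path):
    """Count newline-terminated events in the file, reading in 1 MiB chunks."""
    total = 0
    with Path(path).open("rb") as handle:
        while True:
            chunk = handle.read(1 << 20)
            if not chunk:
                break
            total += chunk.count(b"\n")
    return total


def rows_per_second(count_before, count_after, elapsed_seconds):
    """Convert two line-count samples into an average rate."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return (count_after - count_before) / elapsed_seconds


def sample_throughput(path, interval_seconds=60.0):
    """Measure how many rows were appended to `path` per second over one interval."""
    before = count_lines(path)
    started = time.monotonic()
    time.sleep(interval_seconds)
    after = count_lines(path)
    return rows_per_second(before, after, time.monotonic() - started)


if __name__ == "__main__":
    # Hypothetical path: point this at whatever file the file output writes to.
    rate = sample_throughput("throughput-test.log", interval_seconds=60.0)
    print(f"{rate:.1f} rows/s")
```

For reference, 2k rows every 3-5 minutes works out to roughly 7 to 11 rows/s, while 300k rows per hour needs about 83 rows/s. If the rate with the file output is much closer to the second figure, Elasticsearch is the likely limit; if it stays near the first, the bottleneck is probably earlier in the pipeline.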

