We are using Logstash to pull data from an Oracle database via the JDBC input, with a SQL query based on an ID and a timestamp. The Oracle table receives around 15 billion rows per month and data is inserted into it continuously.
That is roughly 1 million new records per minute.
Logstash cannot keep up with the rate at which data is inserted into the table.
Can you suggest a solution to this issue?
How complex is the query? How long does it take to run the query and retrieve the results if you use e.g. a script? If running the query and extracting the data is not the bottleneck, would it be possible to partition the query and have multiple pipelines each process a subset of the data, thereby increasing parallelism?
Hi Christian_Dahlqvist
The query joins 3 tables and the result is inserted into Elasticsearch. The Oracle database already has partitions and indexes. How can I set up multiple pipelines?
If you have a natural way to partition your data, you can create multiple pipelines where each has a jdbc input that contains a WHERE clause that ensures the inputs read different data.
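For example, here is a minimal sketch of what that could look like. The connection details, table/column names (my_table, id), file paths, and the modulus-based split are assumptions for illustration, not taken from your setup; adjust them to however your data naturally partitions (ID ranges, time buckets, etc.):

```yaml
# pipelines.yml - one pipeline per data slice (paths are hypothetical)
- pipeline.id: oracle_slice_0
  path.config: "/etc/logstash/conf.d/oracle_slice_0.conf"
- pipeline.id: oracle_slice_1
  path.config: "/etc/logstash/conf.d/oracle_slice_1.conf"
```

```
# oracle_slice_0.conf - reads only rows where MOD(id, 2) = 0.
# The second pipeline is identical except it uses MOD(id, 2) = 1
# and its own last_run_metadata_path.
input {
  jdbc {
    jdbc_connection_string => "jdbc:oracle:thin:@//dbhost:1521/ORCL"  # assumed host/service
    jdbc_user => "logstash"
    jdbc_password => "${ORACLE_PASSWORD}"
    jdbc_driver_library => "/opt/ojdbc8.jar"
    jdbc_driver_class => "Java::oracle.jdbc.driver.OracleDriver"
    schedule => "* * * * *"
    use_column_value => true
    tracking_column => "id"
    tracking_column_type => "numeric"
    last_run_metadata_path => "/var/lib/logstash/.jdbc_last_run_slice_0"
    statement => "SELECT * FROM my_table WHERE MOD(id, 2) = 0 AND id > :sql_last_value ORDER BY id"
  }
}
output {
  elasticsearch {
    hosts => ["http://localhost:9200"]
    index => "my_index"
    document_id => "%{id}"
  }
}
```

Note that each pipeline must have its own last_run_metadata_path so the tracking values do not collide, and selecting only the columns you actually need (instead of SELECT *) can also reduce the time spent retrieving results.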