How can I pull data from Hive to Logstash

sesaravanan · August 3, 2017, 11:32am

Hi,

I'm very new to Logstash. I have read access to a database, so I decided to pull data from that remote database into Logstash. The data volume will be at least 1 TB per day.
How can I do capacity planning in terms of storage, networking, CPU, and memory?
What best practices should I follow?

Thanks,
Saravanan

josephjohney · August 3, 2017, 11:35am

You can use the various database plugins in Logstash.

For example, the JDBC plugin.

sesaravanan · August 3, 2017, 11:55am

Thanks for the response, Joseph.

Yes, I can use the JDBC plugin to pull data, but I want to do capacity planning for that.

sesaravanan · August 4, 2017, 3:59am

Is it possible to pull terabytes of data with one Logstash instance, or do I need to set up clustering? How would clustering work for pulling data from a remote database into Logstash?

josephjohney · August 4, 2017, 5:22am

You might have to scale your Logstash pipeline horizontally.

The scaling documentation might give you some general ideas on scaling the ELK stack.

For your case, the inputs will need a slight modification.
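One way to read the "modification on the inputs" is that each Logstash instance needs its own query, because several instances running the same JDBC statement would each pull the full result set. The sketch below is only an illustration of that idea: it builds one statement per instance, each covering a disjoint slice of the rows. The table name `events` and the numeric key column `id` are hypothetical, and it assumes the source accepts Hive's `pmod` function.

```python
def partitioned_statements(table: str, key_column: str, instances: int) -> list[str]:
    """Build one SELECT per Logstash instance, each covering a disjoint slice.

    Assumption: key_column is a numeric column, so pmod(key, instances)
    assigns every row to exactly one slot in 0..instances-1.
    """
    if instances < 1:
        raise ValueError("instances must be at least 1")
    return [
        f"SELECT * FROM {table} WHERE pmod({key_column}, {instances}) = {slot}"
        for slot in range(instances)
    ]


# Hypothetical table and column names, for illustration only.
statements = partitioned_statements("events", "id", 3)
for statement in statements:
    print(statement)
```

Each generated string would go into the `statement` setting of the JDBC input on one instance. Slicing by a date or partition column instead may suit a partitioned Hive table better.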

Also, please check the JDBC streaming plugin, which might be useful in your case.
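As a rough starting point for the capacity question, the 1 TB per day figure can be converted into a sustained rate and then into an instance count. This is a back-of-the-envelope sketch, not a sizing method from the Logstash documentation: the per-instance throughput of 5 MB/s and the 50% headroom are placeholder assumptions that would have to be replaced with numbers measured on your own pipeline.

```python
import math

BYTES_PER_TB = 10**12          # decimal terabyte
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_throughput(tb_per_day: float) -> tuple[float, float]:
    """Return the average rate as (megabytes per second, megabits per second)."""
    bytes_per_second = tb_per_day * BYTES_PER_TB / SECONDS_PER_DAY
    return bytes_per_second / 10**6, bytes_per_second * 8 / 10**6


def instances_needed(required_mb_s: float, per_instance_mb_s: float,
                     headroom: float = 0.5) -> int:
    """Estimate the instance count, keeping a fraction of each instance spare."""
    if not 0 <= headroom < 1:
        raise ValueError("headroom must be in [0, 1)")
    usable_mb_s = per_instance_mb_s * (1 - headroom)
    return math.ceil(required_mb_s / usable_mb_s)


mb_s, mbit_s = sustained_throughput(1.0)
print(f"{mb_s:.1f} MB/s, {mbit_s:.1f} Mbit/s")   # 11.6 MB/s, 92.6 Mbit/s

# 5.0 MB/s per instance is a hypothetical figure; measure your own.
print(instances_needed(mb_s, per_instance_mb_s=5.0))  # 5
```

The average hides peaks: if the data arrives in bursts or is pulled in a nightly window, divide by the length of that window instead of a full day.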

sesaravanan · August 4, 2017, 5:32am

okay..

system · September 1, 2017, 5:32am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
