Best practice to handle Logstash configuration


(Tag V) #1

Hello All,

My Logstash configuration has almost 1500+ lines of code, where I take input from Beats, apply filters to extract fields for various log types, and output to Kafka. Is there any way to handle huge configuration files, or any best practices for Logstash parsers?

Thanks in advance.


(Magnus Bäck) #2

I'd split that file into multiple files, but apart from that it's hard to give advice without seeing what it looks like.
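For illustration, here is a hypothetical sketch of what splitting looks like. Logstash concatenates every file matching `path.config` (by default something like `/etc/logstash/conf.d/*.conf`) in lexical order, so numeric prefixes keep the input first and the output last; the file name `21-filter-syslog.conf` and the `syslog` tag below are made-up examples:

```
# /etc/logstash/conf.d/
#   10-input-beats.conf      <- the beats input
#   20-filter-apache.conf    <- filters for one log type
#   21-filter-syslog.conf    <- filters for another log type (shown below)
#   90-output-kafka.conf     <- the kafka output

# 21-filter-syslog.conf
filter {
  if "syslog" in [tags] {
    grok {
      match => { "message" => "%{SYSLOGLINE}" }
    }
  }
}
```

Each per-log-type file guards its filters with a conditional, so all the files together behave exactly like the original single file.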

I'd also strongly suggest using my tool Logstash Filter Verifier to test your filter configurations.
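A Logstash Filter Verifier test case is roughly a JSON file pairing raw input lines with the fields you expect the filters to produce; the field names and log line below are hypothetical examples, not taken from the posted config:

```json
{
  "fields": {
    "type": "syslog"
  },
  "input": [
    "Oct  6 20:55:29 myhost sshd[31993]: Connection closed"
  ],
  "expected": [
    {
      "type": "syslog",
      "program": "sshd",
      "logmessage": "Connection closed"
    }
  ]
}
```

You then point the tool at the test case file(s) and your filter configuration directory, and it runs the inputs through Logstash and diffs the results against `expected`.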


(Tag V) #3

Does this mean splitting 1 large conf into many confs and running multiple Logstash instances?

This is awesome, I will try this tool.

Thanks in advance.


(Magnus Bäck) #4

Does this mean splitting 1 large conf into many confs and running multiple Logstash instances?

That's one possibility, but I was thinking of splitting into multiple files and running a single instance.
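(As a related option: newer Logstash versions, 6.0 and later, also support multiple pipelines within a single instance via `pipelines.yml`, so unrelated log flows don't share a queue. A minimal sketch, with hypothetical pipeline ids and paths:

```yaml
# pipelines.yml -- each log family gets its own pipeline in one instance,
# so a slow filter in one pipeline does not back-pressure the others.
- pipeline.id: apache
  path.config: "/etc/logstash/pipelines/apache/*.conf"
- pipeline.id: syslog
  path.config: "/etc/logstash/pipelines/syslog/*.conf"
```

With a single pipeline, splitting into multiple files in one directory is purely organizational; multiple pipelines additionally isolate the event flows.)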


(Tag V) #5

Here's the filter part of my conf. Please advise on this.

Thanks in advance.


(Magnus Bäck) #6

I obviously haven't studied all 2000 lines in any detail, but a few things stand out:

  • You're overusing DATA and GREEDYDATA in your grok expressions. That's very inefficient and can lead to incorrect matches.
  • Having conditionals that inspect the exact source filename (if [source] == "/archives/log/10.90.250.151.log" etc) is probably not a great idea.
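To illustrate both points with a hypothetical sketch (the field names and the Filebeat tag are made up, not from the posted config): an unanchored expression full of DATA/GREEDYDATA forces expensive backtracking on every non-matching line, while anchored, specific patterns fail fast.

```
# Slow and error-prone:
#   match => { "message" => "%{DATA:client} %{GREEDYDATA:rest}" }
#
# Anchored and specific:
filter {
  if "webserver" in [tags] {
    grok {
      match => { "message" => "^%{IP:client} %{WORD:method} %{URIPATH:path}$" }
    }
  }
}
```

Instead of conditionals on the exact source filename, the conditional above keys on a tag that the shipper (e.g. Filebeat) attaches to each event, which survives files being renamed or new hosts being added.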

(Tag V) #7

Can you please suggest any best practices for maintaining Logstash parser confs where we have custom log patterns (other than the usual standard log patterns provided by vendors)?


(Magnus Bäck) #8

I don't think I have any particular suggestions. In what way is your current situation unmanageable?


(Tag V) #9

As the number of conf lines increased, we started facing latency in log parsing. How can we handle this scenario efficiently? Will writing multiple confs improve performance? If so, how should we do this?


(Magnus Bäck) #10

Have you looked into using the Logstash monitoring API to see which filters are adding the most time to the processing?

But as I said, reduce the number of DATA and GREEDYDATA patterns. Don't spend time on other optimizations until your grok expressions have been cleaned up.
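For reference, the monitoring API is a plain HTTP endpoint (on port 9600 by default); the pipeline stats report per-plugin timings, so dividing `duration_in_millis` by the events processed gives a rough cost per event for each filter:

```
curl -s 'localhost:9600/_node/stats/pipelines?pretty'
```

Look for filter plugins whose `duration_in_millis` is disproportionate to their event counts; those are the ones worth rewriting first.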


(system) #11

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.