Parsing Python Logs - Use Logstash or not

My current logging infrastructure uses just Filebeat and Elasticsearch.
I would like to improve the quality of the logs by parsing individual fields out of each message.

Looking for a recommendation on the best practice for doing this:

  1. Add an extra Logstash layer.
  2. Format the logs with https://pypi.python.org/pypi/logstash_formatter, but still send them directly to Elasticsearch. Because the output is already JSON, Logstash does not seem necessary (a minimal sketch follows this list).
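
To be concrete about option 2, this is roughly what I have in mind (the class name is taken from the logstash_formatter package as I understand its docs, and the log path is just an example):

```python
import logging

# Assumes the logstash_formatter package from PyPI; the class name
# LogstashFormatterV1 may differ between package versions.
from logstash_formatter import LogstashFormatterV1

handler = logging.FileHandler("/var/log/myapp/app.json")  # example path
handler.setFormatter(LogstashFormatterV1())

logger = logging.getLogger("myapp")
logger.setLevel(logging.INFO)
logger.addHandler(handler)

# Each record becomes a single JSON line, including any extra fields,
# which Filebeat (or anything else) can then ship as-is.
logger.info("user logged in", extra={"user_id": 42, "duration_ms": 17})
```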

In particular, one problem I am struggling with is multiline output (e.g. tracebacks).
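
To show what I mean, a single logging call with a traceback ends up as several physical lines in the file (the file name is just an example):

```python
import logging

logging.basicConfig(filename="app.log", level=logging.INFO)
log = logging.getLogger("myapp")

try:
    1 / 0
except ZeroDivisionError:
    # log.exception appends the full traceback, so this one event spans
    # several lines in app.log; a shipper that reads line by line then
    # turns each traceback line into its own document.
    log.exception("division failed")
```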

Thanks for the help.

The multiline problem will be fixed in one of the next Filebeat versions. You can follow the ticket here: https://github.com/elastic/filebeat/issues/89

Filebeat currently does not have a JSON input; it reads log files line by line, so you will still end up with plain strings. If you want to use grok, for example to extract the timestamp, adding Logstash is your best option.
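
To illustrate the kind of field extraction a grok filter would do on the Logstash side, here is the equivalent parse written out in Python (the log format, pattern, and field names are made up for the example):

```python
import re

# A raw line as Filebeat would ship it (format is hypothetical).
line = "2015-10-07 12:00:01,123 ERROR myapp Something broke"

# Roughly what a grok pattern such as
#   %{TIMESTAMP_ISO8601:timestamp} %{LOGLEVEL:level} %{WORD:logger} %{GREEDYDATA:message}
# would extract in a Logstash filter.
pattern = re.compile(
    r"(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) "
    r"(?P<level>\w+) (?P<logger>\S+) (?P<message>.*)"
)

match = pattern.match(line)
if match:
    print(match.groupdict())
    # {'timestamp': '2015-10-07 12:00:01,123', 'level': 'ERROR',
    #  'logger': 'myapp', 'message': 'Something broke'}
```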

About logging JSON from Python directly: if you send it straight to Elasticsearch without Filebeat, this should work, but it means you have to deal with multiple servers and with failed sends (e.g. when Elasticsearch is unreachable) yourself.
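
To make that trade-off concrete, here is a rough sketch of a handler that posts each record straight to Elasticsearch over HTTP; the URL, index name, and the use of the requests library are all assumptions for the example, and the interesting part is what happens when a send fails:

```python
import logging
import requests  # assumed HTTP client, not part of the stdlib

class ElasticsearchHandler(logging.Handler):
    """Hypothetical handler that indexes each record straight into ES."""

    def __init__(self, url="http://localhost:9200/python-logs/_doc"):
        # URL and endpoint path are made up; the path also depends on
        # your Elasticsearch version.
        super().__init__()
        self.url = url

    def emit(self, record):
        doc = {
            "@timestamp": record.created,   # epoch seconds; map accordingly
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        try:
            requests.post(self.url, json=doc, timeout=2)
        except requests.RequestException:
            # This is what Filebeat normally absorbs for you: if
            # Elasticsearch is unreachable, the record is lost unless
            # you add buffering/retry logic here.
            self.handleError(record)

logging.getLogger("myapp").addHandler(ElasticsearchHandler())
```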