Indexing application log files to elasticsearch

Hi,

I am investigating what to use to index application log files into elasticsearch.

===Use Case===

I have a microservice developed with Spring Boot. The logging framework I use is Logback.

Options to log application output are:

  1. File -- Basic file appender
  2. TCP Socket -- LogstashTcpSocketAppender

In the first case the output is plain text (not JSON). My options here, I assume, are to use a lightweight shipper such as Filebeat (to handle multiline events) and then output to Logstash.
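For the record, the first option could be sketched roughly like this in `filebeat.yml` (the log path and Logstash host are placeholders, and the multiline pattern assumes log lines start with an ISO date):

```yaml
filebeat.prospectors:
  - input_type: log
    paths:
      - /var/log/myapp/*.log        # hypothetical log path
    multiline:
      # treat any line that does NOT start with a timestamp as a
      # continuation of the previous event (stack traces etc.)
      pattern: '^\d{4}-\d{2}-\d{2}'
      negate: true
      match: after

output.logstash:
  hosts: ["localhost:5044"]         # hypothetical Logstash host
```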

OR
Output in JSON format over TCP and have Logstash listen on that port:

```
input { tcp { codec => "json" port => 5000 } }
```
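On the application side, a minimal `logback.xml` for the second option might look something like this (a sketch assuming the logstash-logback-encoder library is on the classpath; the destination is a placeholder and must match the Logstash tcp input):

```xml
<configuration>
  <appender name="LOGSTASH" class="net.logstash.logback.appender.LogstashTcpSocketAppender">
    <!-- placeholder: must match the host/port the Logstash tcp input listens on -->
    <destination>localhost:5000</destination>
    <encoder class="net.logstash.logback.encoder.LogstashEncoder"/>
  </appender>
  <root level="INFO">
    <appender-ref ref="LOGSTASH"/>
  </root>
</configuration>
```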

Am I barking up the wrong tree, or are those my two options?

Thanks,
Shane.

I'd dump the logs to a local file in JSON format, then use Filebeat to ship that. I don't like shipping logs directly over the network, since a network or server outage could lead to either a blocked application or dropped logs.
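A sketch of that approach with logstash-logback-encoder writing JSON to a rolling file (the file paths are hypothetical):

```xml
<appender name="JSON_FILE" class="ch.qos.logback.core.rolling.RollingFileAppender">
  <file>/var/log/myapp/app.json</file>  <!-- hypothetical path -->
  <rollingPolicy class="ch.qos.logback.core.rolling.TimeBasedRollingPolicy">
    <fileNamePattern>/var/log/myapp/app.%d{yyyy-MM-dd}.json</fileNamePattern>
    <maxHistory>7</maxHistory>
  </rollingPolicy>
  <!-- writes one JSON object per line, which Filebeat can then tail -->
  <encoder class="net.logstash.logback.encoder.LogstashEncoder"/>
</appender>
```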

Thanks for the quick reply mate.

Is it best practice to log output in JSON format rather than plain text?

Is it recommended then to:

  1. Ship logs directly from Filebeat to Elasticsearch
  2. Ship logs from Filebeat to Logstash, which outputs to Elasticsearch

I read this interesting article recently and am trying to understand the best solution!

Thanks,
Shane.

Is it best practice to log output in JSON format rather than plain text?

If you can control the logging format, I think it's preferable, since there's more or less no configuration to do (no multiline worries, for example).

Is it recommended then to:

  1. Ship logs directly from Filebeat to Elasticsearch
  2. Ship logs from Filebeat to Logstash, which outputs to Elasticsearch

Since Filebeat has basically zero features for processing or parsing events, I don't think the first option is very useful.
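A minimal Logstash pipeline for the second option could look like this (the hosts are placeholders; the json filter is only needed if the incoming event isn't already decoded):

```
input { beats { port => 5044 } }
filter { json { source => "message" } }
output { elasticsearch { hosts => ["localhost:9200"] } }
```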

+1 on the JSON format then! :stuck_out_tongue:

This is the part I am not sure about.
The logstash-logback-encoder library has an encoder called LoggingEventCompositeJsonEncoder that can provide greater flexibility in the JSON format.
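For example, a sketch of what that encoder's configuration can look like, built from the library's standard providers (the static "service" field is a made-up illustration):

```xml
<encoder class="net.logstash.logback.encoder.LoggingEventCompositeJsonEncoder">
  <providers>
    <timestamp/>
    <logLevel/>
    <loggerName/>
    <message/>
    <mdc/>
    <stackTrace/>
    <!-- hypothetical static field added to every event -->
    <pattern>
      <pattern>{"service":"my-service"}</pattern>
    </pattern>
  </providers>
</encoder>
```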

So I am thinking: if I define the patterns at the logging level, do I really need to ship through Logstash?

I understand your point about going directly over the network. My colleague mentioned that Beats and/or Logstash have a retry mechanism in place for network failures. Is that true?

I see the twelve-factor site recommends stdout! http://12factor.net/logs

Thanks,
Shane.

So I am thinking: if I define the patterns at the logging level, do I really need to ship through Logstash?

That depends on what kind of filtering you might want to do in Logstash, and if Elasticsearch is the only output you're interested in. There is no right or wrong here. It depends on your needs and preferences.

I understand your point about going directly over the network. My colleague mentioned that Beats and/or Logstash have a retry mechanism in place for network failures. Is that true?

Yes.


Thank you.