Multiline codec with Docker log driver logs

sgarap · December 3, 2018, 11:38am

Hi,

I am using logstash 6.5 and filebeat 6.5. I want to ship all docker container logs to logstash/elasticsearch. The docker container logs are formatted through JSON log driver and each line of stack trace is created as a separate json. See below.

{"log":"[2018-11-09 15:21:46,920] WARN [ReplicaFetcher replicaId=3, leaderId=2, fetcherId=1] Error connecting to node kafka-logs-1.kafka-logs.default.svc.cluster.local:9092 (id: 2 rack: null) (org.apache.kafka.clients.NetworkClient)\n","stream":"stdout","time":"2018-11-09T15:21:46.930338319Z"}
{"log":"java.io.IOException: Can't resolve address: kafka-logs-1.kafka-logs.default.svc.cluster.local:9092\n","stream":"stdout","time":"2018-11-09T15:21:46.930371914Z"}
{"log":"\u0009at org.apache.kafka.common.network.Selector.doConnect(Selector.java:235)\n","stream":"stdout","time":"2018-11-09T15:21:46.930376969Z"}
{"log":"\u0009at org.apache.kafka.common.network.Selector.connect(Selector.java:214)\n","stream":"stdout","time":"2018-11-09T15:21:46.930381203Z"}
{"log":"\u0009at org.apache.kafka.clients.NetworkClient.initiateConnect(NetworkClient.java:864)\n","stream":"stdout","time":"2018-11-09T15:21:46.930385023Z"}
{"log":"\u0009at org.apache.kafka.clients.NetworkClient.ready(NetworkClient.java:265)\n","stream":"stdout","time":"2018-11-09T15:21:46.930388788Z"}
{"log":"[2018-11-09 15:21:47,013] INFO [ReplicaFetcher replicaId=3, leaderId=2, fetcherId=1] Retrying leaderEpoch request for partition logging-4 as the leader reported an error: UNKNOWN_SERVER_ERROR (kafka.server.ReplicaFetcherThread)\n","stream":"stdout","time":"2018-11-09T15:21:47.01591875Z"}

I tried to use Filebeat and multicodec plugin to put the stacktrace together and write it as single message to elasdticsearch.

For testing purposes, I was reading the logs from a file and I am forwarding them to logstash. I used the following filebeat.yml configuration

# filebeat.yml
filebeat.prospectors:
- type: log
  paths:
   - '/home/ubuntu/logstash/someapp.log'
  multiline.pattern: '^\\t'
  multiline.negate: false
  multiline.match: after

processors:
  - decode_json_fields:
      fields: ["message"]
      target: ""
      overwrite_keys: true

output.logstash:
  hosts: ["localhost:5044"]

logging.to_files: true
logging.to_syslog: false

Filebeat is able to stripout @stream @time from logs but multiline is not working. See log , message attributes.

{
          "beat" => {
        "hostname" => "playground",
            "name" => "playground",
         "version" => "6.5.1"
    },
    "@timestamp" => 2018-12-03T10:05:12.758Z,
           "log" => "\tat org.apache.kafka.clients.NetworkClient.ready(NetworkClient.java:265)\n",
        "offset" => 918,
       "message" => "{\"log\":\"\\u0009at org.apache.kafka.clients.NetworkClient.ready(NetworkClient.java:265)\\n\",\"stream\":\"stdout\",\"time\":\"2018-11-09T15:21:46.930388788Z\"}",
          "host" => {
        "name" => "playground"
    },
        "source" => "/home/ubuntu/logstash/someapp.log",
    "prospector" => {
        "type" => "log"
    },
          "tags" => [
        [0] "beats_input_codec_plain_applied"
    ],
          "time" => "2018-11-09T15:21:46.930388788Z",
        "stream" => "stdout",
         "input" => {
        "type" => "log"
    },
      "@version" => "1"
}

Can some one help me getting the stacktrace from above logs as a single message in logstash/elasticsearch?

Regards,
Sgarap

steffens · December 3, 2018, 11:24pm

Better try the docker input type, instead of the log input type. The docker input type removes the JSON envelope, such that the multiline filter can operate on the original log contents written by your application.

Markus_Schulz · December 4, 2018, 7:04am

hello, i've the same problem.
is there any way to test a configuration with "type docker" and a sample input logfile taken from docker?

my multiline pattern looks like:
multiline.pattern: '^[ \t]+(at|.{3}) |^Caused by:'
multiline.negate: false
multiline.match: after

but what i got in elasticsearch was sometime a little bit grouped (partial content from an exception) and duplicates and single entries from the stacktrace...in summary unusable...

steffens · December 4, 2018, 2:15pm

@Markus_Schulz can you please create a new topic? You might have different problems as are described here.

system · January 1, 2019, 2:15pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Parse Elasticsearch json logs in filebeat Beats docker , journalbeat , filebeat	7	985	February 10, 2023
Docker container logs and filebeat Beats	2	3345	July 5, 2017
Filebeat multiline docker logs Beats filebeat	6	7130	February 26, 2018
Java multiline with add_docker_metadata Beats	5	337	June 5, 2018
What does filebeat's multiline tool match Beats filebeat	4	1171	April 28, 2020

Multiline codec with Docker log driver logs

Related topics