How to stop Filebeat from shipping incorrect JSON to Elastic?

andreycha · October 21, 2024, 3:44pm

Hi,

Our applications emit logs in ECS format, a log entry per line. In Filebeat config we have a parser defined:

parsers:
  - ndjson:
      target: ''
      expand_keys: true
      overwrite_keys: true
      add_error_key: true

Then there is a processor in the ingest pipeline which converts some of the fields into lower case:

processors:
  - lowercase:
      field: 'log.level'
on_failure:
- set:
    field: error.message
    value: '{{ _ingest.on_failure_message }}'

Sometimes we have issues with free disk space which lead to incorrect log entries in the file, where one log entry is concatenated with previous incompletely written log entry, e.g.:

{"@timestamp":"2024-10-17T10:35:59.1814105+02:00","log.level":"Information","message":"Requ{"@timestamp":"2024-10-17T12:00:01.0603052+02:00","log.level":"Information","message":"Ok"}

Notice how the first message property is not written completely and then the second log entry is immediately appended to it.

When Filebeat processes such lines, it emits an error like "Error decoding JSON: invalid character '@' after object key:value pair" (error message varies a bit depending on where exactly previous log line was abrupted).

Finally, broken log lines are delivered into Elastic where error.message is set to "field [level] not present as part of path [log.level]".

We would like to just completely ignore such broken log lines. How to implement it properly? ignore_decoding_error option of ndjson parser seems to just control whether error log is emitted or not.

stephenb · October 21, 2024, 6:17pm

Hi @andreycha

processors:

Are executed in order, so perhaps the last processors should just be a drop_event processor based on the contents or existence of one of the two following...

add_error_key
If this setting is enabled, Filebeat adds an "error.message" and "error.type: json" key in case of JSON unmarshalling errors or when a message_key is defined in the configuration but cannot be used.

Something like

  - drop_event:
      when:
        equals:
          error.type: "json"

andreycha · November 13, 2024, 3:46pm

Hi @stephenb ,

Thank you, that should do the job!

Topic		Replies	Views
Add original message to documents logged with decode_json_fields Beats filebeat	4	904	March 29, 2017
Log JSON decoding errors of TCP input Beats filebeat	2	344	December 10, 2018
Process logs in json with a "message" key not working with filebeat 6.4.1 Beats filebeat	4	2509	November 1, 2018
Filebeat json parser is trying to parse excluded lines Beats docker , filebeat	6	1004	June 29, 2022
Ndjson.expand_keys doesn't seem to work Beats filebeat	5	487	September 7, 2022

How to stop Filebeat from shipping incorrect JSON to Elastic?

Related topics