Hello,
I have a Python script that fetches data and writes it to a log file named cb.json. The script runs every 5 minutes and retrieves the last 1,000 log entries, so consecutive runs overlap and some log entries are fetched more than once.
Filebeat is configured to collect these logs and send them to Elasticsearch. However, I'm encountering an issue where duplicate logs are appearing in my Elasticsearch index.
I'm seeking advice on how to prevent these duplicates from being indexed in Elasticsearch. Are there any recommended approaches or best practices to ensure that only unique log entries are stored?
I have already tested custom document ids, hashes, fingerprints, and so on, without success (a simplified sketch of the fingerprint variant I tried is shown after the config below).
Thank you in advance for your assistance.
#:/etc/filebeat$ cat filebeat.yml
setup.template.name: "carbon_black"
setup.template.pattern: "carbon_black-*"
setup.template.enabled: false
setup.ilm.enabled: false
filebeat.inputs:
- type: log
  enabled: true
  paths:
    - /home/arc/cb.json
  json.keys_under_root: true
  json.overwrite_keys: true
  processors:
    - add_fields:
        target: event
        fields:
          dataset: "carbon_black.observations"
output.elasticsearch:
  hosts: ["https://10.10.0.1:9200", "https://10.10.0.2:9200", "https://10.10.0.3:9200"]
  protocol: "https"
  ssl.verification_mode: "none"
  username: "elastic"
  password: "elastic"
output.elasticsearch.indices:
  - index: "carbon_black_observations-%{+yyyy.MM.dd}"
    when.contains:
      event.dataset: "carbon_black.observations"
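
For reference, the fingerprint variant I experimented with looked roughly like the sketch below. It follows the documented approach of hashing selected event fields into @metadata._id so the Elasticsearch output uses that value as the document id; the field names under fields: are placeholders, not the exact Carbon Black fields I used.

filebeat.inputs:
- type: log
  enabled: true
  paths:
    - /home/arc/cb.json
  json.keys_under_root: true
  json.overwrite_keys: true
  processors:
    - add_fields:
        target: event
        fields:
          dataset: "carbon_black.observations"
    # Placeholder fields: in my test I used fields from the decoded Carbon Black JSON.
    - fingerprint:
        fields: ["event_id", "@timestamp"]
        target_field: "@metadata._id"

If I understand correctly, a fixed _id only prevents duplicates within a single index, so I am also wondering whether the daily index pattern above still lets entries re-fetched on a later day through.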