Regarding duplicacy of logs through filebeat at kibana

Priyanka_chauhan · September 14, 2022, 5:33am

Hi,
As attached screenshot of kibana, logs are repeated with same message in two format, only difference of it log.file.path as I seen here. These dhcp logs are coming through filebeat to kafka. Logs flow is: filebeat-kafka-loggstash-kibana. I am not understanding where is var/log/syslog path from there dhcp logs are coming , while I defined the path /var/log/kea/kea-dhcp4.log .

Can someone help me to find the issue and its solution to remove duplicacy of logs

my filebeat configuration is given here: 1. Download latest filebeat package for architecture x86_64 and transfer it to Debian server
2. Install it using the following command,

sudo dpkg -i filebeat-8.3.3-amd64.deb

sudo filebeat modules list

sudo filebeat modules enable system

Make the following changes in /etc/filebeat/modules.d/system.yml,

syslog:

enabled: true

auth:

enabled: true

Make the following changes in /etc/filebeat/filebeat.yml

============================== Filebeat inputs ===============================

filebeat.inputs:

type: filestream

id: dhcp2

enabled: true

/var/log/kea/kea-dhcp4.log

---------------------------- Elasticsearch Output ----------------------------

#output.elasticsearch:

#hosts: ["localhost:9200"]

---------------------------- Kafka Output ------------------------------------

output.kafka:

hosts: ["kafka1:9092", "kafka2:9092", "kafka3:9092"]

topic: “dhcp”

required_acks: 1

Validate configuration

sudo filebeat -e -c /etc/filebeat/filebeat.yml

Enable and start Filebeat

sudo systemctl enable filebeat

sudo systemctl start filebeat

warkolm · September 14, 2022, 5:54am

Please format your code/logs/config using the </> button, or markdown style back ticks. It helps to make things easy to read which helps us help you

That would be why, as that module reads /var/log/syslog.

Priyanka_chauhan · September 14, 2022, 7:08am

so you mean I have to disable this module ?
syslog:

enabled: true

auth:

enabled: true

Priyanka_chauhan · September 15, 2022, 10:36am

so you mean I have to disable this module ?
syslog:

enabled: true

auth:

enabled: true

Priyanka_chauhan · September 23, 2022, 8:18am

Hi, this is my filebeat configuration yml file. please check and solution of it


# This file is an example configuration file highlighting only the most common
# options. The filebeat.reference.yml file from the same directory contains all the
# supported options with more comments. You can use it as a reference.
#
# You can find the full configuration reference here:
# https://www.elastic.co/guide/en/beats/filebeat/index.html

# For more available modules and options, please see the filebeat.reference.yml sample
# configuration file.

# ============================== Filebeat inputs ===============================

filebeat.inputs:

# Each - is an input. Most options can be set at the input level, so
# you can use different inputs for various configurations.
# Below are the input specific configurations.

# filestream is an input for collecting log messages from files.
- type: filestream

  # Unique ID among all inputs, an ID is required.
  id: dhcp1

  # Change to true to enable this input configuration.
  enabled: true

  # Paths that should be crawled and fetched. Glob based paths.
  paths:
    - /var/log/kea/kea-dhcp4.log

  # Exclude lines. A list of regular expressions to match. It drops the lines that are
  # matching any regular expression from the list.
  #exclude_lines: ['^DBG']

  # Include lines. A list of regular expressions to match. It exports the lines that are
  # matching any regular expression from the list.
  #include_lines: ['^ERR', '^WARN']

  # Exclude files. A list of regular expressions to match. Filebeat drops the files that
  # are matching any regular expression from the list. By default, no files are dropped.
  #prospector.scanner.exclude_files: ['.gz$']

  # Optional additional fields. These fields can be freely picked
  # to add additional information to the crawled log files for filtering
  #fields:
  #  level: debug
  #  review: 1

# ============================== Filebeat modules ==============================

filebeat.config.modules:
  # Glob pattern for configuration loading
  path: ${path.config}/modules.d/*.yml

  # Set to true to enable config reloading
  reload.enabled: false

  # Period on which files under path should be checked for changes
  #reload.period: 10s

# ======================= Elasticsearch template setting =======================

setup.template.settings:
  index.number_of_shards: 1
  #index.codec: best_compression
  #_source.enabled: false


# ================================== General ===================================

# The name of the shipper that publishes the network data. It can be used to group
# all the transactions sent by a single shipper in the web interface.
#name:

# The tags of the shipper are included in their own field with each
# transaction published.
#tags: ["service-X", "web-tier"]

# Optional fields that you can specify to add additional information to the
# output.
#fields:
#  env: staging

# ================================= Dashboards =================================
# These settings control loading the sample dashboards to the Kibana index. Loading
# the dashboards is disabled by default and can be enabled either by setting the
# options here or by using the `setup` command.
#setup.dashboards.enabled: false

# The URL from where to download the dashboards archive. By default this URL
# has a value which is computed based on the Beat name and version. For released
# versions, this URL points to the dashboard archive on the artifacts.elastic.co
# website.
#setup.dashboards.url:

# =================================== Kibana ===================================

# Starting with Beats version 6.0.0, the dashboards are loaded via the Kibana API.
# This requires a Kibana endpoint configuration.
setup.kibana:

  # Kibana Host
  # Scheme and port can be left out and will be set to the default (http and 5601)
  # In case you specify and additional path, the scheme is required: http://localhost:5601/path
  # IPv6 addresses should always be defined as: https://[2001:db8::1]:5601
  #host: "localhost:5601"

  # Kibana Space ID
  # ID of the Kibana Space into which the dashboards should be loaded. By default,
  # the Default Space will be used.
  #space.id:

# =============================== Elastic Cloud ================================

# These settings simplify using Filebeat with the Elastic Cloud (https://cloud.elastic.co/).

# The cloud.id setting overwrites the `output.elasticsearch.hosts` and
# `setup.kibana.host` options.
# You can find the `cloud.id` in the Elastic Cloud web UI.
#cloud.id:

# The cloud.auth setting overwrites the `output.elasticsearch.username` and
# `output.elasticsearch.password` settings. The format is `<user>:<pass>`.
#cloud.auth:

# ================================== Outputs ===================================

# Configure what output to use when sending the data collected by the beat.

# ---------------------------- Elasticsearch Output ----------------------------
#output.elasticsearch:
  # Array of hosts to connect to.
  #hosts: ["localhost:9200"]

  # Protocol - either `http` (default) or `https`.
  #protocol: "https"

  # Authentication credentials - either API key or username/password.
  #api_key: "id:api_key"
  #username: "elastic"
  #password: "changeme"

# ------------------------------ Logstash Output -------------------------------
#output.logstash:
  # The Logstash hosts
  #hosts: ["localhost:5044"]

  # Optional SSL. By default is off.
  # List of root certificates for HTTPS server verifications
  #ssl.certificate_authorities: ["/etc/pki/root/ca.pem"]

  # Certificate for SSL client authentication
  #ssl.certificate: "/etc/pki/client/cert.pem"

  # Client Certificate Key
  #ssl.key: "/etc/pki/client/cert.key"

# ---------------------------- Kafka Output ------------------------------------
output.kafka:
  hosts: ["10.197.235.103:9092"]
  topic: "dhcp1"
  required_acks: 1

# ================================= Processors =================================
processors:
  - add_host_metadata:
      when.not.contains.tags: forwarded
  - add_cloud_metadata: ~
  - add_docker_metadata: ~
  - add_kubernetes_metadata: ~

# ================================== Logging ===================================

# Sets log level. The default log level is info.
# Available log levels are: error, warning, info, debug
#logging.level: debug

# At debug level, you can selectively enable logging only for some components.
# To enable all selectors use ["*"]. Examples of other selectors are "beat",
# "publisher", "service".
#logging.selectors: ["*"]

# ============================= X-Pack Monitoring ==============================
# Filebeat can export internal metrics to a central Elasticsearch monitoring
# cluster.  This requires xpack monitoring to be enabled in Elasticsearch.  The
# reporting is disabled by default.

# Set to true to enable the monitoring reporter.
#monitoring.enabled: false

# Sets the UUID of the Elasticsearch cluster under which monitoring data for this
# Filebeat instance will appear in the Stack Monitoring UI. If output.elasticsearch
# is enabled, the UUID is derived from the Elasticsearch cluster referenced by output.elasticsearch.
#monitoring.cluster_uuid:

# Uncomment to send the metrics to Elasticsearch. Most settings from the
# Elasticsearch output are accepted here as well.
# Note that the settings should point to your Elasticsearch *monitoring* cluster.
# Any setting that is not set is automatically inherited from the Elasticsearch
# output configuration, so if you have the Elasticsearch output configured such
# that it is pointing to your Elasticsearch monitoring cluster, you can simply
# uncomment the following line.
#monitoring.elasticsearch:

# ============================== Instrumentation ===============================

# Instrumentation support for the filebeat.
#instrumentation:
    # Set to true to enable instrumentation of filebeat.
    #enabled: false

    # Environment in which filebeat is running on (eg: staging, production, etc.)
    #environment: ""

    # APM Server hosts to report instrumentation results to.
    #hosts:
    #  - http://localhost:8200

    # API Key for the APM Server(s).
    # If api_key is set then secret_token will be ignored.
    #api_key:

    # Secret token for the APM Server(s).
    #secret_token:


# ================================= Migration ==================================

# This allows to enable 6.7 migration aliases
#migration.6_to_7.enabled: true

cat   /etc/filebeat/modules.d/system.yml


- module: system
  # Syslog
  syslog:
    enabled: true

    # Set custom paths for the log files. If left empty,
    # Filebeat will choose the paths depending on your OS.
    #var.paths:

  # Authorization logs
  auth:
    enabled: true

    # Set custom paths for the log files. If left empty,
    # Filebeat will choose the paths depending on your OS.
    #var.paths:

stephenb · September 23, 2022, 1:52pm

If you don't want the audit logs then just disable the whole system module.

Otherwise disable Just the syslog in the system module

- module: system
  # Syslog
  syslog:
    enabled: false

@Priyanka_chauhan BTW I fixed your formatting when the post is not well formatted it can make it very hard to help... some community members just skip poorly formatted code / question...

Priyanka_chauhan · September 26, 2022, 9:59am

If i am doing this ,logs not shipped to kafka.

stephenb · September 26, 2022, 4:23pm

It is unclear what your current state / status is....

What we are saying is use either the filebeat.input or the system module but not both.

It looks like you are ingesting both...

When you look at the filesystem do both log files actually exist?

Plus if filebeat already read / shipped the logs... restarting filebeat will not re-ship the existing log lines again... so unless there are new log lines it will not re-ship the data unless you clean up the filebeat data directory. See Here ...

Priyanka_chauhan · September 27, 2022, 4:23am

Thanks for the help. Issue is resolved now.

system · October 25, 2022, 6:23am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Kibana Duplicate data Logs	2	1001	October 4, 2021
Filebeat duplicate log Beats docker , filebeat	5	1067	April 22, 2020
FIlebeat Output Configuration Error Beats filebeat	26	622	September 9, 2024
Filebeats connected but no logs Beats filebeat	1	843	December 30, 2022
No matching indices found no indices match pattern filebeat-* Beats filebeat	18	10109	August 14, 2018

Regarding duplicacy of logs through filebeat at kibana

============================== Filebeat inputs ===============================

---------------------------- Elasticsearch Output ----------------------------

---------------------------- Kafka Output ------------------------------------

Related topics