Zeek - Filebeat module and Processors for DNS Lookups

thinkagain · December 19, 2020, 3:26pm

hi everyone,

I'm trying to get acquainted with the ELK platform and trying to understand how the different modules interact with each other.
Besides the (internal) ip addresses I would like the result of a dns lookup in the docs that end up in ES. The logs I am referring to are the ones from Zeek that are shipped to ES using Filebeat. This uses the Zeek module for Filebeat.

I found some documentation on processors that can be used with filebeat. There is even a special processor for DNS lookups that seems to do exactly what I am looking for.

I've added the required yaml entries to the filebeats.yml but nothing happens.
My question is whether I'm on the right path, can processors in the filebeat configuration be used when using a filebeats module (in this case the zeek module) or will this not work because the whole processing is being done by the module and therefore bypassing the standard pipeline.

If this is not going to work, any other suggestions how to enrich these log records with dns lookups?

Mario_Castro · December 22, 2020, 10:33am

What's your resulting yaml that isn't working? Can you paste your filebeat and module configurations?

Internally, the pipeline in Elasticsearch Ingest node is the one in charge of events manipulation. Check the JSON in the Ingest Node to double check if your changes are being updated in the node https://www.elastic.co/guide/en/elasticsearch/reference/current/ingest.html

thinkagain · December 22, 2020, 7:20pm

Hi Mario,

thanks for your assistance.
I added en entry to the "processors" section in the filebeat.yml, see yml below.

================================= Processors =================================

processors:

add_host_metadata:
when.not.contains.tags: forwarded
add_cloud_metadata: ~
add_docker_metadata: ~
add_kubernetes_metadata: ~
- dns:
** type: reverse**
** action: append**
** transport: tls**
** fields:**
** server.ip: server.hostname**
** client.ip: client.hostname**
** success_cache:**
** capacity.initial: 1000**
** capacity.max: 10000**
** min_ttl: 1m**
** failure_cache:**
** capacity.initial: 1000**
** capacity.max: 10000**
** ttl: 1m**
** nameservers: ['192.168.1.1', '192.168.1.1']**
** timeout: 500ms**
** tag_on_failure: [_dns_reverse_lookup_failed]**

================================== Logging ===================================

I made no further modifications to the zeek.yml except that I added the paths where the logs reside

Module: zeek

Docs: /guide/en/beats/filebeat/7.6/filebeat-module-zeek.html

module: zeek
capture_loss:
enabled: true
var.paths: ["/opt/zeek/logs/current/capture_loss.log"]
connection:
enabled: true
var.paths: ["/opt/zeek/logs/current/conn.log"]
dce_rpc:
enabled: true
var.paths: ["/opt/zeek/logs/current/dce_rpc.log"]

The document you linked suggests to do an ingest on the ES node right?
How would I setup the filebeat in order to achieve this? Is this something I need to configure inside the standard ES config?

I cant seem to find the location (on the filebeat machine) how to manipulate the standard processing as being done by the standard filebeat zeek module.

With your reference to "Ingest node" I noticed within the ES configuration under Stack Management the "Ingest Node Pipelines" menu option. Within this menu a separate "Ingest Node Pipeline" exists per zeek logfile. Hmm, is that where the enrichment and fieldmapping magic happens? Is al the processing being done on the Elastic node itself...makes sense you should appoint dedicated ingest nodes as stated in the document.
I'm going to try whether making adjustments in the ingest pipeline is perhaps the way to do this.

These are obviously newbee questions and that is so true. Please point me in the right direction to prevent make all the beginner mistakes.

Mario_Castro · December 23, 2020, 3:50pm

Please, can you format the conf you pasted, the forum is markdown compatible? In yaml it's very important to see if the configuration is correctly formatted and you'll be surprised of how many times it's just an indentation issue

thinkagain · December 23, 2020, 7:16pm

Hi Mario,

another beginner mistake. Retry attempt of my yaml underneath.

I dug a little further into the configuration. I belief that the predefined modules do not look at the main /etc/filebeat/filebeat.yml file.

I found that for each (in this case Zeek) logfile within a module a separate connection.yml exists. This yaml file describes the way the fields are parsed and renamed by filebeat.
The second stage occurs at Elasticsearch by the Ingest Node Pipeline. This pipeline is created by filebeat during the setup and is created based on a template which is present in the /usr/share/zeek//ingest/pipeline.yml. In the second stage enrichment with GeoIP is done (amongst others).

My conclusion is that the enrichment with DNS reverse lookups should be added in the 2nd stage (but could in theory also be done during the first stage by Filebeat).

I noticed that the Ingest Pipeline at Elasticsearch is created in JSON which makes editing a little more difficult (at leat for me).

My Question remains what the recommended way is to make this adjustment, there seem to be multiple scenario's:

Scenario 1: Let the enrichment occur at Filebeat (stage1) by adding the additional processor to the connection.yml

Scenario 2: Let the enrichment occur at Elastic (stage 2) by adding the additional processor in Yaml to the ingest\pipeline.yml file on the filebeat machine and re-run the setup.

Scenario 3: Let the enrichment occur at Elastic (stage 2) by adding the additional processor in JSON to the Ïngest Node Pipeline".

If I do the adjustment as described in scenario 1 or 2, what happens when filebeat gets updated, will this overwrite my customizations?

# ================================= Processors =================================
processors:
  - add_host_metadata:
      when.not.contains.tags: forwarded
  - add_cloud_metadata: ~
  - add_docker_metadata: ~
  - add_kubernetes_metadata: ~
  - dns:
      type: reverse
      action: append
      transport: tls
      fields:
        server.ip: server.hostname
        client.ip: client.hostname
      success_cache:
        capacity.initial: 1000
        capacity.max: 10000
        min_ttl: 1m
      failure_cache:
        capacity.initial: 1000
        capacity.max: 10000
        ttl: 1m
      nameservers: ['192.168.1.1', '192.168.1.2']
      timeout: 500ms
      tag_on_failure: [_dns_reverse_lookup_failed]

# ================================== Logging ===================================

thinkagain · December 23, 2020, 9:30pm

Another update on this subject.

As mentioned in previous post, I thought there were 3 possible scenario's.
It turned out that scenario 2 and 3 will not work since the DNS processor only exists for Filebeat and doesn't for Elastic.

I did manage to get the first scenario up and running by adding the DNS processor entries to the module specific connection.yml in the processor section.
Path: /usr/share/filebeat/module/zeek/connection/config/connection.yml

This is the yaml I've added in the processor section to get it to work:

  - dns:
      type: reverse
      action: replace
      transport: udp
      fields:
        source.ip: source.domain
        destination.ip: destination.domain
      success_cache:
        capacity.initial: 1000
        capacity.max: 10000
        min_ttl: 1m
      failure_cache:
        capacity.initial: 1000
        capacity.max: 10000
        ttl: 1m
      nameservers: ['192.168.1.1', '192.168.1.2']
      timeout: 500ms
      tag_on_failure: [_dns_reverse_lookup_failed]

system · January 20, 2021, 11:31pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Ingest pipeline not working for filebeat Elasticsearch	2	3113	May 13, 2020
I can't see Zeek's http.log in Kibana but everything else (DNS, SSL, etc.) is fine Beats filebeat	2	1563	March 2, 2020
Zeek ingest using logstash on elastic cloud Logstash	9	1212	May 6, 2021
Filebeat modules Beats beats-module , filebeat	14	467	September 8, 2021
Ingesting PCAP dataset into ELK via Zeek and Zeek scripting Beats filebeat	4	1257	September 21, 2021

Zeek - Filebeat module and Processors for DNS Lookups

================================= Processors =================================

================================== Logging ===================================

Module: zeek

Docs: /guide/en/beats/filebeat/7.6/filebeat-module-zeek.html

Related topics