Filebeat + Apache2 configuration help!

eddie1 · September 13, 2017, 9:39pm

Hi,

I'm new to the ELK stack and am trying to figure out how to configure filebeat+apache 2. I have the system doing some basic work such as syslog going from filebeat->logstash->ES, but I'm finding the documentation for setting up apache2 module very sparse.

Basically I have an apache 2.2 server outputting in a custom log format. I want to use filebeat and the apache2 module to read those logs, push the data to logstash->ES. The documentation on the elastic site is not very detailed on what needs to be done.

How do I get filebeat + apache module to read the custom format? I have some basic configuration for the module (ie. file paths) but where and how do I setup the custom log format I'm using? My apache2 custom log format:
LogFormat "%{X-ClientIP}i %{True-Client-IP}i %h %D %l %u %t "%r" %>s %b "%{Referer}i" "%{User-Agent}i" %X "%{u_locale}C" "%{u_locale}i" "%{u_locale}o"" combined
What do I need to do on the logstash server?

I have not configured any prospectors (I'm assuming I need to based on the log output when I try to set this up):
./filebeat -e --modules apache2 -setup

beat.go:346: CRIT Exiting: Error getting config for fielset apache2/access: Error interpreting the template of the prospector: template: text:3:22: executing "text" at <.paths>: range can't iterate over /apache/logs/access.log
Exiting: Error getting config for fielset apache2/access: Error interpreting the template of the prospector: template: text:3:22: executing "text" at <.paths>: range can't iterate over /apache/logs/access.log

filebeat.yml;
#------------------------------- Apache2 Module ------------------------------
filebeat.modules:

module: apache2
access:
enabled: true
var.pipeline: with_plugins
var.paths: /apache/logs/access.log
#prospector:
error:
enabled: true
var.paths: /apache/logs/error.log
#prospector:

andrewkroh · September 14, 2017, 3:41pm

The modules in Filebeat currently have a limitation in that they can only be used when the data is sent directly to Elasticsearch. This is because they rely on the Elasticsearch Ingest Node feature.

Since you are using Logstash already and you have a custom format I recommend that you add a grok filter to your LS config to parse the data. This tool is helpful for developing a grok pattern for you custom logs.

In filebeat add a new prospector to pick up your Apache logs.

filebeat.prospectors:
- paths:
  - /apache/logs/access*
  tags: [apache_access]
- paths:
  - /apache/logs/error*
  tags: [apache_error]
- paths:
  - /my/syslog/logs
  tags: [syslog]

output.logstash:
  hosts: ["127.0.0.1:5044"]

Then in Logstash add some grok filters for your Apache log formats.

input {
  beats {
    port => 5044
  }
}

filter {
  if "apache_access" in [tags] {
    grok {
      # You'll need to customize the pattern for your log format.
      match => { "message" => "%{COMBINEDAPACHELOG}" }
    }
    date {
      match => [ "timestamp" , "dd/MMM/yyyy:HH:mm:ss Z" ]
    }
  } else if "apache_error" in [tags] {
    // Add grok for your error log format.
  }
}

output {
  elasticsearch {
    hosts => "localhost:9200"
    manage_template => false
    index => "%{[@metadata][beat]}-%{+YYYY.MM.dd}" 
    document_type => "%{[@metadata][type]}" 
  }
}

https://www.elastic.co/guide/en/logstash/current/config-examples.html

eddie1 · October 6, 2017, 8:02pm

Thank you Andrew. I was able to get things working. Still having a bit of trouble, and i think it has to do with the IP address /grok expression I'm using.

Right now, we collect three possible IP addresses (one passed in by our load balancer (of the CDN node), one from our CDN (of the actual user), and one if something goes directly to our webserver). If the request didn't go through the load balancer or CDN, we basically have a dash in the 1st and 2nd IP addresses log entries.

When setting up the fields, I was only able to give it a text field type and not the IP address field type because of the dashes. Is there anything I can do to use the IP address type so we can do IP address searches (eg. a subnet search)

Thanks!

andrewkroh · October 6, 2017, 9:08pm

You could add a mutate filter to remove the offending field. Or you could add additional grok patterns that ignore the - entirely.

filter {
  if [my_ip_field] == "-" {
    mutate {
      remove_field => [ "my_ip_field" ]
    }
  }
}

system · November 3, 2017, 9:08pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
ELK setup for Apache Application Logs Beats filebeat	6	1727	June 28, 2018
Filebeat and apache2 module Beats filebeat	3	1638	June 12, 2017
Cant get apache module working Beats filebeat	3	1276	January 24, 2018
Logstash Custom file for apache configuration Logstash	3	308	July 27, 2019
Need help with setting up Apache2 to monitor the session time on every website Beats filebeat	42	3086	December 4, 2018

Filebeat + Apache2 configuration help!

Related topics