Hi there,
We have just started with the ELK stack to monitor our servers. The servers are running on Debian Jessie (8.8) and ELK Stack 5.5.
On most of the machines metricbeat works as expected. But I have an issue with some servers: not all fields with system.x.y were filled in Kibana. Only "system.process.x.y" is being filled with data, other fields (eg. system.memory. ...) are empty / unused in Kibana.
Both groups of servers are installed with an automatic installation tool, so the installed software is nearly identical.
The installed OS is
Linux 3.16.0-4-amd64 #1 SMP Debian 3.16.43-2 (2017-04-30) x86_64 GNU/Linux
Metricbeat is
metricbeat 5.5.0
metricbeat.yml (the same on all servers)
#========================== Modules configuration ============================
metricbeat.modules:
#------------------------------- System Module -------------------------------
- module: system
metricsets:
- cpu
- load
- core
- diskio
- filesystem
- fsstat
- memory
- network
- process
# Sockets (linux only)
#- socket
enabled: true
period: 30s
processes: ['.*']
#================================ Outputs =====================================
#----------------------------- Logstash output --------------------------------
output.logstash:
# The Logstash hosts
hosts: ["123.123.123.123:5044"]
I have started metricbeat in debugmode to collect logs on both groups of servers (snippet from a "non-working" server below) . The output looks to me as if metricbeat collects data for fields like system.memory.actual.free but they are not visible in Kibana or maybe they were not send via logstash to elasticsearch.
root@server: bin/metricbeat -c /etc/metricbeat/metricbeat.yml -e -d "*" 2>&1 | tee /root/mbeat-outp-root.log
...
"system": {
"memory": {
"actual": {
"free": 1068235440128,
"used": {
"bytes": 16046555136,
"pct": 0.014800
}
},
"free": 1064366252032,
"swap": {
"free": 17179865088,
"total": 17179865088,
"used": {
"bytes": 0,
"pct": 0.000000
}
},
"total": 1084281995264,
"used": {
"bytes": 19915743232,
"pct": 0.018400
}
}
....
What is the best way to debug this issue? Is there an easy way to check the data flow?
It makes me crazy that one group of servers sends the complete dataset via logstash to the elasticsearch instance and another group of servers sends only a part of the dataset ...
Best regards
Olaf