Thank you Mat and Stephen!!!!
So, after reading all your suggestions - decided to bite the bullet and start from scratch!
so I did the following:
- got a new ES8.4.3 cloud cluster - clean, with no additional monitoring settings
- updated my filebeat.yml to use minimal monitoring config - only set the "
monitoring.enabled: true
" setting
here is what it looks like now: (a couple of extra settings for pipelines - leaving them here just in case they might be a problem - unlikely)
# ---------------------------- Elasticsearch Output ----------------------------
output.elasticsearch:
enabled: true
index: "ibc-parsed-logs"
parameters.pipeline: "geoip-info"
hosts: ${ES_HOSTS}
protocol: "https"
# Authentication credentials - either API key or username/password.
api_key: ${ES_API_KEY}
# The maximum number of events to bulk in a single Elasticsearch bulk API index request.
# The default is 50.
#bulk_max_size: 100
# ================================= Processors =================================
processors:
- add_host_metadata:
when.not.contains.tags: forwarded
- decode_json_fields:
fields: ["message"]
add_error_key: true
document_id: "event_uuid"
# ================================== Logging ===================================
logging.metrics.enabled: true
logging.enabled: true
# Sets log level. The default log level is info.
# Available log levels are: error, warning, info, debug
logging.level: debug
logging.to_files: true
logging.files:
path: /usr/share/filebeat/f_logs
name: filebeat
keepfiles: 10
permissions: 0640
# At debug level, you can selectively enable logging only for some components.
# To enable all selectors use ["*"]. Examples of other selectors are "beat",
# "publisher", "service".
logging.selectors: ["*"]
# ============================= X-Pack Monitoring ==============================
monitoring.enabled: true
- make sure I have correct cloud.id, 'hosts' and api key values (beats format)
Now, when I run filebeat, I do not see any errors anymore (!) and I can see in filebeat's logs that the monitoring events are being sent to the ES cluster:
{"log.level":"debug","@timestamp":"2022-10-18T14:55:21.509Z","log.logger":"monitoring","log.origin":{"file.name":"processing/processors.go","file.line":210},"message":"Publish event: {\n \"@timestamp\": \"2022-10-18T14:55:21.505Z\",\n \"@metadata\": {\n \"beat\": \"filebeat\",\n \"type\": \"_doc\",\n \"version\": \"8.4.3\",\n \"type\": \"beats_stats\",\n \"interval_ms\": 10000,\n \"params\": {\n \"interval\": \"10s\"\n }\n },\n \"metrics\": {\n \"registrar\": {\n \"states\": {\n \"current\": 0,\n \"update\": 0,\n \"cleanup\": 0\n },\n \"writes\": {\n \"total\": 0,\n \"fail\": 0,\n \"success\": 0\n }\n },\n \"filebeat\": {\n \"input\": {\n \"netflow\": {\n \"flows\": 0,\n \"packets\": {\n \"received\": 0,\n \"dropped\": 0\n }\n },\n \"log\": {\n \"files\": {\n \"renamed\": 0,\n \"truncated\": 0\n }\n }\n },\n \"events\": {\n \"added\": 0,\n \"done\": 0,\n \"active\": 0\n },\n \"harvester\": {\n \"closed\": 0,\n \"running\": 0,\n \"open_files\": 0,\n \"skipped\": 0,\n \"started\": 0\n }\n },\n \"libbeat\": {\n \"pipeline\": {\n \"queue\": {\n \"max_events\": 4096,\n \"acked\": 0\n },\n \"clients\": 1,\n \"events\": {\n \"retry\": 0,\n \"active\": 0,\n \"total\": 0,\n \"filtered\": 0,\n \"published\": 0,\n \"failed\": 0,\n \"dropped\": 0\n }\n },\n \"config\": {\n \"module\": {\n \"running\": 0,\n \"starts\": 0,\n \"stops\": 0\n },\n \"scans\": 0,\n \"reloads\": 0\n },\n \"output\": {\n \"read\": {\n \"bytes\": 0,\n \"errors\": 0\n },\n \"type\": \"elasticsearch\",\n \"events\": {\n \"failed\": 0,\n \"dropped\": 0,\n \"duplicates\": 0,\n \"active\": 0,\n \"toomany\": 0,\n \"batches\": 0,\n \"total\": 0,\n \"acked\": 0\n },\n \"write\": {\n \"bytes\": 0,\n \"errors\": 0\n }\n }\n },\n \"beat\": {\n \"cpu\": {\n \"total\": {\n \"time\": {\n \"ms\": 1820\n },\n \"value\": 1820,\n \"ticks\": 1820\n },\n \"user\": {\n \"ticks\": 700,\n \"time\": {\n \"ms\": 700\n }\n },\n \"system\": {\n \"ticks\": 1120,\n \"time\": {\n \"ms\": 1120\n }\n }\n },\n \"runtime\": {\n \"goroutines\": 76\n },\n \"info\": {\n \"version\": \"8.4.3\",\n \"uptime\": {\n \"ms\": 180354\n },\n \"ephemeral_id\": \"68d58d94-3678-4cda-9759-7348b4354bd9\",\n \"name\": \"filebeat\"\n },\n \"cgroup\": {\n \"cpuacct\": {\n \"id\": \"/\",\n \"total\": {\n \"ns\": 1875475443\n }\n },\n \"memory\": {\n \"id\": \"/\",\n \"mem\": {\n \"usage\": {\n \"bytes\": 52338688\n },\n \"limit\": {\n \"bytes\": 9223372036854771712\n }\n }\n },\n \"cpu\": {\n \"id\": \"/\",\n \"cfs\": {\n \"quota\": {\n \"us\": 0\n },\n \"period\": {\n \"us\": 100000\n }\n },\n \"stats\": {\n \"periods\": 0,\n \"throttled\": {\n \"periods\": 0,\n \"ns\": 0\n }\n }\n }\n },\n \"handles\": {\n \"open\": 19,\n \"limit\": {\n \"soft\": 1048576,\n \"hard\": 1048576\n }\n },\n \"memstats\": {\n \"memory_sys\": 34161672,\n \"gc_next\": 20490032,\n \"rss\": 138559488,\n \"memory_total\": 67309832,\n \"memory_alloc\": 13446248\n }\n },\n \"system\": {\n \"cpu\": {\n \"cores\": 8\n },\n \"load\": {\n \"1\": 0.09,\n \"5\": 0.1,\n \"15\": 0.04,\n \"norm\": {\n \"15\": 0.005,\n \"1\": 0.0113,\n \"5\": 0.0125\n }\n }\n }\n },\n \"beat\": {\n \"uuid\": \"1fc0f527-3173-46be-a029-ca70c420c1c7\",\n \"type\": \"filebeat\",\n \"version\": \"8.4.3\",\n \"name\": \"ccf8bd83efd8\",\n \"host\": \"ccf8bd83efd8\"\n }\n}","service.name":"filebeat","ecs.version":"1.6.0"}
{"log.level":"debug","@timestamp":"2022-10-18T14:55:21.545Z","log.logger":"monitoring","log.origin":{"file.name":"memqueue/eventloop.go","file.line":197},"message":"handle ACKs: 1","service.name":"filebeat","ecs.version":"1.6.0"}
{"log.level":"debug","@timestamp":"2022-10-18T14:55:21.545Z","log.logger":"monitoring","log.origin":{"file.name":"memqueue/eventloop.go","file.line":216},"message":"try ack index: (idx=0, i=0, seq=0)\n","service.name":"filebeat","ecs.version":"1.6.0"}
{"log.level":"debug","@timestamp":"2022-10-18T14:55:21.546Z","log.logger":"monitoring","log.origin":{"file.name":"memqueue/eventloop.go","file.line":220},"message":"no state set","service.name":"filebeat","ecs.version":"1.6.0"}
{"log.level":"debug","@timestamp":"2022-10-18T14:55:21.547Z","log.logger":"monitoring","log.origin":{"file.name":"memqueue/eventloop.go","file.line":199},"message":"handle ACK took: 2.2609ms","service.name":"filebeat","ecs.version":"1.6.0"}
{"log.level":"debug","@timestamp":"2022-10-18T14:55:21.547Z","log.logger":"monitoring","log.origin":{"file.name":"memqueue/ackloop.go","file.line":95},"message":"ackloop: return ack to broker loop:1","service.name":"filebeat","ecs.version":"1.6.0"}
HOWEVER, when I go to Stack Monitoring in the ES cluster - I still see nothing - no .monitoring indices are create.... And in fact, I don't see anything at ll like what you showed....
This is what I get when go to Stack Monitoring:
I feel I am sooooo close and yet ....
Is this something in configuration or permissions missing, potentially?
Thank you !!