Hello, I've been trying to figure this out for a while now.. and initially I thought it had to do with my beats agent no matching my ELK version, but I updated everything today and the problem persists.
On the Kibana dashboard for Metricbeat, there is a visualization called "Top Hosts By CPU (Realtime) [Metricbeat System] ECS" which is a bar chart. I have data in the appropriate fields from my metricbeat agents for all hosts, but some of the hosts don't show up on the chart itself. Some will get a % while some get a dash, some have the bar and some don't I can't figure it out because there are no errors or fields missing as far as I can tell.
If I put in like 6 hours back to "now".. the bars populate, but not with accurate values from the recent records. Do I have a timezone problem somewhere?
Set Kibana to use UTC in Advanced Settings (it was browser)
Checked the timestamps in metricbeat-* index
The devices all have the same time in UTC, the @timestamp field is in UTC in the index.. but, the problem still persists.. it's really odd, cuz if you look at the screenshot I posted.. the percentages are there, but the bars don't appear.
Hey Stephen.. sorry for the late response.. I'm actually in an Elasticsearch engineering class that's eating up a bit of time
I did click on the host overview for one with a dash and one without a dash and got some data missing for the one's with a dash as you might expect. What's interesting is that all the hosts with dashes are Windows and the others are *nix boxes.
Is there maybe something wrong with my beats config that's causing this? Nothing has changed recently.. but, this problem started a few versions back. I'm on version 7.13.1 across the board, but this has definitely been happening since before 7.13.0 for sure.
It looks like the value the first visualization (Top Hosts By CPU (Realtime)) depends on is system.cpu.user.pct some records in the metricbeat-* index have the value, and some don't.. but even the Windows records have the mix...
Very confusing.. even if there is missing data for Windows.. why would that cause the bars to not render?
I think you're overthinking it I think it's a bug in the visualization and apologies but I don't really have time to track it down right now. If I remember it had something to do with sorting or something like that.
Make a different visualization it's good practice going to lens make your own.
If I were you I wouldn't get hung up on that create a new visualization learn lens go into tsvb look at it closely that dash isn't worth losing days over.
In all the beats you have to look at the metricsets within the beats they'll be Network, system process metricsets etc... The CPUs metrics will be in the system ones not in the network. So there's different types of data based on the metricset.
Take a look at the common fields in the metricset.name and you will start to get it.
Heh.. I hear ya on the time thing, I appreciate the leads though..
Interestingly, I added a "1h" value in the "Offset series time by (1m, 1h, 1w, 1d)" of that chart.. and it started working, so I'm going to try and figure out what "Offset series time by (1m, 1h, 1w, 1d)" actually represents and why that restored the visualization. I'm sure that the data is no longer representative of what it was intended to represent now that I've changed the parameters, so I just need to understand what that parameter actually does.. lol
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.