We run Metricbeat on several of our machines and collect its run log every 10 minutes.
After checking the log at /etc/log/metricbeat, we are puzzled, and we cannot find any documentation about it.
One log record looks like this:
fetches.system-core.events=40 fetches.system-core.success=10 fetches.system-cpu.events=10 fetches.system-cpu.success=10 fetches.system-diskio.events=12 fetches.system-diskio.success=2 fetches.system-load.events=10 fetches.system-load.success=10 fetches.system-memory.events=10 fetches.system-memory.success=10 fetches.system-network.events=710 fetches.system-network.success=10 fetches.system-process.events=58 fetches.system-process.success=2 libbeat.es.call_count.PublishEvents=22 libbeat.es.publish.read_bytes=14517 libbeat.es.publish.write_bytes=570384 libbeat.es.published_and_acked_events=850 libbeat.publisher.messages_in_worker_queues=850 libbeat.publisher.published_events=850
I guess the "fetches" counters come from the collector, and libbeat is the transport.
Take "fetches.system-core.events=40" and "fetches.system-core.success=10".
Does this mean that 30 events failed to transfer to Elasticsearch?
If they failed, where can I find the data, or was it dropped?
But doesn't "libbeat.publisher.published_events=850" mean that all of the events were transferred successfully?
fetches.system-core.events counts the events fetched from the system. The number of successful fetches is the value of fetches.system-core.success. The value of fetches.system-core.error shows how many fetches failed. Failed fetches are not forwarded to ES, because there is nothing to forward when an error occurs. You cannot find that data, because Metricbeat failed to fetch it from the system.
libbeat.publisher.published_events=850 means that 850 events were published. At that point they are not yet acknowledged by the output, so it is unknown whether the events made it to the output successfully. libbeat.es.published_and_acked_events=850 shows that all 850 events were acknowledged, meaning that they made it to ES successfully.
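To make the published versus acknowledged comparison concrete, here is a minimal Python sketch. It assumes the record is a space-separated list of key=value pairs, as in the record quoted in the question; the helper name parse_counters is hypothetical and not part of Metricbeat.

```python
# Counters copied from the log record quoted in the question.
record = (
    "libbeat.es.published_and_acked_events=850 "
    "libbeat.publisher.messages_in_worker_queues=850 "
    "libbeat.publisher.published_events=850"
)


def parse_counters(line):
    """Split a space-separated 'key=value' record into a dict of ints.

    Hypothetical helper: assumes every token has the form key=integer.
    """
    counters = {}
    for token in line.split():
        key, _, value = token.partition("=")
        counters[key] = int(value)
    return counters


counters = parse_counters(record)
published = counters["libbeat.publisher.published_events"]
acked = counters["libbeat.es.published_and_acked_events"]

# Events that were published but not (yet) acknowledged by the output.
unacked = published - acked
print(f"published={published} acked={acked} unacknowledged={unacked}")
```

For this record the difference is zero, which matches the reading that every published event was acknowledged by ES.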
@kvch
Thanks so much. I have learned the meaning of each parameter, but why do so many events fail to fetch, and how can I find out why? Could you give me some suggestions?
I don't think there is an error in your Metricbeat or on your host. The sum of fetches.*.events equals the value of libbeat.es.published_and_acked_events (both are 850). If you don't have a log line that includes fetches.system-core.errors, it means that the value is 0, so there were no errors.
If you don't see an error field in your Metricbeat events, it is working correctly.
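The sum check described above can be reproduced with a short Python sketch. It assumes the record format from the question (space-separated key=value pairs); the helper name parse_counters is hypothetical. It also divides events by successful fetches, which suggests that one fetch can produce several events (for example, one per core).

```python
# The full log record quoted in the question.
record = (
    "fetches.system-core.events=40 fetches.system-core.success=10 "
    "fetches.system-cpu.events=10 fetches.system-cpu.success=10 "
    "fetches.system-diskio.events=12 fetches.system-diskio.success=2 "
    "fetches.system-load.events=10 fetches.system-load.success=10 "
    "fetches.system-memory.events=10 fetches.system-memory.success=10 "
    "fetches.system-network.events=710 fetches.system-network.success=10 "
    "fetches.system-process.events=58 fetches.system-process.success=2 "
    "libbeat.es.call_count.PublishEvents=22 "
    "libbeat.es.publish.read_bytes=14517 "
    "libbeat.es.publish.write_bytes=570384 "
    "libbeat.es.published_and_acked_events=850 "
    "libbeat.publisher.messages_in_worker_queues=850 "
    "libbeat.publisher.published_events=850"
)


def parse_counters(line):
    """Hypothetical helper: turn 'key=value' tokens into a dict of ints."""
    return {key: int(value)
            for key, _, value in (tok.partition("=") for tok in line.split())}


counters = parse_counters(record)

# Sum every fetches.<metricset>.events counter.
total_events = sum(
    value for key, value in counters.items()
    if key.startswith("fetches.") and key.endswith(".events")
)
acked = counters["libbeat.es.published_and_acked_events"]
print(f"fetched events={total_events} acked events={acked}")

# Events produced per successful fetch, per metricset.
events_per_fetch = {}
for key, events in counters.items():
    if key.startswith("fetches.") and key.endswith(".events"):
        metricset = key[len("fetches."):-len(".events")]
        success = counters[f"fetches.{metricset}.success"]
        events_per_fetch[metricset] = events / success
print(events_per_fetch)
```

With this record, the fetched total and the acknowledged total are both 850, and system-core works out to 4 events per successful fetch, so the gap between events=40 and success=10 is not a count of failures.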