Auditbeat 8.10.2 caused several physical servers to crash

kpeterson-pmts · November 3, 2023, 4:16pm

We upgraded auditbeats from version 8.6.2 to version 8.10.2 using automation tooling. The upgrade was first tested on some of our EC2 instances, and had no issues. When we applied the upgrade to a subset of physical hosts in the datacenter, they ALL crashed and rebooted. All host are running CentOS 7, kernel version 3.10.0-1160.99.1.el7.x86_64 on Dell PowerEdge R430 servers. The host would reboot, come up to a login prompt, remain up for about a minute, and then crash and reboot again. A check of the syslog showed NO messages to indicate root cause, there are no messages in syslog between the end of the last boot and the journal message from the start of the next boot. To fix the issue, we had to disable the auditbeats service with systemctl, and wait for the server to crash and reboot one last time, then revert the upgrade.

A few of the hosts were set to prompt on error, and the BIOS reported that the system had been rebooted due to a watchdog timeout.

We would like to know the root cause of this issue.

Tchapou · November 28, 2023, 2:03pm

Hello, did you find a solution for your issue ? We have a similar problem on multiple Dell R650 servers with Debian 11. After starting auditbeat, the watchdog timer reboot constantly the server

system · December 26, 2023, 4:04pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Auditbeat: system/socket dataset setup failed - guess_inet_sock failed: timeout while waiting Beats auditbeat	1	218	December 21, 2023
Auditbeat not work in Ubuntu 22.04 after a reboot or shutdown Beats auditbeat	6	17	November 1, 2024
Auditbeat randomly shuts down Beats auditbeat	24	1866	November 4, 2022
Auditbeat crashes few seconds after start Beats auditbeat	4	638	June 13, 2019
Auditbeat memory leak Beats auditbeat	6	1978	January 20, 2021

Auditbeat 8.10.2 caused several physical servers to crash

Related topics