...
Customers may encounter fatal bus errors for PCI1360 and PCI1318. When the system is working properly.The life cycle log shows as following: 2024-07-10 05:14:39 61 PCI1360 A bus fatal error was detected on a component at slot 1. 2024-07-10 05:14:42 62 PCI1318 A fatal error was detected on a component at bus 8 device 0 function 0. 2024-07-10 05:14:43 63 NIC100 The NIC in Slot 1 Port 1 network link is down. 2024-07-10 05:14:43 64 PCI1318 A fatal error was detected on a component at bus 7 device 5 function 0. 2024-07-10 05:14:46 66 PST0209 System BIOS has halted due to Non-Maskable Interrupt (NMI). 2024-07-10 05:48:04 92 NIC100 The NIC in Slot 1 Port 1 network link is down. 2024-07-10 05:48:20 94 NIC100 The NIC in Slot 1 Port 1 network link is down. 2024-07-13 01:59:23 115 PCI1360 A bus fatal error was detected on a component at slot 1. 2024-07-13 01:59:25 116 PCI1318 A fatal error was detected on a component at bus 8 device 0 function 0. 2024-07-13 01:59:29 119 PCI1318 A fatal error was detected on a component at bus 7 device 5 function 0. 2024-07-13 01:59:31 121 PCI1360 A bus fatal error was detected on a component at slot 1. 2024-07-13 01:59:31 122 PCI1318 A fatal error was detected on a component at bus 8 device 0 function 0. 2024-07-16 22:48:49 198 PCI1360 A bus fatal error was detected on a component at slot 1. 2024-07-16 22:48:50 199 PCI1318 A fatal error was detected on a component at bus 8 device 0 function 0. 2024-07-16 22:48:56 203 PCI1360 A bus fatal error was detected on a component at slot 1. 2024-07-16 22:48:57 204 PCI1318 A fatal error was detected on a component at bus 8 device 0 function 0. 2024-07-16 22:49:01 208 PCI1318 A fatal error was detected on a component at bus 7 device 5 function 0. 2024-07-17 00:23:42 236 SEC0033 The chassis is open while the power is off. 2024-07-17 00:32:30 243 PCI1360 A bus fatal error was detected on a component at slot 1. 2024-07-17 00:32:32 244 PCI1318 A fatal error was detected on a component at bus 8 device 0 function 0. 2024-07-17 00:32:34 247 PCI1318 A fatal error was detected on a component at bus 7 device 5 function 0. 2024-07-17 00:32:37 249 PCI1360 A bus fatal error was detected on a component at slot 1. 2024-07-17 00:32:38 250 PCI1318 A fatal error was detected on a component at bus 8 device 0 function 0. 2024-07-17 00:32:47 254 CPU0704 CPU 4 machine check error detected. 2024-07-17 00:33:08 268 CPU0704 CPU 1 machine check error detected. There is an error in the booting figure as following which is point to slot1 BCM5719 network card: Troubleshooting: BIOS and iDRAC firmware is the latest version.Customer have multiple units servers, but only this server has bus fatal error. Perform an AC power-cycle and, and the problem remains.Reseat the Broadcom NetXtreme BCM5719, issue remains.
There is a BCM5719 network card occurred unstable issue.
Follow the recommended actions in the event itself.Do one or all the steps below: Make sure the device, BIOS, and iDRAC are updated to the latest firmware.If batch issue happened. Update the OS version for patches or hotfixes. Update the device drivers.Perform an AC power-cycle.If possible, Make sure that the cables are properly routed and connected. Remove and reinstall the device.Disable the slot1 in BIOS or the best way is swap other PCIe slot for testing, if the issue follows the Broadcom NetXtreme BCM5719. You may want to consider replacing this network card.