Hi, I just updated to Linux 6.13.6 (the linux package from Core), and after rebooting, the system froze on reaching the graphical interface. I had to hard-reset it. The next boot worked fine.
The following errors were logged from the failed boot:
[ 7.061322] mce: [Hardware Error]: Machine check events logged
[ 7.061324] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 3: be00000000800400
[ 7.061328] fbcon: Taking over console
[ 7.061330] mce: [Hardware Error]: TSC 0 MISC fffffff0
[ 7.061334] mce: [Hardware Error]: PROCESSOR 0:406f1 TIME 1741604680 SOCKET 0 APIC 0 microcode 0
[ 7.061338] mce: [Hardware Error]: Machine check events logged
[ 7.061338] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 4: fe00000000800400
[ 7.061341] mce: [Hardware Error]: TSC 0 ADDR ffffffff8b76e000 MISC ffffffff8b76e60d
[ 7.061344] mce: [Hardware Error]: PROCESSOR 0:406f1 TIME 1741604680 SOCKET 0 APIC 0 microcode 0
[ 7.061347] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 17: fe200000000c110a
[ 7.061350] mce: [Hardware Error]: TSC 0 ADDR ffffffc0 MISC e0fc385600402086
[ 7.061353] mce: [Hardware Error]: PROCESSOR 0:406f1 TIME 1741604680 SOCKET 0 APIC 0 microcode 0
[ 7.061355] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 18: fe200000000c110a
[ 7.061358] mce: [Hardware Error]: TSC 0 ADDR 383ffff03000 MISC 28fc381604402086
[ 7.061361] mce: [Hardware Error]: PROCESSOR 0:406f1 TIME 1741604680 SOCKET 0 APIC 0 microcode 0
[ 7.061363] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 19: fe200000000c110a
[ 7.061365] mce: [Hardware Error]: TSC 0 ADDR 80a18040 MISC b8ffa85605500086
[ 7.061369] mce: [Hardware Error]: PROCESSOR 0:406f1 TIME 1741604680 SOCKET 0 APIC 0 microcode 0
[ 7.061372] mce: [Hardware Error]: CPU 2: Machine Check: 0 Bank 3: be00000000800400
[ 7.061374] mce: [Hardware Error]: TSC 0 ADDR ffffffff8b76e000 MISC ffffffff8b76e60d PPIN ee14fa167541e6f5
[ 7.061378] mce: [Hardware Error]: PROCESSOR 0:406f1 TIME 1741604680 SOCKET 0 APIC 4 microcode b000040
[ 7.061381] mce: [Hardware Error]: CPU 9: Machine Check: 0 Bank 3: f200000000800400
[ 7.061383] mce: [Hardware Error]: TSC 0 PPIN ee14fa167541e6f5
[ 7.061386] mce: [Hardware Error]: PROCESSOR 0:406f1 TIME 1741604680 SOCKET 0 APIC 14 microcode b000040
Does anyone know what could be causing this?
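For anyone trying to read the status words in the log above, here is a minimal sketch of a decoder. It assumes the architectural IA32_MCi_STATUS layout from the Intel SDM (bit 63 VAL, 62 OVER, 61 UC, 60 EN, 59 MISCV, 58 ADDRV, 57 PCC, bits 15:0 MCA error code, bits 31:16 model-specific code). The function names and the regular expression are my own and not part of any tool; the meaning of the codes themselves is model-specific and is not interpreted here.

```python
import re

# Architectural flag bits of IA32_MCi_STATUS (assumed from the Intel SDM).
FLAGS = {
    63: "VAL",    # register contents are valid
    62: "OVER",   # an earlier error was overwritten
    61: "UC",     # uncorrected error
    60: "EN",     # error reporting was enabled
    59: "MISCV",  # MISC register holds valid data
    58: "ADDRV",  # ADDR register holds valid data
    57: "PCC",    # processor context may be corrupted
}

# Matches lines such as "CPU 0: Machine Check: 0 Bank 3: be00000000800400".
LINE_RE = re.compile(r"CPU (\d+): Machine Check: \d+ Bank (\d+): ([0-9a-f]{16})")


def decode_status(status):
    """Split a 64-bit status word into set flags and the two code fields."""
    flags = [name for bit, name in sorted(FLAGS.items(), reverse=True)
             if (status >> bit) & 1]
    return {
        "flags": flags,
        "mca_code": status & 0xFFFF,            # bits 15:0
        "model_code": (status >> 16) & 0xFFFF,  # bits 31:16
    }


def parse_line(line):
    """Return (cpu, bank, decoded status) for a dmesg line, or None."""
    match = LINE_RE.search(line)
    if match is None:
        return None
    cpu, bank = int(match.group(1)), int(match.group(2))
    return cpu, bank, decode_status(int(match.group(3), 16))


if __name__ == "__main__":
    sample = "[ 7.061324] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 3: be00000000800400"
    print(parse_line(sample))
```

Feeding it the Bank 3 line from CPU 0 shows that the entry is flagged valid, uncorrected, and with PCC set, while the CPU 9 entry (f200000000800400) additionally has the overflow bit set and no valid ADDR or MISC.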
Last edited by OpusOne (2025-03-10 11:43:58)
This kind of issue is really hard to debug if you cannot reproduce it. Does it come up again if you downgrade to 6.13.5, reboot twice, then upgrade to 6.13.6 and reboot again?
Yep, I know it is. It may well be unrelated to kernel 6.13.6. I have had this kind of mce error before, but that was at least a year ago. Back then it was intermittent, it always happened on resume rather than on boot, and it came with a "green screen" (AMD GPU here). After some kernel update (I don't remember which one), it never happened again.
So this is the first time it has happened in a long while, and this kernel update may have nothing to do with it. It's very hard to tell, and I can't reproduce it.
There's one thing I do with this machine that I wonder might be a cause: I almost never shut it down. I use suspend/resume all the time and reboot when updating the kernel, but I never (or extremely rarely) power the machine off. I wonder whether this could eventually trigger some BIOS bug. But again, it hadn't happened in months, so I don't know.
Last edited by OpusOne (2025-03-10 14:14:09)