You are not logged in.

#1 2026-01-24 22:51:35

Fizzfish52
Member
Registered: 2026-01-24
Posts: 1

kernel [Hardware Error] in Journal log

Kernel: Linux 6.18.6-arch1-1
CPU: 5800x3D -30 all core (PPT 127, TDC 80, EDC 125 and everything else related is on Auto, used this for 3 years now)

This is the second time I got this error without a single trace, the first was a month ago and something similar to this (or exactly the same I dont remember sadly). If I dont check journal I wouldnt even know about it. No crashes, no instability in my everyday usage, no kernel panics or freezes, its like a phantom or something. I stress tested with a lot of stuff since I have this rig, from mprime to linpack, you can name it, they always passed. Every memtest passed for 10 hours, and I ran like 20 since I built this pc. I'm no expert but I would say its stable since I rock with this particular PBO/XMP combo for 3 years now, never had a single problem with it. I can game, render, compile, edit, benchmark or whatever without crashes or freezes. Temps arent and never were a problem, it has a 360mm AIO and great airflow, I take care of the dust too.

Jan 23 12:04:45 archlinux kernel: [Hardware Error]: Corrected error, no action required.
Jan 23 12:04:45 archlinux kernel: [Hardware Error]: CPU:1 (19:21:2) MC25_STATUS[-|CE|-|-|PCC|SyndV|UECC|-|Poison|-]: 0x936d28e0
c0c749ff
Jan 23 12:04:45 archlinux kernel: [Hardware Error]: IPID: 0x0000000000000000, Syndrome: 0x0000000000000000
Jan 23 12:04:45 archlinux kernel: [Hardware Error]: Bank 25 is reserved.
Jan 23 12:04:45 archlinux kernel: [Hardware Error]: cache level: L3/GEN, tx: RESV


If I'm not mistaken, this could be related to RAM more than CPU? I run nothing crazy, just 3200mhz on XMP because it passed every memtest. I cant really find relevant info on this, everyone says something different but from what I've gathered so far, these errors should be followed by crashes right? I got no crashes and I dont really know what causes this because this error showed up randomly when I let it run overnight and the PC was completely idle with only Firefox opened with a single tab.

I really hope its not a hardware issue because that would stress me out (and I'm broke). So I can think of a few things that could be related to this maybe:
1. I updated my BIOS to the latest stable version a month ago, this could be a coincidence but the first error showed up around the same time. (3.90 on ASrock B550 PG4)
2. Could be that the 6.18 kernel line is not friendly with this BIOS or something just buggy? I havent noticed this error before 6.18, but I could be wrong.
3. Something changed in voltages? Or do I need to change some voltages? This is an older AM4 mobo so the BIOS updates only patched security stuff, I dont think it messed up voltages.
Also I upgraded my PSU and GPU just 2 days after I got the first error, so I think I can rule out those.

Is this ghost chasing/overthinking? I would appreciate some insights on this. Thank you!!
I will try to reproduce it by letting the pc run overnight again.

Offline

#2 2026-01-25 02:21:14

Ryexa
Member
Registered: 2026-01-19
Posts: 2

Re: kernel [Hardware Error] in Journal log

I have a similar problem that’s been bothering me for a few days, but in my case I’m getting random crashes and reboots, and some kernel panics (blue screen, and the QR code doesn’t display properly).

This is what the journal shows me:

ene 24 20:40:26 archlinux kernel: [Hardware Error]: Corrected error, no action required.
ene 24 20:40:26 archlinux kernel: [Hardware Error]: CPU:2 (15:65:1) MC0_STATUS[-|CE|MiscV|AddrV|-|-|-]: 0x9c07000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Error Addr: 0x0000000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: MC0 Error:
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Corrupted MC0 MCE info?
ene 24 20:40:26 archlinux kernel: [Hardware Error]: cache level: RESV, tx: INSN
ene 24 20:40:26 archlinux kernel: [Hardware Error]: System Fatal error.
ene 24 20:40:26 archlinux kernel: [Hardware Error]: CPU:2 (15:65:1) MC1_STATUS[Over|UE|MiscV|-|PCC|-|-]: 0xfa94000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: MC1 Error:
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Corrupted MC1 MCE info?
ene 24 20:40:26 archlinux kernel: [Hardware Error]: cache level: RESV, tx: INSN
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Corrected error, no action required.
ene 24 20:40:26 archlinux kernel: [Hardware Error]: CPU:2 (15:65:1) MC2_STATUS[-|CE|MiscV|AddrV|-|-|-]: 0x9c07000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Error Addr: 0x0000000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: MC2 Error:
ene 24 20:40:26 archlinux kernel: [Hardware Error]: cache level: RESV, tx: INSN
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Corrected error, no action required.
ene 24 20:40:26 archlinux kernel: [Hardware Error]: CPU:3 (15:65:1) MC0_STATUS[-|CE|MiscV|AddrV|-|-|-]: 0x9c07000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Error Addr: 0x0000000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: MC0 Error:
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Corrupted MC0 MCE info?
ene 24 20:40:26 archlinux kernel: [Hardware Error]: cache level: RESV, tx: INSN
ene 24 20:40:26 archlinux kernel: [Hardware Error]: System Fatal error.
ene 24 20:40:26 archlinux kernel: [Hardware Error]: CPU:3 (15:65:1) MC1_STATUS[-|UE|MiscV|AddrV|PCC|-|-]: 0xbfc0000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Error Addr: 0xbfc0000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: MC1 Error:
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Corrupted MC1 MCE info?
ene 24 20:40:26 archlinux kernel: [Hardware Error]: cache level: RESV, tx: INSN
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Corrected error, no action required.
ene 24 20:40:26 archlinux kernel: [Hardware Error]: CPU:3 (15:65:1) MC2_STATUS[-|CE|MiscV|AddrV|-|-|-]: 0x9c07000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: Error Addr: 0x0000000000000000
ene 24 20:40:26 archlinux kernel: [Hardware Error]: MC2 Error:
ene 24 20:40:26 archlinux kernel: [Hardware Error]: cache level: RESV, tx: INSN

and about 72 lines of GPU (integrated) spam:

Jan 24 20:40:59 'hostname' kernel: amdgpu: smu8_send_msg_to_smc_with_parameter(0x0004) aborted; SMU still servicing msg (0x0009)

A few days ago I posted something about this, but then I saw there was a similar thread, although the issue wasn’t exactly the same… I kept an eye on what people were posting there, but it’s not useful for me.

Sometimes I can go hours without any issues, and then it suddenly reboots/crashes randomly (it usually reboots).

Offline

#3 2026-01-25 09:22:02

seth
Member
From: Don't DM me only for attention
Registered: 2012-09-03
Posts: 73,309

Offline

Board footer

Powered by FluxBB