You are not logged in.

#1 2024-03-18 04:50:06

curlywei
Member
Registered: 2022-05-30
Posts: 10

[SOLVED] Fatal error after update system: [Hardware Error] MC5_STATUS

Mar 18 12:28:55 lab521-amd-1 kernel: [Hardware Error]: System Fatal error.
Mar 18 12:28:55 lab521-amd-1 kernel: [Hardware Error]: CPU:22 (17:71:0) MC5_STATUS[-|UE|MiscV|AddrV|PCC|TCC|SyndV|-|-|-]: 0xbea0000000000108
Mar 18 12:28:55 lab521-amd-1 kernel: [Hardware Error]: Error Addr: 0x0000780342423ab8
Mar 18 12:28:55 lab521-amd-1 kernel: [Hardware Error]: IPID: 0x000500b000000000, Syndrome: 0x000000004d000000
Mar 18 12:28:55 lab521-amd-1 kernel: [Hardware Error]: Execution Unit Ext. Error Code: 0
Mar 18 12:28:55 lab521-amd-1 kernel: [Hardware Error]: cache level: RESV, tx: GEN, mem-tx: GEN
Mar 18 12:28:56 lab521-amd-1 (udev-worker)[544]: event7: Failed to call EVIOCSKEYCODE with scan code 0x7c, and key code 190: Invalid argument

Here list are I upgrade today

alsa-card-profiles amd-ucode iana-etc aom libldap sqlite libunistring e2fsprogs audit pcre2 ca-certificates-mozilla gnupg icu libxml2 expat pacman archlinux-keyring shadow at-spi2-core bluez bluez-libs containerd iproute2 docker efibootmgr nss harfbuzz librsvg libunwind mesa electron28 electron29 erlang-nox fcitx5-qt fzf libpaper ghostscript gtkmm3 harfbuzz-icu net-snmp hplip libva kicad kicad-library kicad-library-3d libopenmpt sane libksane qt5-translations qt5-base qt5-declarative qt5-multimedia qt5-speech qt5-wayland qt5-x11extras qt5-svg kolourpaint lib32-libxml2 lib32-pcre2 lib32-at-spi2-core lib32-expat lib32-harfbuzz lib32-libunistring lib32-mesa lib32-sqlite libpipewire libreoffice-fresh libva-mesa-driver mkinitcpio linux linux-firmware-whence linux-firmware linux-headers lsof meson openssh pacman-contrib pacutils pipewire pkgconf procps-ng python-shiboken2 pyside2 python-async-timeout python-babel python-beautifulsoup4 python-fonttools python-pyparsing python-google-api-core python-google-api-python-client python-lxml python-more-itertools python-platformdirs python-pluggy python-pydantic python-ruamel-yaml python-sqlparse python-wheel qt5-script qt5-xmlpatterns qt5-tools qcad qt5-3d qt5-charts qt5-connectivity qt5-datavis3d qt5-doc qt5-examples qt5-gamepad qt5-graphicaleffects qt5-imageformats qt5-location qt5-lottie qt5-networkauth qt5-purchasing qt5-quick3d qt5-quickcontrols qt5-quickcontrols2 qt5-quicktimeline qt5-remoteobjects qt5-scxml qt5-sensors qt5-serialport qt5-serialbus qt5-virtualkeyboard qt5-webchannel qt5-webengine qt5-websockets qt5-webglplugin qt5-webview run-parts rustup zsh yay python-mock drawio-desktop brave-bin visual-studio-code-bin

* My kernel version: 6.8.1.arch1-1
* Kernel firmware version: 20240312.3b128b60-1
* Systemd version: 255.4-2

Does anyone encounter and/or solved this problem?

Last edited by curlywei (2024-03-26 10:28:15)

Offline

#2 2024-03-18 06:03:41

agapito
Member
From: Who cares.
Registered: 2008-11-13
Posts: 666

Re: [SOLVED] Fatal error after update system: [Hardware Error] MC5_STATUS


Excuse my poor English.

Offline

#3 2024-03-19 01:07:55

marcs
Member
From: Italy
Registered: 2007-09-07
Posts: 63

Re: [SOLVED] Fatal error after update system: [Hardware Error] MC5_STATUS

I got a random reboot with a similar error message after reboot:

mar 19 01:56:46 orion kernel: microcode: Current revision: 0x08701030
mar 19 01:56:46 orion kernel: mce: [Hardware Error]: Machine check events logged
mar 19 01:56:46 orion kernel: [Hardware Error]: Uncorrected, software containable error.
mar 19 01:56:46 orion kernel: fbcon: Taking over console
mar 19 01:56:46 orion kernel: [Hardware Error]: CPU:14 (17:71:0) MC1_STATUS[Over|UE|MiscV|AddrV|-|TCC|-|-|Poison|-]: 0xfc800800000c0859
mar 19 01:56:46 orion kernel: [Hardware Error]: Error Addr: 0x00000018c8db2200
mar 19 01:56:46 orion kernel: [Hardware Error]: IPID: 0x000100b000000000
mar 19 01:56:46 orion kernel: [Hardware Error]: Instruction Fetch Unit Ext. Error Code: 12
mar 19 01:56:46 orion kernel: [Hardware Error]: cache level: L1, mem/io: IO, mem-tx: IRD, part-proc: SRC (no timeout)
mar 19 01:56:46 orion kernel: mce: [Hardware Error]: Machine check events logged
mar 19 01:56:46 orion kernel: [Hardware Error]: System Fatal error.
mar 19 01:56:46 orion kernel: [Hardware Error]: CPU:14 (17:71:0) MC5_STATUS[-|UE|MiscV|AddrV|PCC|TCC|SyndV|-|-|-]: 0xbea0000000000108
mar 19 01:56:46 orion kernel: [Hardware Error]: Error Addr: 0x0001ffffa2db1ff2
mar 19 01:56:46 orion kernel: [Hardware Error]: IPID: 0x000500b000000000, Syndrome: 0x000000004d000000
mar 19 01:56:46 orion kernel: [Hardware Error]: Execution Unit Ext. Error Code: 0
mar 19 01:56:46 orion kernel: [Hardware Error]: cache level: RESV, tx: GEN, mem-tx: GEN

kernel:  6.8.1.arch1-1
firmware: 20240312.3b128b60-1
systemd: 255.4-2

To answer to the link agapito posted:
I haven't undervolted my CPU, CPU's parameter on UEFI are at default with XMP/DOCP enabled for RAM.

Offline

#4 2024-03-19 02:00:27

agapito
Member
From: Who cares.
Registered: 2008-11-13
Posts: 666

Re: [SOLVED] Fatal error after update system: [Hardware Error] MC5_STATUS

marcs wrote:

To answer to the link agapito posted:
I haven't undervolted my CPU, CPU's parameter on UEFI are at default with XMP/DOCP enabled for RAM.

if you read my previous message in that same link I said this: I had random reboots in all the Zen 3 processors I have used on my motherboard, using the stock-auto settings. After many hours of testing and calibration using CoreCycler and Curve Optimizer I have not had any in almost 2 years.


Excuse my poor English.

Offline

#5 2024-03-19 13:22:02

marcs
Member
From: Italy
Registered: 2007-09-07
Posts: 63

Re: [SOLVED] Fatal error after update system: [Hardware Error] MC5_STATUS

agapito wrote:
marcs wrote:

To answer to the link agapito posted:
I haven't undervolted my CPU, CPU's parameter on UEFI are at default with XMP/DOCP enabled for RAM.

if you read my previous message in that same link I said this: I had random reboots in all the Zen 3 processors I have used on my motherboard, using the stock-auto settings. After many hours of testing and calibration using CoreCycler and Curve Optimizer I have not had any in almost 2 years.

I missed that. I have a Zen 2 processor and never happened to me until now, I got a R9 3900X.

Offline

#6 2024-03-20 06:31:51

agapito
Member
From: Who cares.
Registered: 2008-11-13
Posts: 666

Re: [SOLVED] Fatal error after update system: [Hardware Error] MC5_STATUS

marcs wrote:
agapito wrote:
marcs wrote:

To answer to the link agapito posted:
I haven't undervolted my CPU, CPU's parameter on UEFI are at default with XMP/DOCP enabled for RAM.

if you read my previous message in that same link I said this: I had random reboots in all the Zen 3 processors I have used on my motherboard, using the stock-auto settings. After many hours of testing and calibration using CoreCycler and Curve Optimizer I have not had any in almost 2 years.

I missed that. I have a Zen 2 processor and never happened to me until now, I got a R9 3900X.

Have you tested your memory for many hours with a program like mprime while your graphics card was working at 100%?

If we rule out memory problems, I'm afraid your processor has degraded, which means it needs more voltage than before. If I am correct, the solution is to apply a positive voltage offset (+0,125) to the entire CPU, since in Zen2 there is no Curve Optimizer.

I get the feeling that modern processors are not built to last for many years.


Excuse my poor English.

Offline

#7 2024-03-20 09:25:45

marcs
Member
From: Italy
Registered: 2007-09-07
Posts: 63

Re: [SOLVED] Fatal error after update system: [Hardware Error] MC5_STATUS

agapito wrote:

Have you tested your memory for many hours with a program like mprime while your graphics card was working at 100%?

If we rule out memory problems, I'm afraid your processor has degraded, which means it needs more voltage than before. If I am correct, the solution is to apply a positive voltage offset (+0,125) to the entire CPU, since in Zen2 there is no Curve Optimizer.

I get the feeling that modern processors are not built to last for many years.

I haven't tested my memory with mprime for many hours, but I do use Linux only on this machine for many hours a day, building from source quite often (ramping up all 24 threads at 100%), I use multiple VMs, so it got some real-life testing from daily usage but I never used mprime for an extensive test. This reboot happened not long after using kernel 6.8.1 after upgrading from kernel 6.7, but it may be circumstantial.

Last edited by marcs (2024-03-20 09:26:12)

Offline

#8 2024-03-20 12:39:26

agapito
Member
From: Who cares.
Registered: 2008-11-13
Posts: 666

Re: [SOLVED] Fatal error after update system: [Hardware Error] MC5_STATUS

marcs wrote:
agapito wrote:

Have you tested your memory for many hours with a program like mprime while your graphics card was working at 100%?

If we rule out memory problems, I'm afraid your processor has degraded, which means it needs more voltage than before. If I am correct, the solution is to apply a positive voltage offset (+0,125) to the entire CPU, since in Zen2 there is no Curve Optimizer.

I get the feeling that modern processors are not built to last for many years.

I haven't tested my memory with mprime for many hours, but I do use Linux only on this machine for many hours a day, building from source quite often (ramping up all 24 threads at 100%), I use multiple VMs, so it got some real-life testing from daily usage but I never used mprime for an extensive test. This reboot happened not long after using kernel 6.8.1 after upgrading from kernel 6.7, but it may be circumstantial.

I had that same error in the past and I also found a way to reproduce it on my PC 100% of the time, that is why I am so sure it is due to the lack of voltage. Even the log warns you that you are dealing with a hardware error. Unfortunately by the looks of it, your CPU has degraded over time and now requires a bit more voltage than before. Or maybe you upgraded to a new bios recently and the board sends less voltage than before to the CPU. In any case the solution is the same, you must increase the voltage to your CPU.


Excuse my poor English.

Offline

#9 2024-03-25 09:57:32

curlywei
Member
Registered: 2022-05-30
Posts: 10

Re: [SOLVED] Fatal error after update system: [Hardware Error] MC5_STATUS

Hi everyone:
Sorry that I reply to late.
I turn off C-state and PBO from my MB.
Now my system is no problem.

It's very strange.
I kept PBO and C status on for a few years before that.
The problem didn't occur until I recently upgraded my operating system.
My hardwares:
CPU: AMD 3900x
MB: ASUS ROG Crosshair VIII Hero (WI-FI)
VGA: AMD RX570

Last edited by curlywei (2024-03-25 12:04:35)

Offline

Board footer

Powered by FluxBB