You are not logged in.

#1 2021-06-27 16:14:09

roti
Member
Registered: 2011-08-07
Posts: 16

Hardware Error

Hi,

I have following messages in syslog:

mce: [Hardware Error]: Machine check events logged
mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 4: a600000000020408
mce: [Hardware Error]: TSC 0 ADDR fef13b80
mce: [Hardware Error]: PROCESSOR 0:506c9 TIME 1624802551 SOCKET 0 APIC 0 microcode 1e


I installed rasdaemon but unfortunately it starts recording mce errors after the error occurs, so that ras-mc-ctl does not report any errors. Any ideas how to find out what's wrong?  (From what I noticed, the error shows up only at boot).


Razvan

Offline

#2 2021-06-27 16:35:34

V1del
Forum Moderator
Registered: 2012-10-16
Posts: 25,104

Re: Hardware Error

TSC related issues could be due to outdated microcodes, make sure your microcode is setup properly: https://wiki.archlinux.org/title/Microc … ly_loading

Offline

#3 2021-06-29 04:38:48

Ropid
Member
Registered: 2015-03-09
Posts: 1,069

Re: Hardware Error

This is an event that happens very early at boot, before the OS is started. This seems to happen for many people without the hardware showing any problems when in use in the OS later.

You can hide this first event with "mce=nobootlog" on the kernel command line.

There's something interesting mentioned about this issue in the kernel doc file "boot-options.rst", see here:

   mce=bootlog
                Enable logging of machine checks left over from booting.
                Disabled by default on AMD Fam10h and older because some BIOS
                leave bogus ones.
                If your BIOS doesn't do that it's a good idea to enable though
                to make sure you log even machine check events that result
                in a reboot. On Intel systems it is enabled by default.
   mce=nobootlog
                Disable boot machine check logging.

The interesting part is that the kernel already sets "mce=nobootlog" by itself for certain systems.

Offline

Board footer

Powered by FluxBB