#1 2014-03-29 13:45:28

Johaug
Member
Registered: 2013-06-15
Posts: 13

Machine Check Exception at shutdown, system freezes before login, etc.

Lately I have experienced a couple of problems with my system, namely:
1. some seconds after I reach the login screen (I'm not running any display manager), the system freezes for a few seconds
2. I am unable to wake the machine from suspend (pm-suspend). It apparently hangs with no output to the screen.
3. after initiating a shutdown (poweroff), I get this output, before the system reboots:

Unmounting /oldroot.
Unmounting /oldroot/run.
Unmounting /oldroot/dev.
Unmounting /oldroot/sys.
Unmounting /oldroot/proc.
Unmounting /oldroot.
All filesystems unmounted.
Deactivating swaps.
All swaps deactivated.
Detaching loop devices.
All loop devices detached.
Detaching DM devices.
All DM devices detached.
Storage is finalized.
Powering off.
[  124.333684] mce: [Hardware Error]: CPU 6: Machine Check Exception: 4 Bank 5: be00000000800400
[  124.333735] mce: [Hardware Error]: TSC 60b975ee8b ADDR 3fff81406095 MISC 7fff
[  124.333795] mce: [Hardware Error]: PROCESSOR 0:106a4 TIME 1395783630 SOCKET 0 APIC 5 microcode 10
[  124.333845] mce: [Hardware Error]: Run the above through 'mcelog --ascii'
[  124.333886] mce: [Hardware Error]: CPU 2: Machine Check Exception: 4 Bank 5: be00000000800400
[  124.333934] mce: [Hardware Error]: TSC 60b9750173 ADDR 3fff81406095 MISC 7fff
[  124.333993] mce: [Hardware Error]: PROCESSOR 0:106a4 TIME 1395783630 SOCKET 0 APIC 4 microcode 10
[  124.334043] mce: [Hardware Error]: Run the above through 'mcelog --ascii'
[  124.334082] mce: [Hardware Error]: Machine check: Processor context corrupt
[  124.334123] Kernel panic - not syncing: Fatal Machine check
[  124.334160] drm_kms_helper: panic occurred, switching back to text console
[  124.334204] Rebooting in 30 seconds..
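
For anyone reading the log without mcelog at hand, here is a rough Python sketch that unpacks the status word and the TIME field from the lines above. It assumes the generic IA32_MCi_STATUS layout from the Intel SDM (flag bits 63 to 57, MCA error code in bits 15:0, model-specific code in bits 31:16); the function name is made up for illustration, and `mcelog --ascii` remains the proper tool.

```python
from datetime import datetime, timezone

# Flag bits in the upper part of IA32_MCi_STATUS (generic layout, Intel SDM).
STATUS_FLAGS = {
    63: "VAL (register contents valid)",
    62: "OVER (an earlier error was overwritten)",
    61: "UC (uncorrected error)",
    60: "EN (error reporting enabled)",
    59: "MISCV (MISC register valid)",
    58: "ADDRV (ADDR register valid)",
    57: "PCC (processor context corrupt)",
}


def decode_mci_status(status: int) -> dict:
    """Split a 64-bit machine-check status word into flags and error codes."""
    flags = [
        name
        for bit, name in sorted(STATUS_FLAGS.items(), reverse=True)
        if (status >> bit) & 1
    ]
    return {
        "flags": flags,
        "mca_error_code": status & 0xFFFF,               # bits 15:0
        "model_specific_code": (status >> 16) & 0xFFFF,  # bits 31:16
    }


# Values copied from the log: "Bank 5: be00000000800400" and "TIME 1395783630".
decoded = decode_mci_status(0xBE00000000800400)
for flag in decoded["flags"]:
    print(flag)
print(f"MCA error code:      {decoded['mca_error_code']:#06x}")
print(f"model-specific code: {decoded['model_specific_code']:#06x}")

# TIME is seconds since the Unix epoch.
print(datetime.fromtimestamp(1395783630, tz=timezone.utc).isoformat())
```

The PCC bit comes out set, which agrees with the "Processor context corrupt" line in the log. The MCA error code is 0x0400, which as far as I can tell is the SDM's "internal timer error" encoding, but treat that reading as unverified.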

Any suggestions on how to solve this?

Also, this might be of relevance:
lspci

00:00.0 Host bridge: Intel Corporation 5520/5500/X58 I/O Hub to ESI Port (rev 12)
00:01.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 1 (rev 12)
00:03.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 3 (rev 12)
00:07.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 7 (rev 12)
00:14.0 PIC: Intel Corporation 7500/5520/5500/X58 I/O Hub System Management Registers (rev 12)
00:14.1 PIC: Intel Corporation 7500/5520/5500/X58 I/O Hub GPIO and Scratch Pad Registers (rev 12)
00:14.2 PIC: Intel Corporation 7500/5520/5500/X58 I/O Hub Control Status and RAS Registers (rev 12)
00:14.3 PIC: Intel Corporation 7500/5520/5500/X58 I/O Hub Throttle Registers (rev 12)
00:1a.0 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #4
00:1a.1 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #5
00:1a.2 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #6
00:1a.7 USB controller: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #2
00:1b.0 Audio device: Intel Corporation 82801JI (ICH10 Family) HD Audio Controller
00:1c.0 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Root Port 1
00:1c.2 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Root Port 3
00:1c.3 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Root Port 4
00:1c.4 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Root Port 5
00:1c.5 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Root Port 6
00:1d.0 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #1
00:1d.1 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #2
00:1d.2 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #3
00:1d.7 USB controller: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #1
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev 90)
00:1f.0 ISA bridge: Intel Corporation 82801JIR (ICH10R) LPC Interface Controller
00:1f.2 IDE interface: Intel Corporation 82801JI (ICH10 Family) 4 port SATA IDE Controller #1
00:1f.3 SMBus: Intel Corporation 82801JI (ICH10 Family) SMBus Controller
00:1f.5 IDE interface: Intel Corporation 82801JI (ICH10 Family) 2 port SATA IDE Controller #2
01:00.0 Network controller: Qualcomm Atheros AR5418 Wireless Network Adapter [AR5008E 802.11(a)bgn] (PCI-Express) (rev 01)
02:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] RV770 [Radeon HD 4870]
02:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] RV770 HDMI Audio [Radeon HD 4850/4870]
04:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8056 PCI-E Gigabit Ethernet Controller (rev 12)
05:00.0 IDE interface: Marvell Technology Group Ltd. 88SE6121 SATA II / PATA Controller (rev b2)
06:00.0 RAID bus controller: Marvell Technology Group Ltd. 88SE6440 SAS/SATA PCIe controller (rev 02)
07:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8056 PCI-E Gigabit Ethernet Controller (rev 12)
09:02.0 FireWire (IEEE 1394): VIA Technologies, Inc. VT6306/7/8 [Fire II(M)] IEEE 1394 OHCI Controller (rev c0)
ff:00.0 Host bridge: Intel Corporation Xeon 5500/Core i7 QuickPath Architecture Generic Non-Core Registers (rev 04)
ff:00.1 Host bridge: Intel Corporation Xeon 5500/Core i7 QuickPath Architecture System Address Decoder (rev 04)
ff:02.0 Host bridge: Intel Corporation Xeon 5500/Core i7 QPI Link 0 (rev 04)
ff:02.1 Host bridge: Intel Corporation Xeon 5500/Core i7 QPI Physical 0 (rev 04)
ff:03.0 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller (rev 04)
ff:03.1 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Target Address Decoder (rev 04)
ff:03.4 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Test Registers (rev 04)
ff:04.0 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 0 Control Registers (rev 04)
ff:04.1 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 0 Address Registers (rev 04)
ff:04.2 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 0 Rank Registers (rev 04)
ff:04.3 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 0 Thermal Control Registers (rev 04)
ff:05.0 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 1 Control Registers (rev 04)
ff:05.1 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 1 Address Registers (rev 04)
ff:05.2 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 1 Rank Registers (rev 04)
ff:05.3 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 1 Thermal Control Registers (rev 04)
ff:06.0 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 2 Control Registers (rev 04)
ff:06.1 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 2 Address Registers (rev 04)
ff:06.2 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 2 Rank Registers (rev 04)
ff:06.3 Host bridge: Intel Corporation Xeon 5500/Core i7 Integrated Memory Controller Channel 2 Thermal Control Registers (rev 04)

uname -a

Linux euclid 3.13.7-1-ARCH #1 SMP PREEMPT Mon Mar 24 20:06:08 CET 2014 x86_64 GNU/Linux

Please tell me if there's other information I should provide. Thank you in advance.

#2 2014-04-01 15:48:52

MadTux
Member
Registered: 2009-09-20
Posts: 553

Re: Machine Check Exception at shutdown, system freezes before login, etc.

Did you add additional hardware? To rule out memory issues it may be helpful to run memtest. Does this happen directly after a cold boot, or, in other words, could it be caused by overheating?

#3 2014-04-01 21:18:53

combuster
Member
From: Serbia
Registered: 2008-09-30
Posts: 711

Re: Machine Check Exception at shutdown, system freezes before login, etc.

If this is a server, do you have an OBM card (iLO, DRAC, etc.)? MCEs should be logged there, and those logs could provide more info than the kernel does. Generally, this could be a DIMM problem or a BIOS bug (if you rule out overheating). You could test your memory either with memtest or with an EFI/BIOS utility/module if there is one.

#4 2014-04-01 21:41:54

Johaug
Member
Registered: 2013-06-15
Posts: 13

Re: Machine Check Exception at shutdown, system freezes before login, etc.

No hardware components have been added recently. I did try to perform a shutdown immediately after a cold boot, but the error still occurred.

It's not a server, and I don't have any OBM card. I will run a memory test tomorrow and report back.

One thing I haven't mentioned: I think these problems started after an electrical outage that happened while this computer was in suspend mode.

Thank you for the suggestions.

Last edited by Johaug (2014-04-01 22:35:56)

#5 2014-04-02 21:24:26

Johaug
Member
Registered: 2013-06-15
Posts: 13

Re: Machine Check Exception at shutdown, system freezes before login, etc.

I have now performed a memory test with Memtest86+. 8 passes, no errors.

Last edited by Johaug (2014-04-03 20:05:43)

#6 2014-04-03 04:43:08

combuster
Member
From: Serbia
Registered: 2008-09-30
Posts: 711

Re: Machine Check Exception at shutdown, system freezes before login, etc.

Does this happen when you boot Arch from a live USB and then initiate shutdown? If it does not, you could try to force fsck on all partitions, even though this one is a stretch.

#7 2014-04-03 15:49:51

Johaug
Member
Registered: 2013-06-15
Posts: 13

Re: Machine Check Exception at shutdown, system freezes before login, etc.

The live USB wouldn't even boot. It gets to ‘Welcome to’, which is displayed half cut off horizontally. Then apparently nothing happens for a couple of seconds, before the system reboots. Perhaps it is the same error as I get when I try to do a shutdown.

#8 2014-04-04 05:39:06

combuster
Member
From: Serbia
Registered: 2008-09-30
Posts: 711

Re: Machine Check Exception at shutdown, system freezes before login, etc.

OK, that is a strong indicator of a hardware failure (provided it also happens with another distribution or another version of the Arch Linux installer).

You could check HDD health with smartctl and try another GPU. Memory is OK and nothing overheats, so if the HDD and GPU are OK, then the CPU/MB or PSU is probably toast.
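
If you want to script the disk check, something like this Python sketch might do. It assumes smartmontools is installed, that the script runs as root, and that the drive reports in the ATA style, where `smartctl -H` prints an "overall-health self-assessment test result" line (SCSI/SAS drives word it differently). The function names and the /dev/sda path are placeholders, not anything from this thread.

```python
import subprocess


def smart_health_passed(smartctl_output: str) -> bool:
    """Return True if the ATA-style overall-health line reports PASSED."""
    for line in smartctl_output.splitlines():
        if "overall-health self-assessment test result" in line:
            # The verdict is whatever follows the last colon on that line.
            return line.rsplit(":", 1)[1].strip() == "PASSED"
    raise ValueError("no overall-health line found in smartctl output")


def check_drive(device: str) -> bool:
    """Run `smartctl -H` on one device and interpret the result."""
    proc = subprocess.run(
        ["smartctl", "-H", device],
        capture_output=True,
        text=True,
    )
    return smart_health_passed(proc.stdout)


# Example call (placeholder device path, needs root):
#     print(check_drive("/dev/sda"))
```

A PASSED verdict only means the drive does not consider itself failing; the attribute table from `smartctl -a` is worth a look as well.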

Last edited by combuster (2014-04-04 05:49:04)

#9 2014-04-04 21:13:48

Johaug
Member
Registered: 2013-06-15
Posts: 13

Re: Machine Check Exception at shutdown, system freezes before login, etc.

Changing the GPU solved the problems. What should I do from here? I mean, to decide whether it is a hardware or software problem.

Thank you for the help.

#10 2014-04-04 23:05:14

combuster
Member
From: Serbia
Registered: 2008-09-30
Posts: 711

Re: Machine Check Exception at shutdown, system freezes before login, etc.

Does this also solve your problems when rebooting (no more MCEs)? If so, you should check whether that Radeon works in another machine. Running a few burn-in benchmarks on it should give you an indication of whether it can be RMA'd, sold, etc.

#11 2014-04-07 09:36:23

Johaug
Member
Registered: 2013-06-15
Posts: 13

Re: Machine Check Exception at shutdown, system freezes before login, etc.

It seemed to solve all my problems, yes. However, the graphics card caused no problems in another machine I tested it in. I will thus continue with the approach followed so far to see whether I can track down the problem.

#12 2014-04-18 22:20:46

Johaug
Member
Registered: 2013-06-15
Posts: 13

Re: Machine Check Exception at shutdown, system freezes before login, etc.

New discovery: Unplugging one of my two monitors apparently solves the problem(s). There is a thread here describing what seems to be the same issue.

#13 2014-04-20 13:34:42

Johaug
Member
Registered: 2013-06-15
Posts: 13

Re: Machine Check Exception at shutdown, system freezes before login, etc.

The issues seem to be caused by a bug in Radeon DPM. Using the kernel parameter radeon.dpm=0 gets me around them.
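
To double-check after a reboot that the parameter actually reached the kernel, a small Python sketch like the following could read it back from /proc/cmdline. The helper names are made up for illustration, and the sample command line in the comment is invented, not copied from my bootloader config.

```python
from typing import Optional


def param_from_cmdline(cmdline: str, name: str) -> Optional[str]:
    """Return the value of `name=value` on a kernel command line, or None."""
    value = None
    for token in cmdline.split():
        key, sep, val = token.partition("=")
        if key == name and sep:
            # Keep the last occurrence if the parameter appears more than once.
            value = val
    return value


def read_cmdline(path: str = "/proc/cmdline") -> str:
    """Read the command line the running kernel was booted with."""
    with open(path) as f:
        return f.read()


# Example with an invented command line:
#     param_from_cmdline("root=/dev/sda1 rw radeon.dpm=0", "radeon.dpm")
# On the running system:
#     param_from_cmdline(read_cmdline(), "radeon.dpm")
```

This only shows what was passed at boot; it does not by itself prove the driver honored the setting.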

Last edited by Johaug (2014-04-20 16:40:56)
