You are not logged in.
After running stable for years, my Intel i5-6400 system suddenly (starting May 25th) suffers from random complete freezes up to several times a day.
Usual logs do not help, but after attaching a serial console, I was finally able to track down these freezes to kernel panics:
kernel panic 20200607-2307 (at the end): https://pastebin.com/Rxeiz4WG
kernel panic 20200609-1136: https://pastebin.com/TB397Hm3
kernel panic 20200609-1807: https://pastebin.com/TA1ctCQq
kernel panic 20200609-2118: https://pastebin.com/W0cxTmx3
I already ran memtest86+ to no avail and even ran the system with only one memory module at one time, so RAM seems to be ok. Filessystems and SMART parameters are ok as well.
Switching to the 5.4.43 LTS kernel did not help. There was an upgrade to intel-ucode, but I don't think it affects my CPU (I still get "microcode updated early to revision 0xd6, date = 2019-10-03" on boot).
Freezes are not related to heavy system load situations - rather they occur while doing some light work or even while the machine is more or less idle.
I also once stopped sddm to shut down graphics mode, only to run into a freeze in text mode also.
I'm really clueless now and really need help in interpreting the kernel panic messages or any other advise.
TIA
Heinz
More hardware details:
$ lspci
00:00.0 Host bridge: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Host Bridge/DRAM Registers (rev 07)
00:02.0 VGA compatible controller: Intel Corporation HD Graphics 530 (rev 06)
00:08.0 System peripheral: Intel Corporation Xeon E3-1200 v5/v6 / E3-1500 v5 / 6th/7th/8th Gen Core Processor Gaussian Mixture Model
00:14.0 USB controller: Intel Corporation 100 Series/C230 Series Chipset Family USB 3.0 xHCI Controller (rev 31)
00:14.2 Signal processing controller: Intel Corporation 100 Series/C230 Series Chipset Family Thermal Subsystem (rev 31)
00:15.0 Signal processing controller: Intel Corporation 100 Series/C230 Series Chipset Family Serial IO I2C Controller #0 (rev 31)
00:15.1 Signal processing controller: Intel Corporation 100 Series/C230 Series Chipset Family Serial IO I2C Controller #1 (rev 31)
00:16.0 Communication controller: Intel Corporation 100 Series/C230 Series Chipset Family MEI Controller #1 (rev 31)
00:17.0 SATA controller: Intel Corporation Q170/Q150/B150/H170/H110/Z170/CM236 Chipset SATA Controller [AHCI Mode] (rev 31)
00:1c.0 PCI bridge: Intel Corporation 100 Series/C230 Series Chipset Family PCI Express Root Port #1 (rev f1)
00:1c.2 PCI bridge: Intel Corporation 100 Series/C230 Series Chipset Family PCI Express Root Port #3 (rev f1)
00:1d.0 PCI bridge: Intel Corporation 100 Series/C230 Series Chipset Family PCI Express Root Port #9 (rev f1)
00:1d.2 PCI bridge: Intel Corporation 100 Series/C230 Series Chipset Family PCI Express Root Port #11 (rev f1)
00:1d.3 PCI bridge: Intel Corporation 100 Series/C230 Series Chipset Family PCI Express Root Port #12 (rev f1)
00:1e.0 Signal processing controller: Intel Corporation 100 Series/C230 Series Chipset Family Serial IO UART #0 (rev 31)
00:1f.0 ISA bridge: Intel Corporation Z170 Chipset LPC/eSPI Controller (rev 31)
00:1f.2 Memory controller: Intel Corporation 100 Series/C230 Series Chipset Family Power Management Controller (rev 31)
00:1f.3 Audio device: Intel Corporation 100 Series/C230 Series Chipset Family HD Audio Controller (rev 31)
00:1f.4 SMBus: Intel Corporation 100 Series/C230 Series Chipset Family SMBus (rev 31)
02:00.0 USB controller: ASMedia Technology Inc. ASM1142 USB 3.1 Host Controller
04:00.0 PCI bridge: ASMedia Technology Inc. ASM1083/1085 PCIe to PCI Bridge (rev 03)
05:01.0 Multimedia controller: Philips Semiconductors SAA7146 (rev 01)
06:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 15)
$ lsusb
Bus 004 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
Bus 003 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
Bus 001 Device 005: ID 045e:00db Microsoft Corp. Natural Ergonomic Keyboard 4000 V1.0
Bus 001 Device 004: ID 046d:0892 Logitech, Inc. OrbiCam
Bus 001 Device 003: ID 0a12:0001 Cambridge Silicon Radio, Ltd Bluetooth Dongle (HCI mode)
Bus 001 Device 007: ID 05e3:0723 Genesys Logic, Inc. GL827L SD/MMC/MS Flash Card Reader
Bus 001 Device 006: ID 1a40:0101 Terminus Technology Inc. Hub
Bus 001 Device 002: ID 062a:4102 MosArt Semiconductor Corp. 2.4G Wireless Mouse
Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Last edited by HarveyK (2020-06-11 09:14:48)
Offline
Offline
Thanks a lot, that looks very similar!
Since https://www.spinics.net/lists/netdev/msg658503.html says
> I hit the crash in kernels 5.4.43 through 5.7.
I now downgraded to 5.4.42-1-lts - let's hope for the best!
Thanks again
Heinz
Offline
5.4.42 contains https://git.kernel.org/pub/scm/linux/ke … bb2d4a3755 I would suggest 5.4.41 or you could revert that commit.
Edit:
Or 5.6.13.
Last edited by loqs (2020-06-09 23:13:38)
Offline
5.4.42 contains https://git.kernel.org/pub/scm/linux/ke … bb2d4a3755 I would suggest 5.4.41 or you could revert that commit.
Edit:
Or 5.6.13.
Important info, thanks again
Heinz
Offline
Solved by downgrading to 5.4.41-1-lts.
Will keep an eye on https://bugzilla.kernel.org/show_bug.cgi?id=208003.
Offline