You are not logged in.
Pages: 1
Hello everyone,
I've been having problems with several kinds of seemingly random, frequent freezes and crashes in the last couple days. So far I have experienced the following:
Most commonly: Instant lockups. The entire system freezes immediately and mouse/keyboard no longer work, but are still powered. After plugging them back in they no longer light up until system reset. REISUB doesn't seem to work, although SysRq+R was once somehow logged even after the freeze. These happened fairly infrequently (1-3/month) ever since I started using Arch (and Linux in general) in late April, but have been getting more common in the last few weeks. Might just be the third type but with all graphical programs being frozen, however there is no log output in this case.
Only once so far (today): Hard crash, the machine just stops working suddenly, all screens turn off and input devices become unpowered.
Recently: Massive random load spikes. Load avg climbs continuously and input devices stop working after several seconds, reconnecting them has the same results as with type 1. Once, the 1-minute load average reached over 30 before I reset the PC because the CPU was starting to overheat. The logs indicate that the issue is cores locking up. Some programs seem to keep running, e.g. my status bar sometimes still updates the time and load avg. I have unfortunately not been able to test if SysRq works so far.
For the load spikes there are errors in the journal, but for the other two crashes it simply cuts off immediately before. Here are some logs, as output by 'journalctl -k -o short-iso -b <num>', for all types of crash anyway:
https://pastebin.com/10eEc1Vj https://pastebin.com/6VHqmyti https://pastebin.com/spaQw3cg https://pastebin.com/hXTSLdW3
My system is a desktop PC with a 1st Gen Ryzen CPU and an NVIDIA GPU: https://pastebin.com/aUPMc4ZH
I'm running X.org with i3 as the WM.
I have already tried setting processor.max_cstate and intel_idle.max_cstate to 5 in my kernel command line parameters, since I read that going above that could cause crashes on first generation Ryzen chips, however even after that the issues still remain.
Offline
Make sure your BIOS/UEFI is the latest version, and disable c-state there (often called quiet and cool or something like that on AMD boards iirc)
Offline
Pages: 1