You are not logged in.
SOLUTION: Raiser cable was pcie gen 3, needed to set the correct pcie gen in the BIOS settings.
Hello!
So I've basically spend the past few day trying to solve this issue but so far have been unsuccessful. I'm mostly new to linux but aside from this problem I've been able to get around. I have a duel monitor setup with the following specs:
- Aorus Z790 Xtreme,
- 14900KS,
- 96GB of GSKILL Ram
- Aorus 3080 Xtreme
- 1600W Seasonic power supplyI use KDE-Plasma on Wayland. So this issue most often happens when trying to run my game on one monitor and have my web browser on my 2nd (sometimes playing youtube or reading).
I've noticed it happens mostly when I try to run games via Proton (on steam and lutris). The freeze happens and it locks up my system both display and input (I tried changing to a different tty but it didnt help).
I'm kind of out of ideas what might be causing the issue. I will provide a journalctl but I'm not sure how to execute the command so it gives us useful information.
Any help is appreciated.
EDIT: Kernel command line: "rw loglevel=3 quiet pcie_aspm=off acpi_enforce_resources=lax kernel.split_lock_mitigate=0 nvidia-drm.modeset=1"
Last edited by DragonWhelp (2025-07-04 01:58:04)
Offline
Can you then still ping the system or ssh into it?
Does this also happen on plasma/X11? (In any even please then post your Xorg log, https://wiki.archlinux.org/title/Xorg#General )
Does it happen if you run on GL instead of vulkan?
https://wiki.archlinux.org/title/Steam/ … _emulation
Offline
Nope was not able to ping it.
Don't have X11 installed. Is this the correct file? https://pastebin.com/9x7XriXi
So far I've had it freeze in Hearthstone, Counter Strike, Euro truck simulator 2, Phasmophobia and I also tested Skyrim.
The system basically requires me to power it off, I can't even use the reset button.
Offline
CPU or GPU overheating maybe?
Try to follow the temperatures on one screen during the game and see if the lockup happens when it is a too high value.
zʇıɹɟʇıɹʞsuɐs AUR || Cycling in Budapest with a helmet camera || Revised log levels proposal: "FYI" "WTF" and "OMG" (John Barnette)
Offline
Is this the correct file?
Yes, apparently you've X11 installed after all - you might wanna test it. As well as https://wiki.archlinux.org/title/Steam/ … _emulation
x-ref: https://bbs.archlinux.org/viewtopic.php … 1#p2247651
Also (on wayland) post the output of
qdbus6 org.kde.KWin /KWin supportInformationYou've a hybrid setup but all outputs are connected to the nvidia GPU and that's also what the X11 server is running on - makes me wonder whether kwin_wayland runs reverse-prime on the IGP, leading up to the 1fps situation in https://wiki.archlinux.org/title/PRIME#Reverse_PRIME
(Don't LIBGL_DRI3_DISABLE=true - this currently will cause software rendering!)
This would however not explain
The system basically requires me to power it off, I can't even use the reset button.
=> tried https://wiki.archlinux.org/title/Keyboa … el_(SysRq) ?
High temperatures would hopefully cause an emergency shutdown/reboot - but why is there
pcie_aspm=off acpi_enforce_resources=lax kernel.split_lock_mitigate=0?
Offline
Is this the correct file?
Yes, apparently you've X11 installed after all - you might wanna test it. As well as https://wiki.archlinux.org/title/Steam/ … _emulation
x-ref: https://bbs.archlinux.org/viewtopic.php … 1#p2247651Hmm I will look into that.
Also (on wayland) post the output of
qdbus6 org.kde.KWin /KWin supportInformationYou've a hybrid setup but all outputs are connected to the nvidia GPU and that's also what the X11 server is running on - makes me wonder whether kwin_wayland runs reverse-prime on the IGP, leading up to the 1fps situation in https://wiki.archlinux.org/title/PRIME#Reverse_PRIME
(Don't LIBGL_DRI3_DISABLE=true - this currently will cause software rendering!)This would however not explain
The system basically requires me to power it off, I can't even use the reset button.
=> tried https://wiki.archlinux.org/title/Keyboa … el_(SysRq) ?
Will look into that as well.High temperatures would hopefully cause an emergency shutdown/reboot - but why is there
pcie_aspm=off acpi_enforce_resources=lax kernel.split_lock_mitigate=0?
Temperatures are sometimes up into 90C but i kind of expect that from a 14900K tho.Those were posted on forums with similar issues to mine.
I seem to get freezes even with lower temperatures, but as I've said it only happens when I run a game.
Last edited by DragonWhelp (2025-06-29 20:06:41)
Offline
Compositing is active
Compositing Type: OpenGL
OpenGL vendor string: NVIDIA Corporation
OpenGL renderer string: NVIDIA GeForce RTX 3080/PCIe/SSE2
OpenGL version string: 3.1.0 NVIDIA 575.64
OpenGL platform interface: EGL
OpenGL shading language version string: 1.40 NVIDIA via Cg compiler
Driver: NVIDIA
Driver version: 575.64Wayland runs on the nvidia GPU as well.
Try both, X11 and proton on OpenGL rather than vulkan - you're kind of in the ballpark of the other thread.
Offline
Compositing is active Compositing Type: OpenGL OpenGL vendor string: NVIDIA Corporation OpenGL renderer string: NVIDIA GeForce RTX 3080/PCIe/SSE2 OpenGL version string: 3.1.0 NVIDIA 575.64 OpenGL platform interface: EGL OpenGL shading language version string: 1.40 NVIDIA via Cg compiler Driver: NVIDIA Driver version: 575.64Wayland runs on the nvidia GPU as well.
Try both, X11 and proton on OpenGL rather than vulkan - you're kind of in the ballpark of the other thread.
Getting a bit late here, so I will check on that tomorrow. You think it might be a conflict between the 3080 and the iGPU?
Offline
The hybrid setup, the monitors attached the the nvidia GPU and vulkan - at least if the other thread is relevant here (the main difference is that you're not running on the intel chip)
We'll see.
Offline
So switching to OpenGL did not fix my issue.I've tried that with CoD Black Ops (2010).
The other thing that was more shocking tho, I disabled my iGPU and got the same freeze twice so far.
I have not tried X11 yet, do I need to install X11-Plasma?
EDIT: Sometimes it takes longer to freeze and sometimes I will have it freeze the instant it tries to render anything in a game.
I used PROTON_USE_WINED3D=1 for Black Ops btw.
EDIT 2: Reading his issue a bit more, mine is a complete system freeze, can't do anything send any commands, and even the reset button doesn't work. The only way to reset is a hard one.
I noticed the screen stays frozen for a few more moments (like 1-2 seconds) after the hard restart.
EDIT 3: I tried both nvidia and nvidia-open drivers as well.
Last edited by DragonWhelp (2025-06-30 12:41:04)
Offline
Keep "dmesg -w" running in a terminal that you can see while playing the game (always on top and don't fullscreen the game)
Can you catch some errors there before the system freezes?
Offline
Nothing useful, just a few UFW firewall allows/blocks. Need me to post them?
Last edited by DragonWhelp (2025-06-30 18:08:29)
Offline
No, I had hoped the kernel errors would flare up.
If it's not the hardware: https://wiki.archlinux.org/title/Kdump
If it is the hardware: do you get MCE errors after the reboot (will show up in the journal/dmesg)
Offline
[ 7.161427] platform regulatory.0: Direct firmware load for regulatory.db failed with error -2
[ 7.161431] cfg80211: failed to load regulatory.dbNo MCE errors and that is the only thing that appeared in dmesg --since "20:00" other then
[ 1.307153] RAS: Correctable Errors collector initialized.EDIT: And checking the logs again the last msg were the UFW ones, then it just loaded up the kernel after the reboot.
Should I use kdumpst or simple-kdump?
Last edited by DragonWhelp (2025-06-30 18:24:41)
Offline
No errors will be logged after the system locks up, hence the "keep dmesg in sight" approach.
Test the behavior w/ the LTS kernel, then prepare to setup kdump for a crash kernel.
Offline
So uhh that went badly, kept getting stuck on "Starting Terminate Playmouth boot screen". I logged into a different tty and when I trying to do "startplasma-wayland" the entire thing has frozen.
Offline
What? kdump or the LTS kernel?
Did you forget to install nvida-lts?
Offline
Oops my bad
I'm looking to see if it crashes now(with lts). so far had a few bcachefs errors (probably due to older kernel). And those
41.637108] x86/split lock detection: #AC: CHTTPClientThre/3276 took a split_lock trap at address: 0xebb66c4f
[ 46.856943] x86/split lock detection: #AC: CHTTPClientThre/3592 took a split_lock trap at address: 0xebb66c4f
[ 47.108796] x86/split lock detection: #AC: CHTTPClientThre/3593 took a split_lock trap at address: 0xebb66c4fEDIT: Alright can confirm the lts kernel gives me the same freeze.
EDIT 2: Got kdumpst installed and working. Going to see what happens now.
Last edited by DragonWhelp (2025-06-30 19:54:26)
Offline
so far had a few bcachefs errors
Major. Red Flag. The FS is highly experimental. (No this is isn't about the recent new cycle
)
Can you run w/o (on eg. ext4)?
Offline
so far had a few bcachefs errors
Major. Red Flag. The FS is highly experimental. (No this is isn't about the recent new cycle
)
Can you run w/o (on eg. ext4)?
Ah, sure I can reformat them (still a relativity new setup from 9 days ago).
Jun 30 22:07:41 DragonLand-Main systemd[1]: Starting kdumpst loader boot-time service...
Jun 30 22:07:41 DragonLand-Main root[1406]: kdumpst: pstore-RAM was loaded successfully
Jun 30 22:07:41 DragonLand-Main systemd[1]: Finished kdumpst loader boot-time service.
Jun 30 22:09:18 DragonLand-Main dolphin[2359]: kf.coreaddons: Failed to open "/var/crash/kdumpst/kdump-initrd-6.15.4-arch2-1.img" Permission deniedSo it loaded and it created a kdump image but did not create anything else (it should create something in /logs right?)
EDIT: Guessing that is the kernel image kdumpst is using?
Last edited by DragonWhelp (2025-06-30 20:32:09)
Offline
The initramfs - no idea why dolphin is trying to access it (except if your clicked it or so)
Offline
The initramfs - no idea why dolphin is trying to access it (except if your clicked it or so)
Yeah im used to opening img or iso files from windows ![]()
But I'm kind of stuck here now. Anything else I could try?
Offline
https://wiki.archlinux.org/title/Kdump# … the_kernel
Then actually trigger the real crash and hope that the crash kernel steps in and you can https://wiki.archlinux.org/title/Kdump# … _core_dump
But my money is actually on bcachefs, the FS is highly experimental and has a bunch of known caveats - you should really not use that as root FS or trust any data to it for now.
(Fooling around w/ it on an optional partition and some throw-away data just to figure how and how good it works is fine - might still halt the kernel, though
)
Offline
I decided to just nuke my installation from orbit and start from beginning. I will update you if the problem is gone or if it still happens. ![]()
Offline
Alright so when I try to boot the arch iso i get:
pcieport 0000:00:01.0: AER: Error of this Agent is reported firstAnd the exact same line for nouveau
Offline