You are not logged in.
As of a few days ago my system will just randomly stop, with my main/primary monitor going black and my secondary monitor going green, sit for a few seconds/a minute and reboot. I hadn't changed anything major prior to this starting, nor has anything I tried worked. Not doing anything specific when this happens (I don't think?), just seems to happen at random intervals.
journalctl -b -1 doesn't display any logs related to the crash and just gets cut off, nor have I been able to find anything in most other log files. Tried using journalctl --follow as per the advice of this thread (I don't have RAM disappearing so I'm guessing it's not the same issue) but haven't been able to trigger it yet. Here's the cut-off log (redacted ip/mac address for obvious reasons)
A full list of what I've tried to fix this;
Upgrading my system
Using the LTS kernel
Remove kernel parameters for IOMMU & Unlocking my AMDGPU's features
Upgrading my BIOS to the latest (as a side affect also re-installing GRUB)
Ran short SMART tests on all my drives to make sure they weren't dropping out or something
Ran a 4 pass memtest which passed.
Re-mounting my GPU.
System specs in-case it's related to them;
System:
Kernel: 6.13.8-arch1-1 arch: x86_64 bits: 64 compiler: gcc v: 14.2.1
Desktop: Hyprland v: 0.48.1 Distro: Arch Linux
Machine:
Type: Desktop Mobo: Gigabyte model: B650 GAMING X AX v: x.x
serial: <superuser required> UEFI: American Megatrends LLC. v: FB3
date: 02/05/2025
CPU:
Info: 8-core model: AMD Ryzen 7 7800X3D bits: 64 type: MT MCP arch: Zen 4
rev: 2 cache: L1: 512 KiB L2: 8 MiB L3: 96 MiB
Speed (MHz): avg: 4023 min/max: 545/5050 boost: enabled cores: 1: 4023
2: 4023 3: 4023 4: 4023 5: 4023 6: 4023 7: 4023 8: 4023 9: 4023 10: 4023
11: 4023 12: 4023 13: 4023 14: 4023 15: 4023 16: 4023 bogomips: 134140
Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
Graphics:
Device-1: Advanced Micro Devices [AMD/ATI] Navi 32 [Radeon RX 7700 XT /
7800 XT] vendor: Sapphire driver: amdgpu v: kernel arch: RDNA-3
bus-ID: 03:00.0
Device-2: Advanced Micro Devices [AMD/ATI] Raphael vendor: Gigabyte
driver: amdgpu v: kernel arch: RDNA-2 bus-ID: 12:00.0 temp: 37.0 C
Display: wayland server: X.Org v: 24.1.6 with: Xwayland v: 24.1.6
compositor: Hyprland v: 0.48.1 driver: X: loaded: amdgpu
unloaded: modesetting dri: radeonsi gpu: amdgpu resolution:
1: 1920x1080~240Hz 2: 1920x1080~75Hz
API: EGL v: 1.5 drivers: kms_swrast,radeonsi,swrast platforms:
active: gbm,wayland,x11,surfaceless,device inactive: N/A
API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 25.0.2-arch1.2
glx-v: 1.4 direct-render: yes renderer: AMD Radeon RX 7800 XT (radeonsi
navi32 LLVM 19.1.7 DRM 3.61 6.13.8-arch1-1)
API: Vulkan v: 1.4.309 drivers: N/A surfaces: xcb,xlib,wayland devices: 2
Info: Tools: api: clinfo, eglinfo, glxinfo, vulkaninfo de: kscreen-doctor
gpu: corectrl wl: swaymsg, wayland-info, wlr-randr
x11: xdpyinfo, xprop, xrandr
Audio:
Device-1: Advanced Micro Devices [AMD/ATI] Navi 31 HDMI/DP Audio
driver: snd_hda_intel v: kernel bus-ID: 03:00.1
Device-2: Advanced Micro Devices [AMD/ATI] Rembrandt Radeon High
Definition Audio driver: snd_hda_intel v: kernel bus-ID: 12:00.1
Device-3: Advanced Micro Devices [AMD] Family 17h/19h/1ah HD Audio
vendor: Gigabyte driver: snd_hda_intel v: kernel bus-ID: 12:00.6
Device-4: Focusrite-Novation Scarlett Solo 4th Gen driver: snd-usb-audio
type: USB bus-ID: 1-2.3:6
Device-5: SteelSeries ApS GameDAC driver: hid-generic,snd-usb-audio,usbhid
type: USB bus-ID: 7-1.2.2:5
API: ALSA v: k6.13.8-arch1-1 status: kernel-api
Server-1: sndiod v: N/A status: off
Server-2: JACK v: 1.9.22 status: off
Server-3: PipeWire v: 1.4.1 status: off
Server-4: PulseAudio v: 17.0-43-g3e2bb status: active
Network:
Device-1: Realtek RTL8852CE PCIe 802.11ax Wireless Network
driver: rtw89_8852ce v: kernel port: d000 bus-ID: 09:00.0
IF: wlan0 state: up mac: <filter>
Device-2: Realtek RTL8125 2.5GbE vendor: Gigabyte driver: r8169 v: kernel
port: c000 bus-ID: 0a:00.0
IF: enp10s0 state: down mac: <filter>
IF-ID-1: virbr0 state: down mac: <filter>
Bluetooth:
Device-1: Realtek Bluetooth Radio driver: btusb v: 0.8 type: USB
bus-ID: 1-9:7
Report: btmgmt ID: hci0 rfk-id: 0 state: up address: <filter> bt-v: 5.3
lmp-v: 12
Drives:
Local Storage: total: 4.55 TiB used: 1.84 TiB (40.4%)
ID-1: /dev/nvme0n1 vendor: Western Digital model: WDS100T2B0C
size: 931.51 GiB temp: 29.9 C
ID-2: /dev/sda vendor: Seagate model: ST2000VM003-1CT164 size: 1.82 TiB
ID-3: /dev/sdb vendor: Seagate model: ST2000DM008-2UB102 size: 1.82 TiB
Partition:
ID-1: / size: 496.77 GiB used: 438.25 GiB (88.2%) fs: ext4
dev: /dev/nvme0n1p5
ID-2: /boot size: 1.25 GiB used: 214.9 MiB (16.8%) fs: vfat
dev: /dev/nvme0n1p1
Swap:
ID-1: swap-1 type: partition size: 7.45 GiB used: 0 KiB (0.0%)
dev: /dev/nvme0n1p6
Sensors:
System Temperatures: cpu: 56.5 C mobo: 24.0 C
Fan Speeds (rpm): N/A
GPU: device: amdgpu temp: 52.0 C fan: 0 device: amdgpu temp: 37.0 C
Info:
Memory: total: 32 GiB note: est. available: 30.46 GiB used: 9.17 GiB (30.1%)
Processes: 440 Uptime: 15m Init: systemd
Packages: 2301 Compilers: clang: 19.1.7 gcc: 14.2.1 Shell: Zsh v: 5.9
inxi: 3.3.37No idea how to track/debug this sort of issue without any logs of any kind, so I'm not sure where to go from here. Any help is appreciated, and if you need more info than what I've provided here just let me know. Thanks.
Edit: Clarified wording on logs, attached my journalctl from a crash and used inxi for system info instead.
Edit: Updated the list of stuff I've tried
Last edited by Livaco (2025-04-10 12:58:15)
Offline
What do you mean by "journalctl -b -1" doesn't display anything? The log is empty? If so, check if "Storage" is set to "auto" (the default) in /etc/systemd/journald.conf) and if you have a directory called /var/log/journal. If set to "auto" and that directory does not exist, you do not have persistent logs between boots.
Last edited by twelveeighty (2025-04-01 23:31:51)
Offline
What do you mean by "journalctl -b -1" doesn't display anything? The log is empty? If so, check if "Storage" is set to "auto" (the default) in /etc/systemd/journald.conf) and if you have a directory called /var/log/journal. If set to "auto" and that directory does not exist, you do not have persistent logs between boots.
Sorry this is bad wording on my part - "doesn't display anything" as in it doesn't display any crash logs or errors at the time of the crash anything, just gets cut off when the system crashes. Best I can see is my wifi driver complaining all the time & some programs core dumping, but not near the point of crashing.
I'll clarify it & add a pastebin to the log in the original post since I realized I forgot to last night.
Offline
This error is happening way too often, my guess is this is linked to your problem:
Apr 01 22:22:43 silverhand kernel: rtw89_8852ce 0000:09:00.0: c2h class 9 func 6 not support
Apr 01 22:22:45 silverhand kernel: rtw89_8852ce 0000:09:00.0: c2h class 9 func 6 not support
Apr 01 22:22:47 silverhand kernel: rtw89_8852ce 0000:09:00.0: c2h class 9 func 6 not support
Apr 01 22:22:49 silverhand kernel: rtw89_8852ce 0000:09:00.0: c2h class 9 func 6 not support
...ad infinitum...Could this be: https://bbs.archlinux.org/viewtopic.php?id=286109?
Offline
This error is happening way too often, my guess is this is linked to your problem:
Apr 01 22:22:43 silverhand kernel: rtw89_8852ce 0000:09:00.0: c2h class 9 func 6 not support Apr 01 22:22:45 silverhand kernel: rtw89_8852ce 0000:09:00.0: c2h class 9 func 6 not support Apr 01 22:22:47 silverhand kernel: rtw89_8852ce 0000:09:00.0: c2h class 9 func 6 not support Apr 01 22:22:49 silverhand kernel: rtw89_8852ce 0000:09:00.0: c2h class 9 func 6 not support ...ad infinitum...Could this be: https://bbs.archlinux.org/viewtopic.php?id=286109?
I've tried that solution already in the past and found it didn't work, from what I can tell it's actually been patched in 6.14?
For the record though I don't think this is what's causing it, as it's been spamming that message at me for months without actual issue.
Offline
Hi - Just an update on this, managed to have a crash that did actually put something seemingly significant into the logs before the system conked out (full log);
...
Apr 03 12:29:07 silverhand kernel: amdgpu 0000:03:00.0: amdgpu: Dumping IP State
Apr 03 12:29:07 silverhand kernel: amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:41 param:0x00000000 message:DisallowGfxOff?
Apr 03 12:29:07 silverhand kernel: amdgpu 0000:03:00.0: amdgpu: Failed to disable gfxoff!
...Doing some googling around there seems to be threads around the first line from nov/dec of 2024? Nothing on the other two (gfxoff) though. If anyone can clarify what it's complaining about or if it's even relevant that would be greatly appreciated, since I'm a bit out of my depth looking at this ![]()
If it's not relevant my next step will be to enable & use SysRq to sync to disk before my PC gets the chance to reboot so hopefully something gets saved to logs; there have been some crashes I can still hear my audio come through before it goes, so the Kernel still seems to be at least running pre-reboot?
Last edited by Livaco (2025-04-04 21:07:51)
Offline
7800X3D
That seems to be an APU if I'm not mistaken?
https://www.reddit.com/r/AMDHelp/commen … artifacts/
Might be related since I see something related to DisallowGfxOff in your log. Try disabling it from BIOS, or integrated graphics altogether?
Offline
Livaco wrote:7800X3D
That seems to be an APU if I'm not mistaken?
https://www.reddit.com/r/AMDHelp/commen … artifacts/
Might be related since I see something related to DisallowGfxOff in your log. Try disabling it from BIOS, or integrated graphics altogether?
Huh, odd bug but agreed it seems to be widespread, disabled it in my BIOS and will update if it crashes again. Cheers
Offline
Apr 03 12:29:07 silverhand kernel: amdgpu 0000:03:00.0: amdgpu: Failed to disable gfxoff!was 13s before the journal ends - is this even close to the time when the system shut down?
Generally, https://wiki.archlinux.org/title/Ryzen#Troubleshooting
Offline
Apr 03 12:29:07 silverhand kernel: amdgpu 0000:03:00.0: amdgpu: Failed to disable gfxoff!was 13s before the journal ends - is this even close to the time when the system shut down?
I'm fairly certain, since as I said in the OP it will sit for a variable time before finally rebooting & I can sometimes hear my audio continue playing (as was the case with that boot), so it has to keep running before the reboot somehow.
Offline
If/when this happens the next time, try to reboot the system using the https://wiki.archlinux.org/title/Keyboa … el_(SysRq) before it reboots by itself.
Though for the latter, either the system overheats or you're running into CPU/memory/power related issues (see the linked ryzen paragraphs)
Offline
Sorry for the lack of updates, I have one now in that it went away for a couple of days, but as of today is back and has happened like 3 times in 5 minutes - I outright can't get more than a few minutes of usage now without this happening.
I tried syncing to disk and stuff all three times, so here's what I've got;
I have university coursework due next week that I need this PC to be active to do, so I'll be going back to Windows until this gets solved lest I corrupt my hard drive with constantly crashing. ![]()
EDIT: This is also happening in Windows, so this is hardware at this point, not Linux. I'll mark this as solved and go investigating, cheers.
Last edited by Livaco (2025-04-10 12:58:02)
Offline
Apr 10 09:27:33 silverhand kernel: amdgpu 0000:03:00.0: [drm] fb0: amdgpudrmfb frame buffer device
Apr 10 10:15:27 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* [CRTC:79:crtc-0] flip_done timed out
Apr 10 10:15:27 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* [CRTC:83:crtc-1] flip_done timed out
Apr 10 10:15:27 silverhand kernel: amdgpu 0000:03:00.0: amdgpu: Dumping IP State
Apr 10 10:15:27 silverhand kernel: amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:41 param:0x00000000 message:DisallowGfxOff?
Apr 10 10:15:27 silverhand kernel: amdgpu 0000:03:00.0: amdgpu: Failed to disable gfxoff!
Apr 10 10:15:31 silverhand kernel: [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:79:crtc-0] hw_done or flip_done timed out
Apr 10 10:15:41 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* flip_done timed out
Apr 10 10:15:41 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* [CRTC:79:crtc-0] commit wait timed out
Apr 10 10:15:49 silverhand kernel: amdgpu 0000:03:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:41 param:0x00000000 message:DisallowGfxOff?
Apr 10 10:15:49 silverhand kernel: amdgpu 0000:03:00.0: amdgpu: Failed to disable gfxoff!
Apr 10 10:15:52 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* flip_done timed out
Apr 10 10:15:52 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* [CRTC:83:crtc-1] commit wait timed out
Apr 10 10:16:02 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* flip_done timed out
Apr 10 10:16:02 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* [CONNECTOR:101:DP-2] commit wait timed out
Apr 10 10:16:12 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* flip_done timed out
Apr 10 10:16:12 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* [CONNECTOR:107:HDMI-A-1] commit wait timed out
Apr 10 10:16:22 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* flip_done timed out
Apr 10 10:16:22 silverhand kernel: amdgpu 0000:03:00.0: [drm] *ERROR* [PLANE:52:plane-2] commit wait timed outThough notably since it's an APU, make sure to check mentioned https://wiki.archlinux.org/title/Ryzen#Troubleshooting !
Offline