You are not logged in.
Pages: 1
System Specs:
Ryzen 9 5900x
SAPPHIRE PURE Radeon RX 7900 GRE
Super Flower Leadex VII XG 1300W
MSI MAG B550 TOMAHAWK
Symptoms:
after a random amount of time, my monitors will go dark and say "no signal".
I can still ssh into the computer but can't restart normally. Holding down the power button to restart has been successful in all but one case, where the VGA light stayed on.
I'm not doing anything in particular to make it happen.
journalctl logs:
Sep 08 21:32:21 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -19
Sep 08 21:32:28 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:32:28 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:32:29 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=1122833, emitted seq=1122835
Sep 08 21:32:29 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process kwin_x11 pid 1897 thread kwin_x11:cs0 pid 1928
Sep 08 21:32:29 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset begin!
Sep 08 21:32:29 louis kernel: amdgpu 0000:2d:00.0: amdgpu: device lost from bus!
Sep 08 21:32:29 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset end with ret = -19
Sep 08 21:32:29 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -19
Sep 08 21:32:31 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=28641, emitted seq=28643
Sep 08 21:32:31 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset begin!
Sep 08 21:32:31 louis kernel: amdgpu 0000:2d:00.0: amdgpu: device lost from bus!
Sep 08 21:32:31 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset end with ret = -19
Sep 08 21:32:31 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -19
Sep 08 21:32:32 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:32:32 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:32:36 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:32:36 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:32:40 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:32:40 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:32:44 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:32:44 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:32:48 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:32:48 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:32:56 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:32:56 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:32:59 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:32:59 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:33:03 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:33:03 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:33:07 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:33:07 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:33:11 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:33:11 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:33:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:33:15 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:33:19 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:33:19 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:33:22 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:33:22 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:33:30 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 21:33:30 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 21:33:30 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES ring buffer is full.
Sep 08 21:33:30 louis kernel: clocksource: Long readout interval, skipping watchdog check: cs_nsec: 3295000766 wd_nsec: 3295003275
Sep 08 21:33:33 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES ring buffer is full.
Sep 08 21:33:37 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES ring buffer is full.
Sep 08 21:33:40 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES ring buffer is full.
Sep 08 21:33:43 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES ring buffer is full.
Sep 08 21:33:47 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES ring buffer is full.
Sep 08 21:33:50 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES ring buffer is full.
Adding this to the kernel parameters didn't change anything
pcie_port_pm=off pcie_aspm.policy=performance
The graphics card is basically new (2 months old), and I've only had minor issues before this (like SDDM not showing a login prompt for 20+ seconds after the screen turns on).
This particular issue started a day or two ago and gets worse the more I turn it on in a day. For some reason, it didn't work last night more than 5 minutes but was fine for hours today before being a pain again.
Last edited by onshore0927 (2024-09-10 05:37:45)
Offline
Sep 08 21:32:21 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -19
The snippet starts with a failure to revover, the causing incident is missing.
This particular issue started a day or two ago and gets worse the more I turn it on in a day.
Temperature issue? What if you dial up the fans (make sure the GPU ones actually turn up) and/or point a room ventilator at the GPU?
Edit: you didn't just forget to attach the dedicated power supply to the GPU, did you?
Last edited by seth (2024-09-09 07:23:22)
Offline
hmm i've already tried putting in a different gpu and it seems to be working okay for now.
I did have all the power supplies connected.
This is the only other log I have saved from one of the crashes
Sep 08 15:49:24 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:49:24 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:49:27 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:49:27 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:49:30 louis kernel: amdgpu 0000:2d:00.0: [drm] *ERROR* [CRTC:79:crtc-0] flip_done timed out
Sep 08 15:49:30 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=7407573, emitted seq=7407575
Sep 08 15:49:30 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 2412 thread firefox:cs0 pid 2416
Sep 08 15:49:30 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset begin!
Sep 08 15:49:30 louis kernel: amdgpu 0000:2d:00.0: amdgpu: device lost from bus!
Sep 08 15:49:30 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset end with ret = -19
Sep 08 15:49:30 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -19
Sep 08 15:49:37 louis kernel: wlp5s0: deauthenticating from 24:62:ce:27:8d:f2 by local choice (Reason: 3=DEAUTH_LEAVING)
Sep 08 15:49:37 louis kernel: r8169 0000:2a:00.0 enp42s0: Link is Down
Sep 08 15:49:38 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=38647, emitted seq=38649
Sep 08 15:49:38 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset begin!
Sep 08 15:49:38 louis kernel: amdgpu 0000:2d:00.0: amdgpu: device lost from bus!
Sep 08 15:49:38 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset end with ret = -19
Sep 08 15:49:38 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -19
Sep 08 15:49:40 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=7407573, emitted seq=7407575
Sep 08 15:49:40 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 2412 thread firefox:cs0 pid 2416
Sep 08 15:49:40 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset begin!
Sep 08 15:49:40 louis kernel: amdgpu 0000:2d:00.0: amdgpu: device lost from bus!
Sep 08 15:49:40 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset end with ret = -19
Sep 08 15:49:40 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -19
Sep 08 15:49:41 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:49:41 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:49:42 louis kernel: PM: suspend entry (deep)
Sep 08 15:49:42 louis kernel: Filesystems sync: 0.001 seconds
Sep 08 15:50:15 louis kernel: Freezing user space processes
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:15 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:15 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=18949, emitted seq=18951
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset begin!
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: device lost from bus!
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset end with ret = -19
Sep 08 15:50:15 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -19
Sep 08 15:50:15 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=38647, emitted seq=38649
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset begin!
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: device lost from bus!
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset end with ret = -19
Sep 08 15:50:15 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -19
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:15 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:15 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:15 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:15 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=18949, emitted seq=18951
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset begin!
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: device lost from bus!
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: GPU reset end with ret = -19
Sep 08 15:50:15 louis kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -19
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:15 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:15 louis kernel: Freezing user space processes completed (elapsed 17.713 seconds)
Sep 08 15:50:15 louis kernel: OOM killer disabled.
Sep 08 15:50:15 louis kernel: Freezing remaining freezable tasks
Sep 08 15:50:15 louis kernel: Freezing remaining freezable tasks completed (elapsed 0.001 seconds)
Sep 08 15:50:15 louis kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Sep 08 15:50:15 louis kernel: [drm] evicting device resources failed
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: PM: device_prepare(): pci_pm_prepare returns -16
Sep 08 15:50:15 louis kernel: amdgpu 0000:2d:00.0: PM: not prepared for power transition: code -16
Sep 08 15:50:15 louis kernel: PM: Some devices failed to suspend, or early wake event detected
Sep 08 15:50:15 louis kernel: OOM killer enabled.
Sep 08 15:50:15 louis kernel: Restarting tasks ... done.
Sep 08 15:50:15 louis kernel: random: crng reseeded on system resumption
Sep 08 15:50:16 louis kernel: PM: suspend exit
Sep 08 15:50:16 louis kernel: PM: suspend entry (s2idle)
Sep 08 15:50:16 louis kernel: Filesystems sync: 0.001 seconds
Sep 08 15:50:31 louis kernel: Freezing user space processes
Sep 08 15:50:31 louis kernel: Freezing user space processes completed (elapsed 0.001 seconds)
Sep 08 15:50:31 louis kernel: OOM killer disabled.
Sep 08 15:50:31 louis kernel: Freezing remaining freezable tasks
Sep 08 15:50:31 louis kernel: Freezing remaining freezable tasks completed (elapsed 0.001 seconds)
Sep 08 15:50:31 louis kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Sep 08 15:50:31 louis kernel: [drm] evicting device resources failed
Sep 08 15:50:31 louis kernel: amdgpu 0000:2d:00.0: PM: device_prepare(): pci_pm_prepare returns -16
Sep 08 15:50:31 louis kernel: amdgpu 0000:2d:00.0: PM: not prepared for power transition: code -16
Sep 08 15:50:31 louis kernel: PM: Some devices failed to suspend, or early wake event detected
Sep 08 15:50:31 louis kernel: OOM killer enabled.
Sep 08 15:50:31 louis kernel: Restarting tasks ... done.
Sep 08 15:50:31 louis kernel: random: crng reseeded on system resumption
Sep 08 15:50:32 louis kernel: PM: suspend exit
Sep 08 15:50:32 louis kernel: snd_hda_intel 0000:2d:00.1: Unable to change power state from D3hot to D0, device inaccessible
Sep 08 15:50:32 louis kernel: snd_hda_intel 0000:2d:00.1: CORB reset timeout#2, CORBRP = 65535
Sep 08 15:50:32 louis kernel: RTL8226B_RTL8221B 2.5Gbps PHY r8169-0-2a00:00: attached PHY driver (mii_bus:phy_addr=r8169-0-2a00:00, irq=MAC)
Sep 08 15:50:32 louis kernel: amdgpu 0000:2d:00.0: amdgpu: SMU: response:0xFFFFFFFF for index:41 param:0x00000000 message:DisallowGfxOff?
Sep 08 15:50:32 louis kernel: amdgpu 0000:2d:00.0: amdgpu: Failed to disable gfxoff!
Sep 08 15:50:32 louis kernel: r8169 0000:2a:00.0 enp42s0: Link is Down
Sep 08 15:50:32 louis kernel: Generic FE-GE Realtek PHY r8169-0-600:00: attached PHY driver (mii_bus:phy_addr=r8169-0-600:00, irq=MAC)
Sep 08 15:50:32 louis kernel: input: Eclipse (AVRCP) as /devices/virtual/input/input34
Sep 08 15:50:32 louis kernel: r8169 0000:06:00.0 enp6s0: Link is Down
Sep 08 15:50:33 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:50:33 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:50:33 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:50:33 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:50:33 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:50:33 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:50:33 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:50:33 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:50:33 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:50:36 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:36 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:38 louis kernel: input: Eclipse (AVRCP) as /devices/virtual/input/input35
Sep 08 15:50:40 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:40 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:42 louis kernel: amdgpu 0000:2d:00.0: [drm] *ERROR* flip_done timed out
Sep 08 15:50:42 louis kernel: amdgpu 0000:2d:00.0: [drm] *ERROR* [CRTC:79:crtc-0] commit wait timed out
Sep 08 15:50:44 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:44 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:47 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:47 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:51 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:51 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:55 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:50:55 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:50:55 louis kernel: PM: suspend entry (deep)
Sep 08 15:50:55 louis kernel: Filesystems sync: 0.001 seconds
Sep 08 15:51:12 louis kernel: Freezing user space processes
Sep 08 15:51:12 louis kernel: Freezing user space processes completed (elapsed 0.001 seconds)
Sep 08 15:51:12 louis kernel: OOM killer disabled.
Sep 08 15:51:12 louis kernel: Freezing remaining freezable tasks
Sep 08 15:51:12 louis kernel: Freezing remaining freezable tasks completed (elapsed 0.001 seconds)
Sep 08 15:51:12 louis kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Sep 08 15:51:12 louis kernel: [drm] evicting device resources failed
Sep 08 15:51:12 louis kernel: amdgpu 0000:2d:00.0: PM: device_prepare(): pci_pm_prepare returns -16
Sep 08 15:51:12 louis kernel: amdgpu 0000:2d:00.0: PM: not prepared for power transition: code -16
Sep 08 15:51:12 louis kernel: PM: Some devices failed to suspend, or early wake event detected
Sep 08 15:51:12 louis kernel: OOM killer enabled.
Sep 08 15:51:12 louis kernel: Restarting tasks ... done.
Sep 08 15:51:12 louis kernel: random: crng reseeded on system resumption
Sep 08 15:51:12 louis kernel: PM: suspend exit
Sep 08 15:51:12 louis kernel: PM: suspend entry (s2idle)
Sep 08 15:51:12 louis kernel: Filesystems sync: 0.001 seconds
Sep 08 15:51:27 louis kernel: Freezing user space processes
Sep 08 15:51:27 louis kernel: Freezing user space processes completed (elapsed 0.001 seconds)
Sep 08 15:51:27 louis kernel: OOM killer disabled.
Sep 08 15:51:27 louis kernel: Freezing remaining freezable tasks
Sep 08 15:51:27 louis kernel: Freezing remaining freezable tasks completed (elapsed 0.001 seconds)
Sep 08 15:51:27 louis kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Sep 08 15:51:27 louis kernel: [drm] evicting device resources failed
Sep 08 15:51:27 louis kernel: amdgpu 0000:2d:00.0: PM: device_prepare(): pci_pm_prepare returns -16
Sep 08 15:51:27 louis kernel: amdgpu 0000:2d:00.0: PM: not prepared for power transition: code -16
Sep 08 15:51:27 louis kernel: PM: Some devices failed to suspend, or early wake event detected
Sep 08 15:51:27 louis kernel: OOM killer enabled.
Sep 08 15:51:27 louis kernel: Restarting tasks ... done.
Sep 08 15:51:27 louis kernel: random: crng reseeded on system resumption
Sep 08 15:51:28 louis kernel: PM: suspend exit
Sep 08 15:51:28 louis kernel: RTL8226B_RTL8221B 2.5Gbps PHY r8169-0-2a00:00: attached PHY driver (mii_bus:phy_addr=r8169-0-2a00:00, irq=MAC)
Sep 08 15:51:28 louis kernel: r8169 0000:2a:00.0 enp42s0: Link is Down
Sep 08 15:51:28 louis kernel: Generic FE-GE Realtek PHY r8169-0-600:00: attached PHY driver (mii_bus:phy_addr=r8169-0-600:00, irq=MAC)
Sep 08 15:51:28 louis kernel: r8169 0000:06:00.0 enp6s0: Link is Down
Sep 08 15:51:29 louis kernel: input: Eclipse (AVRCP) as /devices/virtual/input/input36
Sep 08 15:51:29 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:51:29 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:51:29 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:51:29 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:51:29 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:51:29 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:51:29 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:51:29 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:51:29 louis kernel: ACPI: \: failed to evaluate _DSM bf0212f2-788f-c64d-a5b3-1f738e285ade (0x1001)
Sep 08 15:51:31 louis kernel: wlp5s0: authenticate with 24:62:ce:27:8d:f2 (local address=c8:5e:a9:67:ed:8c)
Sep 08 15:51:31 louis kernel: wlp5s0: send auth to 24:62:ce:27:8d:f2 (try 1/3)
Sep 08 15:51:31 louis kernel: wlp5s0: authenticated
Sep 08 15:51:31 louis kernel: wlp5s0: associate with 24:62:ce:27:8d:f2 (try 1/3)
Sep 08 15:51:31 louis kernel: wlp5s0: RX AssocResp from 24:62:ce:27:8d:f2 (capab=0x1 status=0 aid=1)
Sep 08 15:51:31 louis kernel: wlp5s0: associated
Sep 08 15:51:32 louis kernel: r8169 0000:2a:00.0 enp42s0: Link is Up - 2.5Gbps/Full - flow control rx/tx
Sep 08 15:51:35 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Sep 08 15:51:35 louis kernel: [drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] *ERROR* failed to reg_write_reg_wait
Sep 08 15:51:35 louis kernel: input: Eclipse (AVRCP) as /devices/virtual/input/input37
Sep 08 15:51:35 louis kernel: amdgpu 0000:2d:00.0: amdgpu: MES ring buffer is full.
Sep 08 15:51:35 louis kernel: clocksource: Long readout interval, skipping watchdog check: cs_nsec: 1717184551 wd_nsec: 1717183957
Sep 08 15:51:37 louis kernel: INFO: task kworker/u100:9:20115 blocked for more than 122 seconds.
Sep 08 15:51:37 louis kernel: Tainted: G OE 6.10.8-arch1-1 #1
Sep 08 15:51:37 louis kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 08 15:51:37 louis kernel: task:kworker/u100:9 state:D stack:0 pid:20115 tgid:20115 ppid:2 flags:0x00004000
Sep 08 15:51:37 louis kernel: Workqueue: ttm ttm_bo_delayed_delete [ttm]
Sep 08 15:51:37 louis kernel: Call Trace:
Sep 08 15:51:37 louis kernel: <TASK>
Sep 08 15:51:37 louis kernel: __schedule+0x3d5/0x1520
Sep 08 15:51:37 louis kernel: ? srso_alias_return_thunk+0x5/0xfbef5
Sep 08 15:51:37 louis kernel: ? __slab_free+0xdf/0x2f0
Sep 08 15:51:37 louis kernel: schedule+0x27/0xf0
Sep 08 15:51:37 louis kernel: schedule_timeout+0x12f/0x160
Sep 08 15:51:37 louis kernel: dma_fence_default_wait+0x1d8/0x250
Sep 08 15:51:37 louis kernel: ? __pfx_dma_fence_default_wait_cb+0x10/0x10
Sep 08 15:51:37 louis kernel: dma_fence_wait_timeout+0x108/0x140
Sep 08 15:51:37 louis kernel: dma_resv_wait_timeout+0xcc/0x1c0
Sep 08 15:51:37 louis kernel: ttm_bo_delayed_delete+0x2a/0x80 [ttm 63be4c936e5801617c2d750c97be4a011ba25648]
Sep 08 15:51:37 louis kernel: process_one_work+0x17e/0x330
Sep 08 15:51:37 louis kernel: worker_thread+0x2e2/0x410
Sep 08 15:51:37 louis kernel: ? __pfx_worker_thread+0x10/0x10
Sep 08 15:51:37 louis kernel: kthread+0xd2/0x100
Sep 08 15:51:37 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:51:37 louis kernel: ret_from_fork+0x34/0x50
Sep 08 15:51:37 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:51:37 louis kernel: ret_from_fork_asm+0x1a/0x30
Sep 08 15:51:37 louis kernel: </TASK>
Sep 08 15:53:40 louis kernel: INFO: task kworker/u100:0:16672 blocked for more than 122 seconds.
Sep 08 15:53:40 louis kernel: Tainted: G OE 6.10.8-arch1-1 #1
Sep 08 15:53:40 louis kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 08 15:53:40 louis kernel: task:kworker/u100:0 state:D stack:0 pid:16672 tgid:16672 ppid:2 flags:0x00004000
Sep 08 15:53:40 louis kernel: Workqueue: ttm ttm_bo_delayed_delete [ttm]
Sep 08 15:53:40 louis kernel: Call Trace:
Sep 08 15:53:40 louis kernel: <TASK>
Sep 08 15:53:40 louis kernel: __schedule+0x3d5/0x1520
Sep 08 15:53:40 louis kernel: schedule+0x27/0xf0
Sep 08 15:53:40 louis kernel: schedule_timeout+0x12f/0x160
Sep 08 15:53:40 louis kernel: dma_fence_default_wait+0x1d8/0x250
Sep 08 15:53:40 louis kernel: ? __pfx_dma_fence_default_wait_cb+0x10/0x10
Sep 08 15:53:40 louis kernel: dma_fence_wait_timeout+0x108/0x140
Sep 08 15:53:40 louis kernel: dma_resv_wait_timeout+0xcc/0x1c0
Sep 08 15:53:40 louis kernel: ttm_bo_delayed_delete+0x2a/0x80 [ttm 63be4c936e5801617c2d750c97be4a011ba25648]
Sep 08 15:53:40 louis kernel: process_one_work+0x17e/0x330
Sep 08 15:53:40 louis kernel: worker_thread+0x2e2/0x410
Sep 08 15:53:40 louis kernel: ? __pfx_worker_thread+0x10/0x10
Sep 08 15:53:40 louis kernel: kthread+0xd2/0x100
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork+0x34/0x50
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork_asm+0x1a/0x30
Sep 08 15:53:40 louis kernel: </TASK>
Sep 08 15:53:40 louis kernel: INFO: task kworker/u100:4:16678 blocked for more than 122 seconds.
Sep 08 15:53:40 louis kernel: Tainted: G OE 6.10.8-arch1-1 #1
Sep 08 15:53:40 louis kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 08 15:53:40 louis kernel: task:kworker/u100:4 state:D stack:0 pid:16678 tgid:16678 ppid:2 flags:0x00004000
Sep 08 15:53:40 louis kernel: Workqueue: ttm ttm_bo_delayed_delete [ttm]
Sep 08 15:53:40 louis kernel: Call Trace:
Sep 08 15:53:40 louis kernel: <TASK>
Sep 08 15:53:40 louis kernel: __schedule+0x3d5/0x1520
Sep 08 15:53:40 louis kernel: ? srso_alias_return_thunk+0x5/0xfbef5
Sep 08 15:53:40 louis kernel: ? __slab_free+0xdf/0x2f0
Sep 08 15:53:40 louis kernel: schedule+0x27/0xf0
Sep 08 15:53:40 louis kernel: schedule_timeout+0x12f/0x160
Sep 08 15:53:40 louis kernel: dma_fence_default_wait+0x1d8/0x250
Sep 08 15:53:40 louis kernel: ? __pfx_dma_fence_default_wait_cb+0x10/0x10
Sep 08 15:53:40 louis kernel: dma_fence_wait_timeout+0x108/0x140
Sep 08 15:53:40 louis kernel: dma_resv_wait_timeout+0xcc/0x1c0
Sep 08 15:53:40 louis kernel: ttm_bo_delayed_delete+0x2a/0x80 [ttm 63be4c936e5801617c2d750c97be4a011ba25648]
Sep 08 15:53:40 louis kernel: process_one_work+0x17e/0x330
Sep 08 15:53:40 louis kernel: worker_thread+0x2e2/0x410
Sep 08 15:53:40 louis kernel: ? __pfx_worker_thread+0x10/0x10
Sep 08 15:53:40 louis kernel: kthread+0xd2/0x100
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork+0x34/0x50
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork_asm+0x1a/0x30
Sep 08 15:53:40 louis kernel: </TASK>
Sep 08 15:53:40 louis kernel: INFO: task kworker/u100:3:17710 blocked for more than 122 seconds.
Sep 08 15:53:40 louis kernel: Tainted: G OE 6.10.8-arch1-1 #1
Sep 08 15:53:40 louis kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 08 15:53:40 louis kernel: task:kworker/u100:3 state:D stack:0 pid:17710 tgid:17710 ppid:2 flags:0x00004000
Sep 08 15:53:40 louis kernel: Workqueue: ttm ttm_bo_delayed_delete [ttm]
Sep 08 15:53:40 louis kernel: Call Trace:
Sep 08 15:53:40 louis kernel: <TASK>
Sep 08 15:53:40 louis kernel: __schedule+0x3d5/0x1520
Sep 08 15:53:40 louis kernel: ? srso_alias_return_thunk+0x5/0xfbef5
Sep 08 15:53:40 louis kernel: ? __slab_free+0xdf/0x2f0
Sep 08 15:53:40 louis kernel: schedule+0x27/0xf0
Sep 08 15:53:40 louis kernel: schedule_timeout+0x12f/0x160
Sep 08 15:53:40 louis kernel: dma_fence_default_wait+0x1d8/0x250
Sep 08 15:53:40 louis kernel: ? __pfx_dma_fence_default_wait_cb+0x10/0x10
Sep 08 15:53:40 louis kernel: dma_fence_wait_timeout+0x108/0x140
Sep 08 15:53:40 louis kernel: dma_resv_wait_timeout+0xcc/0x1c0
Sep 08 15:53:40 louis kernel: ttm_bo_delayed_delete+0x2a/0x80 [ttm 63be4c936e5801617c2d750c97be4a011ba25648]
Sep 08 15:53:40 louis kernel: process_one_work+0x17e/0x330
Sep 08 15:53:40 louis kernel: worker_thread+0x2e2/0x410
Sep 08 15:53:40 louis kernel: ? __pfx_worker_thread+0x10/0x10
Sep 08 15:53:40 louis kernel: kthread+0xd2/0x100
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork+0x34/0x50
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork_asm+0x1a/0x30
Sep 08 15:53:40 louis kernel: </TASK>
Sep 08 15:53:40 louis kernel: INFO: task kworker/u101:4:19355 blocked for more than 122 seconds.
Sep 08 15:53:40 louis kernel: Tainted: G OE 6.10.8-arch1-1 #1
Sep 08 15:53:40 louis kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 08 15:53:40 louis kernel: task:kworker/u101:4 state:D stack:0 pid:19355 tgid:19355 ppid:2 flags:0x00004000
Sep 08 15:53:40 louis kernel: Workqueue: ttm ttm_bo_delayed_delete [ttm]
Sep 08 15:53:40 louis kernel: Call Trace:
Sep 08 15:53:40 louis kernel: <TASK>
Sep 08 15:53:40 louis kernel: __schedule+0x3d5/0x1520
Sep 08 15:53:40 louis kernel: ? __iommu_map+0x12c/0x270
Sep 08 15:53:40 louis kernel: schedule+0x27/0xf0
Sep 08 15:53:40 louis kernel: schedule_timeout+0x12f/0x160
Sep 08 15:53:40 louis kernel: dma_fence_default_wait+0x1d8/0x250
Sep 08 15:53:40 louis kernel: ? __pfx_dma_fence_default_wait_cb+0x10/0x10
Sep 08 15:53:40 louis kernel: dma_fence_wait_timeout+0x108/0x140
Sep 08 15:53:40 louis kernel: dma_resv_wait_timeout+0xcc/0x1c0
Sep 08 15:53:40 louis kernel: ttm_bo_delayed_delete+0x2a/0x80 [ttm 63be4c936e5801617c2d750c97be4a011ba25648]
Sep 08 15:53:40 louis kernel: process_one_work+0x17e/0x330
Sep 08 15:53:40 louis kernel: worker_thread+0x2e2/0x410
Sep 08 15:53:40 louis kernel: ? __pfx_worker_thread+0x10/0x10
Sep 08 15:53:40 louis kernel: kthread+0xd2/0x100
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork+0x34/0x50
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork_asm+0x1a/0x30
Sep 08 15:53:40 louis kernel: </TASK>
Sep 08 15:53:40 louis kernel: INFO: task kworker/u100:1:19707 blocked for more than 122 seconds.
Sep 08 15:53:40 louis kernel: Tainted: G OE 6.10.8-arch1-1 #1
Sep 08 15:53:40 louis kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 08 15:53:40 louis kernel: task:kworker/u100:1 state:D stack:0 pid:19707 tgid:19707 ppid:2 flags:0x00004000
Sep 08 15:53:40 louis kernel: Workqueue: ttm ttm_bo_delayed_delete [ttm]
Sep 08 15:53:40 louis kernel: Call Trace:
Sep 08 15:53:40 louis kernel: <TASK>
Sep 08 15:53:40 louis kernel: __schedule+0x3d5/0x1520
Sep 08 15:53:40 louis kernel: ? slab_update_freelist.isra.0+0x22/0xc0
Sep 08 15:53:40 louis kernel: ? srso_alias_return_thunk+0x5/0xfbef5
Sep 08 15:53:40 louis kernel: ? __slab_free+0xdf/0x2f0
Sep 08 15:53:40 louis kernel: schedule+0x27/0xf0
Sep 08 15:53:40 louis kernel: schedule_timeout+0x12f/0x160
Sep 08 15:53:40 louis kernel: dma_fence_default_wait+0x1d8/0x250
Sep 08 15:53:40 louis kernel: ? __pfx_dma_fence_default_wait_cb+0x10/0x10
Sep 08 15:53:40 louis kernel: dma_fence_wait_timeout+0x108/0x140
Sep 08 15:53:40 louis kernel: dma_resv_wait_timeout+0xcc/0x1c0
Sep 08 15:53:40 louis kernel: ttm_bo_delayed_delete+0x2a/0x80 [ttm 63be4c936e5801617c2d750c97be4a011ba25648]
Sep 08 15:53:40 louis kernel: process_one_work+0x17e/0x330
Sep 08 15:53:40 louis kernel: worker_thread+0x2e2/0x410
Sep 08 15:53:40 louis kernel: ? __pfx_worker_thread+0x10/0x10
Sep 08 15:53:40 louis kernel: kthread+0xd2/0x100
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork+0x34/0x50
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork_asm+0x1a/0x30
Sep 08 15:53:40 louis kernel: </TASK>
Sep 08 15:53:40 louis kernel: INFO: task kworker/u100:8:20005 blocked for more than 122 seconds.
Sep 08 15:53:40 louis kernel: Tainted: G OE 6.10.8-arch1-1 #1
Sep 08 15:53:40 louis kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 08 15:53:40 louis kernel: task:kworker/u100:8 state:D stack:0 pid:20005 tgid:20005 ppid:2 flags:0x00004000
Sep 08 15:53:40 louis kernel: Workqueue: ttm ttm_bo_delayed_delete [ttm]
Sep 08 15:53:40 louis kernel: Call Trace:
Sep 08 15:53:40 louis kernel: <TASK>
Sep 08 15:53:40 louis kernel: __schedule+0x3d5/0x1520
Sep 08 15:53:40 louis kernel: ? srso_alias_return_thunk+0x5/0xfbef5
Sep 08 15:53:40 louis kernel: ? __slab_free+0xdf/0x2f0
Sep 08 15:53:40 louis kernel: schedule+0x27/0xf0
Sep 08 15:53:40 louis kernel: schedule_timeout+0x12f/0x160
Sep 08 15:53:40 louis kernel: dma_fence_default_wait+0x1d8/0x250
Sep 08 15:53:40 louis kernel: ? __pfx_dma_fence_default_wait_cb+0x10/0x10
Sep 08 15:53:40 louis kernel: dma_fence_wait_timeout+0x108/0x140
Sep 08 15:53:40 louis kernel: dma_resv_wait_timeout+0xcc/0x1c0
Sep 08 15:53:40 louis kernel: ttm_bo_delayed_delete+0x2a/0x80 [ttm 63be4c936e5801617c2d750c97be4a011ba25648]
Sep 08 15:53:40 louis kernel: process_one_work+0x17e/0x330
Sep 08 15:53:40 louis kernel: worker_thread+0x2e2/0x410
Sep 08 15:53:40 louis kernel: ? __pfx_worker_thread+0x10/0x10
Sep 08 15:53:40 louis kernel: kthread+0xd2/0x100
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork+0x34/0x50
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork_asm+0x1a/0x30
Sep 08 15:53:40 louis kernel: </TASK>
Sep 08 15:53:40 louis kernel: INFO: task kworker/u100:9:20115 blocked for more than 245 seconds.
Sep 08 15:53:40 louis kernel: Tainted: G OE 6.10.8-arch1-1 #1
Sep 08 15:53:40 louis kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 08 15:53:40 louis kernel: task:kworker/u100:9 state:D stack:0 pid:20115 tgid:20115 ppid:2 flags:0x00004000
Sep 08 15:53:40 louis kernel: Workqueue: ttm ttm_bo_delayed_delete [ttm]
Sep 08 15:53:40 louis kernel: Call Trace:
Sep 08 15:53:40 louis kernel: <TASK>
Sep 08 15:53:40 louis kernel: __schedule+0x3d5/0x1520
Sep 08 15:53:40 louis kernel: ? srso_alias_return_thunk+0x5/0xfbef5
Sep 08 15:53:40 louis kernel: ? __slab_free+0xdf/0x2f0
Sep 08 15:53:40 louis kernel: schedule+0x27/0xf0
Sep 08 15:53:40 louis kernel: schedule_timeout+0x12f/0x160
Sep 08 15:53:40 louis kernel: dma_fence_default_wait+0x1d8/0x250
Sep 08 15:53:40 louis kernel: ? __pfx_dma_fence_default_wait_cb+0x10/0x10
Sep 08 15:53:40 louis kernel: dma_fence_wait_timeout+0x108/0x140
Sep 08 15:53:40 louis kernel: dma_resv_wait_timeout+0xcc/0x1c0
Sep 08 15:53:40 louis kernel: ttm_bo_delayed_delete+0x2a/0x80 [ttm 63be4c936e5801617c2d750c97be4a011ba25648]
Sep 08 15:53:40 louis kernel: process_one_work+0x17e/0x330
Sep 08 15:53:40 louis kernel: worker_thread+0x2e2/0x410
Sep 08 15:53:40 louis kernel: ? __pfx_worker_thread+0x10/0x10
Sep 08 15:53:40 louis kernel: kthread+0xd2/0x100
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork+0x34/0x50
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork_asm+0x1a/0x30
Sep 08 15:53:40 louis kernel: </TASK>
Sep 08 15:53:40 louis kernel: INFO: task kworker/u100:10:20116 blocked for more than 122 seconds.
Sep 08 15:53:40 louis kernel: Tainted: G OE 6.10.8-arch1-1 #1
Sep 08 15:53:40 louis kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 08 15:53:40 louis kernel: task:kworker/u100:10 state:D stack:0 pid:20116 tgid:20116 ppid:2 flags:0x00004000
Sep 08 15:53:40 louis kernel: Workqueue: ttm ttm_bo_delayed_delete [ttm]
Sep 08 15:53:40 louis kernel: Call Trace:
Sep 08 15:53:40 louis kernel: <TASK>
Sep 08 15:53:40 louis kernel: __schedule+0x3d5/0x1520
Sep 08 15:53:40 louis kernel: ? srso_alias_return_thunk+0x5/0xfbef5
Sep 08 15:53:40 louis kernel: ? __slab_free+0xdf/0x2f0
Sep 08 15:53:40 louis kernel: schedule+0x27/0xf0
Sep 08 15:53:40 louis kernel: schedule_timeout+0x12f/0x160
Sep 08 15:53:40 louis kernel: dma_fence_default_wait+0x1d8/0x250
Sep 08 15:53:40 louis kernel: ? __pfx_dma_fence_default_wait_cb+0x10/0x10
Sep 08 15:53:40 louis kernel: dma_fence_wait_timeout+0x108/0x140
Sep 08 15:53:40 louis kernel: dma_resv_wait_timeout+0xcc/0x1c0
Sep 08 15:53:40 louis kernel: ttm_bo_delayed_delete+0x2a/0x80 [ttm 63be4c936e5801617c2d750c97be4a011ba25648]
Sep 08 15:53:40 louis kernel: process_one_work+0x17e/0x330
Sep 08 15:53:40 louis kernel: worker_thread+0x2e2/0x410
Sep 08 15:53:40 louis kernel: ? __pfx_worker_thread+0x10/0x10
Sep 08 15:53:40 louis kernel: kthread+0xd2/0x100
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork+0x34/0x50
Sep 08 15:53:40 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:53:40 louis kernel: ret_from_fork_asm+0x1a/0x30
Sep 08 15:53:40 louis kernel: </TASK>
Sep 08 15:55:43 louis kernel: INFO: task kworker/u100:0:16672 blocked for more than 245 seconds.
Sep 08 15:55:43 louis kernel: Tainted: G OE 6.10.8-arch1-1 #1
Sep 08 15:55:43 louis kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Sep 08 15:55:43 louis kernel: task:kworker/u100:0 state:D stack:0 pid:16672 tgid:16672 ppid:2 flags:0x00004000
Sep 08 15:55:43 louis kernel: Workqueue: ttm ttm_bo_delayed_delete [ttm]
Sep 08 15:55:43 louis kernel: Call Trace:
Sep 08 15:55:43 louis kernel: <TASK>
Sep 08 15:55:43 louis kernel: __schedule+0x3d5/0x1520
Sep 08 15:55:43 louis kernel: schedule+0x27/0xf0
Sep 08 15:55:43 louis kernel: schedule_timeout+0x12f/0x160
Sep 08 15:55:43 louis kernel: dma_fence_default_wait+0x1d8/0x250
Sep 08 15:55:43 louis kernel: ? __pfx_dma_fence_default_wait_cb+0x10/0x10
Sep 08 15:55:43 louis kernel: dma_fence_wait_timeout+0x108/0x140
Sep 08 15:55:43 louis kernel: dma_resv_wait_timeout+0xcc/0x1c0
Sep 08 15:55:43 louis kernel: ttm_bo_delayed_delete+0x2a/0x80 [ttm 63be4c936e5801617c2d750c97be4a011ba25648]
Sep 08 15:55:43 louis kernel: process_one_work+0x17e/0x330
Sep 08 15:55:43 louis kernel: worker_thread+0x2e2/0x410
Sep 08 15:55:43 louis kernel: ? __pfx_worker_thread+0x10/0x10
Sep 08 15:55:43 louis kernel: kthread+0xd2/0x100
Sep 08 15:55:43 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:55:43 louis kernel: ret_from_fork+0x34/0x50
Sep 08 15:55:43 louis kernel: ? __pfx_kthread+0x10/0x10
Sep 08 15:55:43 louis kernel: ret_from_fork_asm+0x1a/0x30
Sep 08 15:55:43 louis kernel: </TASK>
Sep 08 15:55:43 louis kernel: Future hung task reports are suppressed, see sysctl kernel.hung_task_warnings
It starts after the issue, though.
For the first log I sent, I'm pretty sure the next line up was from 2 hours before the issue.
I'll look into the bios settings for the gpu fans and give it another shot.
EDIT: I maxxed out at 89C with two different stress tests (one on windows and one linux) with no issues.
Last edited by onshore0927 (2024-09-09 09:15:52)
Offline
EDIT: I maxxed out at 89C with two different stress tests (one on windows and one linux) with no issues.
W/ the troublesome GPU or the replacement?
You could try to swap it back, maybe it was only badly seated.
Otherwise try a different SW stack (live distro) or at least different kernel - if the issue remains w/ the GPU, it's increasinly likey defective HW.
Offline
Those stress tests were with the troublesome GPU.
I'm trying it out today to see if it really just needed to be reseated or if there's an underlying issue.
Offline
so, after reinstalling the gpu, it hasn't had an issue all day.
I'll come back if it happens again, but i'll mark as solved for now.
thanks for the input!
Offline
Pages: 1