You are not logged in.

#1 2025-09-13 18:42:45

Shino
Member
From: Germany
Registered: 2015-02-01
Posts: 109

Folding at Home crashes AMDGPU and kills Xorg

Hi,

I'm trying out Folding at Home on my AMD Radeon RX 9060. After several minutes, my Desktop session suddenly died:

Subject: Process 2168 (Xorg) dumped core
Defined-By: systemd
Support: https://lists.freedesktop.org/mailman/listinfo/systemd-devel
Documentation: man:core(5)

Process 2168 (Xorg) crashed and dumped core.

This usually indicates a programming error in the crashing program and
should be reported to its vendor as a bug.

This is the coredump of Xorg: https://pastebin.com/7iDmZ7Ct

The full log of the crash https://pastebin.com/U7RQy6b6

It all starts with an error in chromium

Sep 13 20:19:17 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu:  in process chromium pid 7968 thread chromium:cs0 pid 7986)

After that, F@H crashes and shortly after the whole XServer.

Is this a driver problem? It seems related to https://gitlab.freedesktop.org/drm/amd/-/issues/3067

Sep 13 20:19:23 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: GPU reset begin!
Sep 13 20:19:23 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: remove_all_kfd_queues_mes: Failed to remove queue 2 for dev 16300
Sep 13 20:19:23 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: Dumping IP State
Sep 13 20:19:23 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: Dumping IP State Completed
Sep 13 20:19:23 ryzen9 systemd-coredump[28376]: Process 27667 (fah-client) of user 959 terminated abnormally with signal 6/ABRT, processing...
Sep 13 20:19:23 ryzen9 systemd[1]: Created slice Slice /system/systemd-coredump.
Sep 13 20:19:23 ryzen9 systemd[1]: Started Process Core Dump (PID 28376/UID 0).
Sep 13 20:19:23 ryzen9 kernel: amdgpu: Freeing queue vital buffer 0x7f657b000000, queue evicted
Sep 13 20:19:23 ryzen9 kernel: amdgpu: Freeing queue vital buffer 0x7f6580e00000, queue evicted
Sep 13 20:19:28 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: MES(1) failed to respond to msg=REMOVE_QUEUE
Sep 13 20:19:28 ryzen9 kernel: [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
Sep 13 20:19:28 ryzen9 kernel: [drm:gfx_v12_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
Sep 13 20:19:28 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: MODE1 reset
Sep 13 20:19:28 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: GPU mode1 reset
Sep 13 20:19:28 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: GPU smu mode1 reset
Sep 13 20:19:29 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: GPU reset succeeded, trying to resume
Sep 13 20:19:29 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: PCIE GART of 512M enabled (table at 0x00000083DAB00000).
Sep 13 20:19:29 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: [drm] AMDGPU device coredump file has been created
Sep 13 20:19:29 ryzen9 kernel: amdgpu 0000:03:00.0: amdgpu: [drm] Check your /sys/class/drm/card1/device/devcoredump/data
Sep 13 20:19:29 ryzen9 kernel: [drm] VRAM is lost due to GPU reset!

Last edited by Shino (2025-09-13 18:45:58)

Offline

#2 2025-09-13 19:25:16

Head_on_a_Stick
Member
From: The Wirral
Registered: 2014-02-20
Posts: 9,003
Website

Re: Folding at Home crashes AMDGPU and kills Xorg


Jin, Jîyan, Azadî

Offline

Board footer

Powered by FluxBB