You are not logged in.

#1 2025-03-09 08:01:27

md5sum
Member
Registered: 2025-03-09
Posts: 2

amdgpu bug

kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:4 pasid:32784)
kernel: amdgpu 0000:05:00.0: amdgpu:  in process firefox pid 1248 thread firefox:cs0 pid 1326
kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000000000000000 from client 0x1b (UTCL2)
kernel: amdgpu 0000:05:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00401430
kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x0
kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x0
kernel: amdgpu 0000:05:00.0: amdgpu: Dumping IP State
kernel: amdgpu 0000:05:00.0: amdgpu: Dumping IP State Completed
kernel: amdgpu 0000:05:00.0: amdgpu: ring gfx_0.0.0 timeout, but soft recovered

if you need any more info please let me know.

(I'm sorry if the formatting is bad, its my first post on this forum)

Offline

#2 2025-03-09 08:56:45

seth
Member
From: Won't reply 2 private help req
Registered: 2012-09-03
Posts: 75,086

Re: amdgpu bug

https://bbs.archlinux.org/search.php?ac … ULT_STATUS

if you need any more info please let me know.

You mean like
- what hardware you're actually using
- context of the reset
- one off or frequent problem
- trigger environment (display server, kernel, mesa versions)
- anything beyond some isolated kernel messages indicating that your GPU recovered from a null pointer access?

Offline

#3 2025-03-09 17:18:31

md5sum
Member
Registered: 2025-03-09
Posts: 2

Re: amdgpu bug

i have amd ryzen 5 7535HS and nvidia gtx 2050 mobile.

it doesn't really crashes the entire system, the screen just freezes for a short period of time (3-4 secs) and everything goes back to normal.
the reset happens randomly for me.

it happens on both xorg and wayland.

my grub options:

GRUB_CMDLINE_LINUX_DEFAULT="loglevel=3 quiet nvidia-drm.modeset=1 amdgpu.bapm=0 amdgpu.runpm=0 amdgpu.aspm=0 pcie_aspm=off amdgpu.ppfeaturemask=0xffff8 amdgpu.dcdebugmask=0x10 nvidia-drm.fbdev=1"
GRUB_CMDLINE_LINUX="acpi_backlight=native"

without those amdgpu params the screen becomes ~1 fps and in order to fix this i need to reboot (i tried typing xrandr to see if screen refresh rate changes, and no it does not.)

Offline

#4 2025-03-09 20:51:28

seth
Member
From: Won't reply 2 private help req
Registered: 2012-09-03
Posts: 75,086

Re: amdgpu bug

Please post your Xorg log, https://wiki.archlinux.org/title/Xorg#General (for a general overview of the HW configuration)

Offline

#5 2025-03-15 12:08:23

daeler
Member
Registered: 2019-10-03
Posts: 1

Re: amdgpu bug

I have the same error (on Fedora, Linux kernel 6.13.6) since the recently released linux-firmware of 20250311on a Ryzen 9 7900 using the iGPU, what version of linux-firmware are you using? Firefox was updated recently to 136.0.1 though, maybe it's just a bug in Firefox.

mrt 15 12:23:03 sophia kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:3 pasid:32773)
mrt 15 12:23:03 sophia kernel: amdgpu 0000:05:00.0: amdgpu:  in process firefox pid 122191 thread firefox:cs0 pid 122278
mrt 15 12:23:03 sophia kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000000000000000 from client 0x1b (UTCL2)
mrt 15 12:23:03 sophia kernel: amdgpu 0000:05:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00301430
mrt 15 12:23:03 sophia kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: SQC (data) (0xa)
mrt 15 12:23:03 sophia kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x0
mrt 15 12:23:03 sophia kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
mrt 15 12:23:03 sophia kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x3
mrt 15 12:23:03 sophia kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
mrt 15 12:23:03 sophia kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x0
mrt 15 12:23:13 sophia kernel: amdgpu 0000:05:00.0: amdgpu: Dumping IP State
mrt 15 12:23:13 sophia kernel: amdgpu 0000:05:00.0: amdgpu: Dumping IP State Completed
mrt 15 12:23:13 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring gfx_0.1.0 timeout, signaled seq=858052, emitted seq=858053
mrt 15 12:23:13 sophia kernel: amdgpu 0000:05:00.0: amdgpu: Process information: process gnome-shell pid 1799 thread gnome-shel:cs0 pid 1841
mrt 15 12:23:13 sophia kernel: amdgpu 0000:05:00.0: amdgpu: Starting gfx_0.1.0 ring reset
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: Ring gfx_0.1.0 reset failure
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset begin!
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: MODE2 reset
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset succeeded, trying to resume
mrt 15 12:23:14 sophia kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000F41FC00000).
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: PSP is resuming...
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: reserve 0xa00000 from 0xf41e000000 for PSP TMR
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: RAS: optional ras ta ucode is not available
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: RAP: optional rap ta ucode is not available
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: SMU is resuming...
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: SMU is resumed successfully!
mrt 15 12:23:14 sophia kernel: [drm] kiq ring mec 2 pipe 1 q 0
mrt 15 12:23:14 sophia kernel: [drm] DMUB hardware initialized: version=0x05001C00
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: [drm] *ERROR* lttpr_caps phy_repeater_cnt is 0x0, forcing it to 0x80.
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 12 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 8
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 8
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 8
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8
mrt 15 12:23:14 sophia kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset(2) succeeded!
mrt 15 12:23:14 sophia kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!

Last edited by daeler (2025-03-15 12:15:18)

Offline

#6 2025-03-22 08:42:36

zbinlin
Member
Registered: 2025-03-22
Posts: 1

Re: amdgpu bug

I downgrade mesa from 1:25.0.1-2 to 1:24.3.4-1 resolved.

Offline

Board footer

Powered by FluxBB