You are not logged in.
I'm getting crashes resulting in a gpu reset randomly, mostly but not exclusively while watching hw decoded youtube in firefox.
[Sep10 03:54] amdgpu 0000:c4:00.0: amdgpu: ring vcn_unified_0 timeout, signaled seq=53987, emitted seq=53988
[ +0.000010] amdgpu 0000:c4:00.0: amdgpu: Process information: process RDD Process pid 13895 thread firefox:cs0 pid 15256
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: GPU reset begin!
[ +0.517341] amdgpu 0000:c4:00.0: amdgpu: Dumping IP State
[ +0.000634] amdgpu 0000:c4:00.0: amdgpu: Dumping IP State Completed
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: MODE2 reset
[ +0.022087] amdgpu 0000:c4:00.0: amdgpu: GPU reset succeeded, trying to resume
[ +0.000436] [drm] PCIE GART of 512M enabled (table at 0x00000081FFB00000).
[ +0.000225] [drm] VRAM is lost due to GPU reset!
[ +0.000007] amdgpu 0000:c4:00.0: amdgpu: SMU is resuming...
[ +0.002345] amdgpu 0000:c4:00.0: amdgpu: SMU is resumed successfully!
[ +0.003107] [drm] DMUB hardware initialized: version=0x0000E600Time between resets is 10 to 100mins, roughly.
Hardware:
AMD Ryzen AI 9 365 w/ Radeon 880M
[Wiki] ASUS Zenbook S 16 UM5606WA_UM5606WA
Software:
aur/linux-mainline-um5606 6.11rc7-1, which includes patches for this laptop
plasmashell & kwin_wayland and native firefox wayland
not using any related kernel parameters (shouldn't be needed with that kernel?)
Since some people report having gotten the UM5606 stable on Linux, I'd like to know what configuration you're running exactly
Last edited by fallingcats (2024-09-10 03:28:03)
Offline
Do you already know which change caused the issue? Or was it always like this?
Offline
Do you already know which change caused the issue? Or was it always like this?
The laptop is a couple days old and I've never had it not do that, not with any kernel / parameter / firmware combination.
Offline
x-ref, https://bbs.archlinux.org/viewtopic.php?id=299274
Do you also get a page fault from dml_core_mode_support ?
Offline
x-ref, https://bbs.archlinux.org/viewtopic.php?id=299274
Do you also get a page fault from dml_core_mode_support ?
No, I don't think so. I haven't seen that.
Here is two crashes after each other, nothing cut:
[Sep10 13:03] amdgpu 0000:c4:00.0: amdgpu: ring vcn_unified_0 timeout, signaled seq=20349, emitted seq=20350
[ +0.000017] amdgpu 0000:c4:00.0: amdgpu: Process information: process RDD Process pid 13776 thread firefox:cs0 pid 22237
[ +0.000004] amdgpu 0000:c4:00.0: amdgpu: GPU reset begin!
[ +0.516105] amdgpu 0000:c4:00.0: amdgpu: Dumping IP State
[ +0.000700] amdgpu 0000:c4:00.0: amdgpu: Dumping IP State Completed
[ +0.000003] amdgpu 0000:c4:00.0: amdgpu: MODE2 reset
[ +0.022373] amdgpu 0000:c4:00.0: amdgpu: GPU reset succeeded, trying to resume
[ +0.000458] [drm] PCIE GART of 512M enabled (table at 0x00000081FFB00000).
[ +0.000246] [drm] VRAM is lost due to GPU reset!
[ +0.000006] amdgpu 0000:c4:00.0: amdgpu: SMU is resuming...
[ +0.002681] amdgpu 0000:c4:00.0: amdgpu: SMU is resumed successfully!
[ +0.003115] [drm] DMUB hardware initialized: version=0x0000E600
[ +0.796011] amdgpu 0000:c4:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[ +0.000015] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[ +0.000003] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[ +0.000003] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[ +0.000003] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring vcn_unified_0 uses VM inv eng 0 on hub 8
[ +0.000003] amdgpu 0000:c4:00.0: amdgpu: ring jpeg_dec_0 uses VM inv eng 1 on hub 8
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring mes_kiq_3.1.0 uses VM inv eng 13 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring vpe uses VM inv eng 4 on hub 8
[ +0.015962] amdgpu 0000:c4:00.0: amdgpu: recover vram bo from shadow start
[ +0.000006] amdgpu 0000:c4:00.0: amdgpu: recover vram bo from shadow done
[ +0.000022] amdgpu 0000:c4:00.0: amdgpu: GPU reset(1) succeeded!
[ +0.001449] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
[ +0.093227] firefox:cs0[22237]: segfault at 0 ip 0000558f46457aa0 sp 00007f82421ff990 error 6 in firefox[a6aa0,558f463d0000+c2000] likely on CPU 11 (core 1, socket 0)
[ +0.000019] Code: 53 50 48 89 fb 4c 8b 35 5e be 03 00 49 8b 36 ff 15 15 bf 03 00 49 8b 36 bf 0a 00 00 00 ff 15 7f bf 03 00 48 89 1d f8 ef 03 00 <c7> 04 25 00 00 00 00 23 00 00 00 e8 00 00 00 00 f3 0f 1e fa 50 48
[ +3.130865] traps: signal-desktop[1621] trap int3 ip:564c836c3eaa sp:7fffe4748510 error:0 in signal-desktop[5e33eaa,564c7f878000+864a000]
[Sep10 13:05] wlan0: disconnect from AP e6:38:83:95:cc:ed for new auth to 7e:45:58:83:a4:1a
[ +0.352466] wlan0: authenticate with 7e:45:58:83:a4:1a (local address=28:d0:43:ad:8c:00)
[ +0.016551] wlan0: send auth to 7e:45:58:83:a4:1a (try 1/3)
[ +0.025490] wlan0: authenticated
[ +0.005223] wlan0: associate with 7e:45:58:83:a4:1a (try 1/3)
[ +0.059246] wlan0: RX ReassocResp from 7e:45:58:83:a4:1a (capab=0x1111 status=0 aid=1)
[ +0.036427] wlan0: associated
[ +0.115369] wlan0: Limiting TX power to 30 (30 - 0) dBm as advertised by 7e:45:58:83:a4:1a
[Sep10 13:19] ucsi_acpi USBC000:00: ucsi_handle_connector_change: GET_CONNECTOR_STATUS failed (-70)
[Sep10 13:20] ucsi_acpi USBC000:00: GET_CONNECTOR_STATUS failed (-70)
[Sep10 13:21] pcieport 0000:00:08.3: PME: Spurious native interrupt!
[Sep10 14:44] amdgpu 0000:c4:00.0: amdgpu: ring vcn_unified_0 timeout, signaled seq=156946, emitted seq=156947
[ +0.000017] amdgpu 0000:c4:00.0: amdgpu: Process information: process RDD Process pid 24312 thread firefox:cs0 pid 83773
[ +0.000004] amdgpu 0000:c4:00.0: amdgpu: GPU reset begin!
[ +0.522424] amdgpu 0000:c4:00.0: amdgpu: Dumping IP State
[ +0.000682] amdgpu 0000:c4:00.0: amdgpu: Dumping IP State Completed
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: MODE2 reset
[ +0.022114] amdgpu 0000:c4:00.0: amdgpu: GPU reset succeeded, trying to resume
[ +0.000463] [drm] PCIE GART of 512M enabled (table at 0x00000081FFB00000).
[ +0.000247] [drm] VRAM is lost due to GPU reset!
[ +0.000004] amdgpu 0000:c4:00.0: amdgpu: SMU is resuming...
[ +0.002130] amdgpu 0000:c4:00.0: amdgpu: SMU is resumed successfully!
[ +0.004052] [drm] DMUB hardware initialized: version=0x0000E600
[ +0.327926] amdgpu 0000:c4:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[ +0.000014] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[ +0.000003] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[ +0.000001] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[ +0.000001] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[ +0.000001] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[ +0.000001] amdgpu 0000:c4:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring vcn_unified_0 uses VM inv eng 0 on hub 8
[ +0.000001] amdgpu 0000:c4:00.0: amdgpu: ring jpeg_dec_0 uses VM inv eng 1 on hub 8
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring mes_kiq_3.1.0 uses VM inv eng 13 on hub 0
[ +0.000002] amdgpu 0000:c4:00.0: amdgpu: ring vpe uses VM inv eng 4 on hub 8
[ +0.010810] amdgpu 0000:c4:00.0: amdgpu: recover vram bo from shadow start
[ +0.000003] amdgpu 0000:c4:00.0: amdgpu: recover vram bo from shadow done
[ +0.000023] amdgpu 0000:c4:00.0: amdgpu: GPU reset(2) succeeded!
[ +0.082783] firefox:cs0[83773]: segfault at 0 ip 000055644fe10aa0 sp 00007f534c9ff990 error 6 in firefox[a6aa0,55644fd89000+c2000] likely on CPU 15 (core 9, socket 0)
[ +0.000024] Code: 53 50 48 89 fb 4c 8b 35 5e be 03 00 49 8b 36 ff 15 15 bf 03 00 49 8b 36 bf 0a 00 00 00 ff 15 7f bf 03 00 48 89 1d f8 ef 03 00 <c7> 04 25 00 00 00 00 23 00 00 00 e8 00 00 00 00 f3 0f 1e fa 50 48Offline