You are not logged in.
Pages: 1
I am running the current plasma desktop (6.5.3). Unfortunately it tends to crash every now and then. It throws me back to the login screen and I can see the following in dmesg:
[165681.988401] amdgpu 0000:04:00.0: amdgpu: Dumping IP State
[165681.989505] amdgpu 0000:04:00.0: amdgpu: Dumping IP State Completed
[165681.989586] amdgpu 0000:04:00.0: amdgpu: [drm] AMDGPU device coredump file has been created
[165681.989590] amdgpu 0000:04:00.0: amdgpu: [drm] Check your /sys/class/drm/card1/device/devcoredump/data
[165681.989593] amdgpu 0000:04:00.0: amdgpu: ring sdma0 timeout, signaled seq=621194, emitted seq=621197
[165681.989599] amdgpu 0000:04:00.0: amdgpu: GPU reset begin!. Source: 1
[165682.368964] amdgpu 0000:04:00.0: amdgpu: MODE2 reset
[165682.379123] amdgpu 0000:04:00.0: amdgpu: GPU reset succeeded, trying to resume
[165682.379948] [drm] PCIE GART of 1024M enabled (table at 0x000000F43FC00000).
[165682.379986] amdgpu 0000:04:00.0: amdgpu: PSP is resuming...
[165682.402303] amdgpu 0000:04:00.0: amdgpu: reserve 0xa00000 from 0xf43e000000 for PSP TMR
[165683.269478] amdgpu 0000:04:00.0: amdgpu: SMU is resuming...
[165683.269885] amdgpu 0000:04:00.0: amdgpu: SMU is resumed successfully!
[165683.270154] amdgpu 0000:04:00.0: amdgpu: kiq ring mec 2 pipe 1 q 0
[165683.283473] amdgpu 0000:04:00.0: amdgpu: [drm] DMUB hardware initialized: version=0x0300000C
[165683.643971] [drm] Failed to add display topology, DTM TA is not initialized.
[165684.028487] [drm] Failed to add display topology, DTM TA is not initialized.
[165684.108230] amdgpu 0000:04:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[165684.108239] amdgpu 0000:04:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
[165684.108243] amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
[165684.108247] amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
[165684.108251] amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[165684.108255] amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[165684.108258] amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[165684.108262] amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[165684.108265] amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[165684.108269] amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[165684.108272] amdgpu 0000:04:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 12 on hub 0
[165684.108276] amdgpu 0000:04:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
[165684.108279] amdgpu 0000:04:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 8
[165684.108283] amdgpu 0000:04:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 8
[165684.108286] amdgpu 0000:04:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 8
[165684.108290] amdgpu 0000:04:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8
[165684.112013] amdgpu 0000:04:00.0: amdgpu: GPU reset(1) succeeded!
[165684.112041] amdgpu 0000:04:00.0: [drm] device wedged, but recovered through reset
[165685.324046] QThread[1353440]: segfault at 58 ip 00007f02202d5953 sp 00007f02097f93b0 error 4 in libqtmedia_pulse.so[b953,7f02202d0000+b000] likely on CPU 1 (core 0, socket 0)
[165685.324070] Code: 93 04 00 00 0f 1f 00 48 8b 7b 60 ff 15 0e a5 00 00 48 85 c0 0f 84 ad 01 00 00 48 c7 85 78 ff ff ff 00 00 00 00 e8 dd ee ff ff <48> 8b 78 58 48 89 85 68 ff ff ff 48 85 ff 74 06 ff 15 b7 a3 00 00
[165686.143981] [drm] Failed to add display topology, DTM TA is not initialized.
[165686.516017] [drm] Failed to add display topology, DTM TA is not initialized.
This is on steam deck (lcd version).
Any thoughts how to fix this?
Offline
Please use [code][/code] tags. Edit your post in this regard.
[165685.324046] QThread[1353440]: segfault at 58 ip 00007f02202d5953 sp 00007f02097f93b0 error 4 in libqtmedia_pulse.so[b953,7f02202d0000+b000] likely on CPU 1 (core 0, socket 0)
https://wiki.archlinux.org/title/Core_d … _core_dump
Other than that the GPU reset, but it's not 100% clear whether and how that caused a session crash.
Please post your complete system journal for a boot covering such crash, eg.
sudo journalctl -b -1 | curl -F 'file=@-' 0x0.stfor the previous ("-1") one.
Offline
Journal submitted. I have seen this crash before and it happens always after GPU reset.
Offline
You have to give us the URL it gave you.
Offline
haha..yes:
http://0x0.st/Kte4.txt
Offline
Nov 30 14:36:44 ham kernel: amdgpu 0000:04:00.0: amdgpu: Dumping IP State
Nov 30 14:36:44 ham kernel: amdgpu 0000:04:00.0: amdgpu: Dumping IP State Completed
Nov 30 14:36:44 ham kernel: amdgpu 0000:04:00.0: amdgpu: [drm] AMDGPU device coredump file has been created
Nov 30 14:36:44 ham kernel: amdgpu 0000:04:00.0: amdgpu: [drm] Check your /sys/class/drm/card1/device/devcoredump/data
Nov 30 14:36:44 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring sdma0 timeout, signaled seq=76547, emitted seq=76549
Nov 30 14:36:44 ham kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset begin!
Nov 30 14:36:45 ham kernel: amdgpu 0000:04:00.0: amdgpu: MODE2 reset
Nov 30 14:36:45 ham kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset succeeded, trying to resume
Nov 30 14:36:45 ham kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000F43FC00000).
Nov 30 14:36:45 ham kernel: amdgpu 0000:04:00.0: amdgpu: PSP is resuming...
Nov 30 14:36:45 ham kernel: amdgpu 0000:04:00.0: amdgpu: reserve 0xa00000 from 0xf43e000000 for PSP TMR
Nov 30 14:36:46 ham kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resuming...
Nov 30 14:36:46 ham kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resumed successfully!
Nov 30 14:36:46 ham kernel: amdgpu 0000:04:00.0: amdgpu: kiq ring mec 2 pipe 1 q 0
Nov 30 14:36:46 ham kernel: amdgpu 0000:04:00.0: amdgpu: [drm] DMUB hardware initialized: version=0x0300000C
Nov 30 14:36:46 ham kernel: [drm] Failed to add display topology, DTM TA is not initialized.
Nov 30 14:36:47 ham kernel: [drm] Failed to add display topology, DTM TA is not initialized.
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 12 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 8
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 8
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 8
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset(1) succeeded!
Nov 30 14:36:47 ham kernel: amdgpu 0000:04:00.0: [drm] device wedged, but recovered through reset
Nov 30 14:36:47 ham systemd-coredump[516591]: Process 1022 (Xorg) of user 0 terminated abnormally with signal 6/ABRT, processing...
Nov 30 14:36:47 ham systemd[1]: Created slice Slice /system/drkonqi-coredump-processor.
Nov 30 14:36:47 ham systemd[1]: Created slice Slice /system/systemd-coredump.
Nov 30 14:36:47 ham systemd[1]: Started Process Core Dump (PID 516591/UID 0).
Nov 30 14:36:47 ham systemd[1]: Started Pass systemd-coredump journal entries to relevant user for potential DrKonqi handling.
Nov 30 14:36:47 ham systemd-coredump[516592]: Process 1022 (Xorg) of user 0 dumped core.
Stack trace of thread 1029:
#0 0x00007f91358255df abort (libc.so.6 + 0x255df)
#1 0x00007f9132eefa43 n/a (libgallium-25.3.0-arch1.2.so + 0xaefa43)
#2 0x00007f9132ef3d54 n/a (libgallium-25.3.0-arch1.2.so + 0xaf3d54)
#3 0x00007f9132997d55 n/a (libgallium-25.3.0-arch1.2.so + 0x597d55)
#4 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#5 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#6 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1037:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1038:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1022:
#0 0x00007f913591872d syscall (libc.so.6 + 0x11872d)
#1 0x00007f913298a35b n/a (libgallium-25.3.0-arch1.2.so + 0x58a35b)
#2 0x00007f9132997923 n/a (libgallium-25.3.0-arch1.2.so + 0x597923)
#3 0x00007f9132ed4fce n/a (libgallium-25.3.0-arch1.2.so + 0xad4fce)
#4 0x00007f9132beccd8 n/a (libgallium-25.3.0-arch1.2.so + 0x7eccd8)
#5 0x00007f91325467ad n/a (libgallium-25.3.0-arch1.2.so + 0x1467ad)
#6 0x00007f91325468d0 n/a (libgallium-25.3.0-arch1.2.so + 0x1468d0)
#7 0x00007f91288bd2dd n/a (libglamoregl.so + 0xd2dd)
#8 0x00007f91354fe9f4 n/a (amdgpu_drv.so + 0x119f4)
#9 0x0000555def5c8814 BlockHandler (/usr/lib/Xorg + 0x77814)
#10 0x0000555def6add46 WaitForSomething (/usr/lib/Xorg + 0x15cd46)
#11 0x0000555def584b99 n/a (/usr/lib/Xorg + 0x33b99)
#12 0x00007f9135827635 n/a (libc.so.6 + 0x27635)
#13 0x00007f91358276e9 __libc_start_main (libc.so.6 + 0x276e9)
#14 0x0000555def585d05 _start (/usr/lib/Xorg + 0x34d05)
Stack trace of thread 1954:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1031:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1032:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1039:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1030:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1033:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1051:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358931b4 n/a (libc.so.6 + 0x931b4)
#3 0x00007f913591acb5 epoll_wait (libc.so.6 + 0x11acb5)
#4 0x0000555def6b0dae n/a (/usr/lib/Xorg + 0x15fdae)
#5 0x0000555def6b2599 n/a (/usr/lib/Xorg + 0x161599)
#6 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#7 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1034:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1088:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1291:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1035:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1293:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
Stack trace of thread 1036:
#0 0x00007f913589f002 n/a (libc.so.6 + 0x9f002)
#1 0x00007f913589316c n/a (libc.so.6 + 0x9316c)
#2 0x00007f91358937dc n/a (libc.so.6 + 0x937dc)
#3 0x00007f9135895e9e pthread_cond_wait (libc.so.6 + 0x95e9e)
#4 0x00007f91329e28ae n/a (libgallium-25.3.0-arch1.2.so + 0x5e28ae)
#5 0x00007f9132997c7d n/a (libgallium-25.3.0-arch1.2.so + 0x597c7d)
#6 0x00007f91329e27dd n/a (libgallium-25.3.0-arch1.2.so + 0x5e27dd)
#7 0x00007f913589698b n/a (libc.so.6 + 0x9698b)
#8 0x00007f913591a9cc n/a (libc.so.6 + 0x11a9cc)
ELF object binary architecture: AMD x86-64Possibly related to https://bbs.archlinux.org/viewtopic.php?id=310552 but there's not second GPU.
Do you get the same w/ the LTS kernel?
Offline
OK, will test.
Offline
Yes, it happens with LTS kernel too. Here is the relevant dmesg:
[ 7310.611016] amdgpu 0000:04:00.0: amdgpu: Dumping IP State
[ 7310.612239] amdgpu 0000:04:00.0: amdgpu: Dumping IP State Completed
[ 7310.612316] amdgpu 0000:04:00.0: amdgpu: ring sdma0 timeout, signaled seq=15291, emitted seq=15293
[ 7310.612322] amdgpu 0000:04:00.0: amdgpu: GPU reset begin!
[ 7311.007660] amdgpu 0000:04:00.0: amdgpu: MODE2 reset
[ 7311.017860] amdgpu 0000:04:00.0: amdgpu: GPU reset succeeded, trying to resume
[ 7311.018484] [drm] PCIE GART of 1024M enabled (table at 0x000000F43FC00000).
[ 7311.018526] amdgpu 0000:04:00.0: amdgpu: PSP is resuming...
[ 7311.040848] amdgpu 0000:04:00.0: amdgpu: reserve 0xa00000 from 0xf43e000000 for PSP TMR
[ 7311.904230] amdgpu 0000:04:00.0: amdgpu: SMU is resuming...
[ 7311.904983] amdgpu 0000:04:00.0: amdgpu: SMU is resumed successfully!
[ 7311.905391] [drm] kiq ring mec 2 pipe 1 q 0
[ 7311.917903] [drm] DMUB hardware initialized: version=0x0300000C
[ 7312.325452] [drm] Failed to add display topology, DTM TA is not initialized.
[ 7312.667420] [drm] Failed to add display topology, DTM TA is not initialized.
[ 7312.742325] amdgpu 0000:04:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[ 7312.742333] amdgpu 0000:04:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
[ 7312.742336] amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
[ 7312.742339] amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
[ 7312.742341] amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[ 7312.742344] amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[ 7312.742346] amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[ 7312.742348] amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[ 7312.742351] amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[ 7312.742353] amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[ 7312.742356] amdgpu 0000:04:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 12 on hub 0
[ 7312.742358] amdgpu 0000:04:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
[ 7312.742361] amdgpu 0000:04:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 8
[ 7312.742363] amdgpu 0000:04:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 8
[ 7312.742366] amdgpu 0000:04:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 8
[ 7312.742368] amdgpu 0000:04:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8
[ 7312.746902] amdgpu 0000:04:00.0: amdgpu: GPU reset(1) succeeded!
[ 7314.839782] [drm] Failed to add display topology, DTM TA is not initialized.
[ 7315.254103] [drm] Failed to add display topology, DTM TA is not initialized.
[ 7315.629606] [drm] Failed to add display topology, DTM TA is not initialized.
[ 7917.645365] [drm] Failed to add display topology, DTM TA is not initialized.
[ 8305.191822] [drm] Failed to add display topology, DTM TA is not initialized.
[ 8305.607151] [drm] Failed to add display topology, DTM TA is not initialized.
[ 8305.941749] [drm] Failed to add display topology, DTM TA is not initialized.
[ 8309.432875] cs35l41 spi-VLV1776:00: DSP1: cirrus/cs35l41-dsp1-spk-prot.wmfw: format 3 timestamp 0x62b5c26c
[ 8309.432885] cs35l41 spi-VLV1776:00: DSP1: cirrus/cs35l41-dsp1-spk-prot.wmfw: Fri 24 Jun 2022 14:55:56 GMT Daylight Time
[ 8309.541740] cs35l41 spi-VLV1776:00: DSP1: Firmware: 400a4 vendor: 0x2 v0.58.0, 2 algorithms
[ 8309.542002] cs35l41 spi-VLV1776:00: DSP1: cirrus/cs35l41-dsp1-spk-prot.bin: v0.58.0
[ 8309.542006] cs35l41 spi-VLV1776:00: DSP1: Protection: e:\workspace\workspace\tibranch_release_playback_6.76_2\ormis\staging\default_tunings\internal\CS35L
[ 8309.546217] cs35l41 spi-VLV1776:01: DSP1: cirrus/cs35l41-dsp1-spk-prot.wmfw: format 3 timestamp 0x62b5c26c
[ 8309.546226] cs35l41 spi-VLV1776:01: DSP1: cirrus/cs35l41-dsp1-spk-prot.wmfw: Fri 24 Jun 2022 14:55:56 GMT Daylight Time
[ 8309.653557] cs35l41 spi-VLV1776:01: DSP1: Firmware: 400a4 vendor: 0x2 v0.58.0, 2 algorithms
[ 8309.653827] cs35l41 spi-VLV1776:01: DSP1: cirrus/cs35l41-dsp1-spk-prot.bin: v0.58.0
[ 8309.653832] cs35l41 spi-VLV1776:01: DSP1: Protection: e:\workspace\workspace\tibranch_release_playback_6.76_2\ormis\staging\default_tunings\internal\CS35LAgain happens right after GPU reset.
Last edited by aa6kj (2025-12-07 21:27:32)
Offline
Please use [code][/code] tags. Edit your post in this regard.
https://gitlab.freedesktop.org/drm/amd/ … ge_size=20
https://bbs.archlinux.org/viewtopic.php?id=310552 is most likely https://gitlab.freedesktop.org/drm/amd/-/issues/4761
Offline
Forgot to say, it does not crash or freeze the system, just Xorg flips and it is back to login screen. I think you wanted me to add code and /code (in brackets) around the dmesg lines? I did that now.
Last edited by aa6kj (2025-12-07 21:35:18)
Offline
Nothing in the other thread really looks like a "freeze" on the kernel level at all, but it's also a different context and the other user doesn't have another GPU, he's actually running everything on the troublesome one.
Offline
The first link you posted does mention system lockup. Just wanted to highlight that it only crashes xorg.
Offline
Sadly this happens pretty regularly. About once a day. The latest one looks slightly different (see below). It says something about core dump under /sys. Would that be useful and if yes, how & where should I send that? This is with the latest kernel from testing.
12498.901130] amdgpu 0000:04:00.0: amdgpu: Dumping IP State
[12498.902669] amdgpu 0000:04:00.0: amdgpu: Dumping IP State Completed
[12498.902774] amdgpu 0000:04:00.0: amdgpu: [drm] AMDGPU device coredump file has been created
[12498.902779] amdgpu 0000:04:00.0: amdgpu: [drm] Check your /sys/class/drm/card1/device/devcoredump/data
[12498.902784] amdgpu 0000:04:00.0: amdgpu: ring sdma0 timeout, signaled seq=71363, emitted seq=71365
[12498.902792] amdgpu 0000:04:00.0: amdgpu: GPU reset begin!. Source: 1
[12499.284236] amdgpu 0000:04:00.0: amdgpu: MODE2 reset
[12499.294431] amdgpu 0000:04:00.0: amdgpu: GPU reset succeeded, trying to resume
[12499.295095] [drm] PCIE GART of 1024M enabled (table at 0x000000F43FC00000).
[12499.295136] amdgpu 0000:04:00.0: amdgpu: PSP is resuming...
[12499.317467] amdgpu 0000:04:00.0: amdgpu: reserve 0xa00000 from 0xf43e000000 for PSP TMR
[12500.175063] amdgpu 0000:04:00.0: amdgpu: SMU is resuming...
[12500.176071] amdgpu 0000:04:00.0: amdgpu: SMU is resumed successfully!
[12500.176349] amdgpu 0000:04:00.0: amdgpu: kiq ring mec 2 pipe 1 q 0
[12500.189984] amdgpu 0000:04:00.0: amdgpu: [drm] DMUB hardware initialized: version=0x0300000C
[12500.548719] [drm] Failed to add display topology, DTM TA is not initialized.
[12500.932741] [drm] Failed to add display topology, DTM TA is not initialized.
[12501.014530] amdgpu 0000:04:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[12501.014539] amdgpu 0000:04:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
[12501.014543] amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
[12501.014547] amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
[12501.014551] amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[12501.014554] amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[12501.014557] amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[12501.014561] amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[12501.014564] amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[12501.014568] amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[12501.014572] amdgpu 0000:04:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 12 on hub 0
[12501.014575] amdgpu 0000:04:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
[12501.014579] amdgpu 0000:04:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 8
[12501.014582] amdgpu 0000:04:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 8
[12501.014586] amdgpu 0000:04:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 8
[12501.014589] amdgpu 0000:04:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 8
[12501.019186] amdgpu 0000:04:00.0: amdgpu: GPU reset(1) succeeded!
[12501.019217] amdgpu 0000:04:00.0: [drm] device wedged, but recovered through reset
[12502.952097] [drm] Failed to add display topology, DTM TA is not initialized.
[12503.324757] [drm] Failed to add display topology, DTM TA is not initialized.Last edited by aa6kj (Yesterday 23:58:23)
Offline
Pages: 1