#1 2025-05-26 19:22:37

Wild Penguin
Member
Registered: 2015-03-19
Posts: 365

amdgpu related freeze after suspend since kernel 6.14.7 [SOLVED]

Hi,

This is on the Zen branch.

After upgrading from 6.14.6 to 6.14.7, graphics (seemingly reliably) fail to come back after resuming from suspend.

That is, the display seems to come out of power save, but nothing is displayed.

In the log I get these kinds of errors, which were not there before:

May 26 21:36:57 ArkkiVille kernel: amdgpu 0000:2d:00.0: amdgpu: [drm] amdgpu: DP AUX transfer fail:4 

and:

May 26 21:51:01 ArkkiVille kernel: amdgpu 0000:2d:00.0: [drm] *ERROR* [CRTC:91:crtc-0] flip_done timed out
May 26 21:51:11 ArkkiVille kernel: amdgpu 0000:2d:00.0: [drm] *ERROR* flip_done timed out
May 26 21:51:11 ArkkiVille kernel: amdgpu 0000:2d:00.0: [drm] *ERROR* [CRTC:91:crtc-0] commit wait timed out
May 26 21:51:22 ArkkiVille kernel: amdgpu 0000:2d:00.0: [drm] *ERROR* flip_done timed out
May 26 21:51:22 ArkkiVille kernel: amdgpu 0000:2d:00.0: [drm] *ERROR* [CONNECTOR:113:DP-1] commit wait timed out

and, finally:

May 26 21:51:32 ArkkiVille kernel: ------------[ cut here ]------------
May 26 21:51:32 ArkkiVille kernel: WARNING: CPU: 22 PID: 145780 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:9412 amdgpu_dm_commit_planes+0x1dc7/0x2030 [amdgpu] 
May 26 21:51:32 ArkkiVille kernel: Modules linked in: ddbridge hid_microsoft ff_memless rpcrdma rdma_cm iw_cm ib_cm ip6table_filter ib_core ip6_tables iptable_filter rfcomm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device nct6775_core hwmon_vid uhid cmac algif_hash algif_skcipher af_alg bnep nfsd auth_rpcgss nfs_acl lockd grace nfs_localio sunrpc vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) overlay uinput vfat fat tda18271c2dd amd_atl intel_rapl_msr intel_rapl_common pktcdvd snd_hda_codec_realtek snd_hda_codec_generic btusb snd_hda_scodec_component snd_hda_codec_hdmi btrtl btintel kvm_amd snd_hda_intel btbcm btmtk snd_intel_dspcfg kvm snd_intel_sdw_acpi drxk irqbypass bluetooth snd_hda_codec polyval_clmulni mousedev polyval_generic joydev snd_hda_core corsair_cpro ghash_clmulni_intel sha512_ssse3 snd_hwdep rfkill sha256_ssse3 snd_pcm dvb_core r8169 ee1004 ucsi_ccg sha1_ssse3 videobuf2_vmalloc typec_ucsi aesni_intel snd_timer sp5100_tco realtek videobuf2_memops crypto_simd typec snd cryptd mdio_devres videobuf2_common i2c_piix4 rapl wmi_bmof 
May 26 21:51:32 ArkkiVille kernel:  soundcore roles gpio_amdpt ccp i2c_smbus libphy zenpower(OE) mc gpio_generic mac_hid nct6687(OE) i2c_dev sg crypto_user loop dm_mod nfnetlink ip_tables x_tables nvme sr_mod nvme_core cdrom nvme_auth hid_logitech_dj hid_logitech_hidpp hid_generic usbhid amdgpu amdxcp i2c_algo_bit drm_ttm_helper ttm drm_exec gpu_sched drm_suballoc_helper video wmi drm_panel_backlight_quirks drm_buddy drm_display_helper cec [last unloaded: ddbridge] 
May 26 21:51:32 ArkkiVille kernel: CPU: 22 UID: 0 PID: 145780 Comm: kworker/22:1 Tainted: G S         OE      6.14.7-zen2-1-zen #1 3b79f8c13197897386d587fb5dc955013017e979 
May 26 21:51:32 ArkkiVille kernel: Tainted: [S]=CPU_OUT_OF_SPEC, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE 
May 26 21:51:32 ArkkiVille kernel: Hardware name: Micro-Star International Co., Ltd. MS-7C91/MAG B550 TOMAHAWK (MS-7C91), BIOS A.G0 03/12/2024 
May 26 21:51:32 ArkkiVille kernel: Workqueue: events fbcon_register_existing_fbs 
May 26 21:51:32 ArkkiVille kernel: RIP: 0010:amdgpu_dm_commit_planes+0x1dc7/0x2030 [amdgpu] 
May 26 21:51:32 ArkkiVille kernel: Code: b8 00 00 00 89 b4 01 64 06 00 00 48 8b 85 00 ff ff ff 48 8b 80 50 01 00 00 48 8b 40 08 8b 90 f4 02 00 00 e9 ad e8 ff ff 0f 0b <0f> 0b e9 14 f6 ff ff 0f 0b e9 2c f6 ff ff 31 d2 31 c0 e9 1e f9 ff 
May 26 21:51:32 ArkkiVille kernel: RSP: 0018:ffffba6e551877c0 EFLAGS: 00010002 
May 26 21:51:32 ArkkiVille kernel: RAX: 0000000000000282 RBX: 0000000000000282 RCX: ffffffffc11ae348 
May 26 21:51:32 ArkkiVille kernel: RDX: 0000000000000001 RSI: 0000000000000286 RDI: ffff9b04c5e00178 
May 26 21:51:32 ArkkiVille kernel: RBP: ffffba6e55187978 R08: 0000000000000000 R09: 00000000fffffa1d 
May 26 21:51:32 ArkkiVille kernel: R10: 0000000000000000 R11: ffffba6e551876dc R12: 0000000000000002 
May 26 21:51:32 ArkkiVille kernel: R13: 0000000000000000 R14: ffff9b0b666e7000 R15: ffff9b04dcca3000 
May 26 21:51:32 ArkkiVille kernel: FS:  0000000000000000(0000) GS:ffff9b0bbed00000(0000) knlGS:0000000000000000 
May 26 21:51:32 ArkkiVille kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033 
May 26 21:51:32 ArkkiVille kernel: CR2: 00007f239fa76000 CR3: 00000001127a0000 CR4: 0000000000f50ef0 
May 26 21:51:32 ArkkiVille kernel: PKRU: 55555554 
May 26 21:51:32 ArkkiVille kernel: Call Trace: 
May 26 21:51:32 ArkkiVille kernel:  <TASK> 
May 26 21:51:32 ArkkiVille kernel:  amdgpu_dm_atomic_commit_tail+0xf8c/0x30b0 [amdgpu 30d7512d174fa929063ef098b0124df029bff6e3] 
May 26 21:51:32 ArkkiVille kernel:  commit_tail+0xa1/0x130 
May 26 21:51:32 ArkkiVille kernel:  drm_atomic_helper_commit+0x13c/0x180 
May 26 21:51:32 ArkkiVille kernel:  drm_atomic_commit+0xb2/0xe0 
May 26 21:51:32 ArkkiVille kernel:  ? __pfx___drm_printfn_info+0x10/0x10 
May 26 21:51:32 ArkkiVille kernel:  drm_client_modeset_commit_atomic.constprop.0+0x1b7/0x1f0 
May 26 21:51:32 ArkkiVille kernel:  drm_client_modeset_commit_locked+0x55/0x180 
May 26 21:51:32 ArkkiVille kernel:  drm_client_modeset_commit+0x25/0x40 
May 26 21:51:32 ArkkiVille kernel:  drm_fb_helper_set_par+0x95/0xd0 
May 26 21:51:32 ArkkiVille kernel:  fbcon_init+0x39e/0x6b0 
May 26 21:51:32 ArkkiVille kernel:  visual_init+0xce/0x130 
May 26 21:51:32 ArkkiVille kernel:  do_bind_con_driver.isra.0+0x2b0/0x470 
May 26 21:51:32 ArkkiVille kernel:  do_take_over_console+0x23b/0x460 
May 26 21:51:32 ArkkiVille kernel:  do_fb_registered+0x115/0x1a0 
May 26 21:51:32 ArkkiVille kernel:  fbcon_register_existing_fbs+0x3f/0x70 
May 26 21:51:32 ArkkiVille kernel:  process_one_work+0x193/0x360 
May 26 21:51:32 ArkkiVille kernel:  worker_thread+0x24f/0x380 
May 26 21:51:32 ArkkiVille kernel:  ? __pfx_worker_thread+0x10/0x10 
May 26 21:51:32 ArkkiVille kernel:  kthread+0xef/0x230 
May 26 21:51:32 ArkkiVille kernel:  ? __pfx_kthread+0x10/0x10 
May 26 21:51:32 ArkkiVille kernel:  ret_from_fork+0x34/0x50 
May 26 21:51:32 ArkkiVille kernel:  ? __pfx_kthread+0x10/0x10 
May 26 21:51:32 ArkkiVille kernel:  ret_from_fork_asm+0x1a/0x30 
May 26 21:51:32 ArkkiVille kernel:  </TASK> 
May 26 21:51:32 ArkkiVille kernel: ---[ end trace 0000000000000000 ]---

I'm currently downgrading to confirm the issue goes away.

I can try a shutdown via SSH, but it takes ages (user processes dumping core and systemd waiting for timeouts, I guess, but I cannot see what is going on apart from the log). It seemingly finishes after a while; however, the system does not power off or reboot (possibly a kernel panic at a late stage of the shutdown?).

EDIT: It does not always take that long after all:

May 26 21:56:02 ArkkiVille sudo[155289]:    ville : TTY=pts/7 ; PWD=/home/ville ; USER=root ; COMMAND=/usr/bin/systemctl reboot
.
.
.
May 26 21:57:53 ArkkiVille systemd-journald[670]: Journal stopped

However, that is still longer than usual, and I cannot see the status of the shutdown. On a previous shutdown (after amdgpu crashed), it took much longer according to the journal.

Any other users with a similar issue?
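
For anyone wanting to check their own kernel log for the same symptoms, here is a minimal Python sketch that counts the amdgpu messages quoted in this thread. The message fragments are copied from the log excerpts posted here; the function name `count_signatures` and the signature labels are made up for illustration, and a match only means the log looks similar, not that it is the same bug.

```python
import re
from collections import Counter

# Message fragments taken from the log excerpts in this thread.
# The dictionary keys are hypothetical labels, not kernel identifiers.
SIGNATURES = {
    "dp_aux_transfer_fail": re.compile(r"DP AUX transfer fail:\d+"),
    "aux_reply_not_ack": re.compile(r"AUX reply command not ACK"),
    "flip_done_timeout": re.compile(r"flip_done timed out"),
    "commit_wait_timeout": re.compile(r"commit wait timed out"),
}


def count_signatures(lines):
    """Count, per signature, how many amdgpu log lines match it."""
    counts = Counter()
    for line in lines:
        # Ignore lines that do not mention amdgpu at all.
        if "amdgpu" not in line:
            continue
        for name, pattern in SIGNATURES.items():
            if pattern.search(line):
                counts[name] += 1
    return counts


# Small demo using two lines quoted in this thread.
demo = [
    "amdgpu 0000:2d:00.0: amdgpu: [drm] amdgpu: DP AUX transfer fail:4",
    "amdgpu 0000:2d:00.0: [drm] *ERROR* [CRTC:91:crtc-0] flip_done timed out",
]
print(dict(count_signatures(demo)))
```

To run it against a real log, pass any iterable of lines, for example `sys.stdin` with the output of `journalctl -k -b -1` (kernel messages from the previous boot) piped in.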

Last edited by Wild Penguin (2025-06-02 21:50:47)

#2 2025-05-26 19:39:07

Wild Penguin
Member
Registered: 2015-03-19
Posts: 365

Re: amdgpu related freeze after suspend since kernel 6.14.7 [SOLVED]

I believe I found the upstream issue:

https://gitlab.freedesktop.org/drm/amd/-/issues/4243 (or https://gitlab.freedesktop.org/drm/amd/-/issues/4155 ), although I haven't read them thoroughly yet :)

#3 2025-05-28 14:46:51

Janhouse
Member
Registered: 2010-10-02
Posts: 30

Re: amdgpu related freeze after suspend since kernel 6.14.7 [SOLVED]

For me, the 3rd monitor, which is connected using a DisplayPort cable, doesn't turn on at all with 6.14.7.

And dmesg is full of these:

[   23.328616] amdgpu 0000:0a:00.0: amdgpu: [drm] amdgpu: DP AUX transfer fail:3
[   23.730006] amdgpu 0000:0a:00.0: amdgpu: [drm] amdgpu: DP AUX transfer fail:3
[   23.731316] amdgpu 0000:0a:00.0: amdgpu: [drm] amdgpu: DP AUX transfer fail:3
[   28.163150] amdgpu 0000:0a:00.0: amdgpu: [drm] amdgpu: AUX reply command not ACK: 0x02.
[   28.179051] amdgpu 0000:0a:00.0: amdgpu: [drm] amdgpu: AUX reply command not ACK: 0x02.
[   32.196202] amdgpu 0000:0a:00.0: amdgpu: [drm] amdgpu: AUX reply command not ACK: 0x02.

#4 2025-06-01 01:34:54

AngelBePro
Member
From: Bulgaria
Registered: 2023-07-27
Posts: 69

Re: amdgpu related freeze after suspend since kernel 6.14.7 [SOLVED]

This seems to have been fixed in the latest kernel; I am currently using linux 6.14.9-arch1-1 (I am not using zen). Can someone confirm?


Thanks for helping me!
MB: MSI B760-P DDR4 II
CPU: i5-14400F
GPU: RX 7800 XT

#5 2025-06-02 21:50:19

Wild Penguin
Member
Registered: 2015-03-19
Posts: 365

Re: amdgpu related freeze after suspend since kernel 6.14.7 [SOLVED]

I cannot reproduce it at the moment. Marking as [SOLVED]!