#1 2023-03-08 07:53:54

jcr
Member
Registered: 2022-12-21
Posts: 19

Can't resume from suspend on recent kernel with AMD iGPU

I've seen similar problems reported here, but none quite identical, so I'd rather make a new post.

For about a week now, my AMD Ryzen 2500U laptop has been unable to resume from suspend. I have to force a shutdown with the power button and reboot each time (I can't even switch to a TTY).
I only found time to look at it today, and it looks very much like a problem with the AMD iGPU driver.

I've installed the LTS kernel and everything works fine there. I'll use it for now, so this isn't urgent; I just thought this post might help.

Here are the journals from two boots, one per kernel. In each I only booted, closed the lid to suspend, and opened it to resume.

6.2.2: http://0x0.st/HiXp.txt

6.1.15 LTS: http://0x0.st/HiXf.txt

Maybe this should be a bug report somewhere instead? I have no idea where, though, so please point me to the right place if I should post it elsewhere.


#2 2023-03-10 00:23:50

krutoileshii
Member
Registered: 2019-01-15
Posts: 45

Re: Can't resume from suspend on recent kernel with AMD iGPU

Are you getting MCE (machine check exception) errors?

If so, try adding the amd_iommu=on and iommu=pt kernel parameters.

That helped me on a Ryzen system with a 6700.
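
After rebooting, it is worth confirming that the parameters actually reached the kernel, since a typo in the boot loader config is easy to miss. The running kernel's command line is exposed in /proc/cmdline. A minimal Python sketch of that check follows; the helper name missing_params is made up for illustration and is not part of any tool.

```python
from pathlib import Path


def missing_params(cmdline: str, wanted: list[str]) -> list[str]:
    """Return the entries of `wanted` that do not appear on a kernel command line."""
    # Kernel parameters are whitespace-separated tokens; compare them exactly.
    present = set(cmdline.split())
    return [param for param in wanted if param not in present]


if __name__ == "__main__":
    # /proc/cmdline holds the command line the running kernel was booted with.
    path = Path("/proc/cmdline")
    if path.exists():
        missing = missing_params(path.read_text(), ["amd_iommu=on", "iommu=pt"])
        print("missing:", ", ".join(missing) if missing else "none")
```

If it prints "missing: none", both parameters were passed at boot. Anything it lists was not picked up by the boot loader.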


#3 2023-03-10 07:30:10

seth
Member
Registered: 2012-09-03
Posts: 51,143

Re: Can't resume from suspend on recent kernel with AMD iGPU

Mar 08 08:38:31 arch-laptop kernel: ------------[ cut here ]------------
Mar 08 08:38:31 arch-laptop kernel: WARNING: CPU: 5 PID: 9 at drivers/pci/ats.c:251 pci_disable_pri+0x90/0xa0
Mar 08 08:38:31 arch-laptop kernel: Modules linked in: rfcomm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c xt_addrtype iptable_filter br_netfilter bridge ccm overlay cmac algif_hash algif_skcipher af_alg bnep 8021q garp mrp stp llc snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_ctl_led iwlmvm snd_soc_core snd_hda_codec_realtek intel_rapl_msr snd_compress intel_rapl_common snd_hda_codec_generic mac80211 snd_hda_codec_hdmi ac97_bus edac_mce_amd snd_pcm_dmaengine snd_hda_intel snd_pci_ps kvm_amd snd_intel_dspcfg libarc4 snd_intel_sdw_acpi snd_rpl_pci_acp6x snd_hda_codec kvm btusb vfat snd_acp_pci btrtl irqbypass fat snd_hda_core snd_pci_acp6x uvcvideo crct10dif_pclmul btbcm snd_hwdep crc32_pclmul iwlwifi snd_pci_acp5x polyval_clmulni btintel snd_rn_pci_acp3x polyval_generic videobuf2_vmalloc joydev gf128mul snd_pcm
Mar 08 08:38:31 arch-laptop kernel:  snd_acp_config mousedev videobuf2_memops btmtk ghash_clmulni_intel videobuf2_v4l2 snd_timer sha512_ssse3 bluetooth cfg80211 snd_soc_acpi videodev aesni_intel sp5100_tco huawei_wmi snd hid_multitouch videobuf2_common ledtrig_audio crypto_simd cryptd mc rapl psmouse ecdh_generic pcspkr sparse_keymap snd_pci_acp3x soundcore i2c_piix4 wmi_bmof ccp rfkill k10temp i2c_hid_acpi i2c_hid acpi_cpufreq mac_hid pkcs8_key_parser crypto_user fuse dm_mod loop bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 amdgpu drm_ttm_helper ttm serio_raw drm_buddy atkbd gpu_sched libps2 vivaldi_fmap drm_display_helper crc32c_intel xhci_pci cec xhci_pci_renesas i8042 video serio wmi
Mar 08 08:38:31 arch-laptop kernel: CPU: 5 PID: 9 Comm: kworker/u32:0 Tainted: G        W          6.2.2-arch1-1 #1 ce7dcadfb89d51289a10c1f3b4caa2589e329214
Mar 08 08:38:31 arch-laptop kernel: Hardware name: HUAWEI KPL-W0X/KPL-W0X-PCB, BIOS 1.19 01/11/2019
Mar 08 08:38:31 arch-laptop kernel: Workqueue: events_unbound async_run_entry_fn
Mar 08 08:38:31 arch-laptop kernel: RIP: 0010:pci_disable_pri+0x90/0xa0
Mar 08 08:38:31 arch-laptop kernel: Code: 89 ee e8 83 8e fc ff 0f b7 54 24 06 89 ee 48 89 df 83 e2 fe 66 89 54 24 06 0f b7 d2 e8 49 8f fc ff 80 a3 4b 08 00 00 fd eb b3 <0f> 0b eb af e8 87 42 64 00 0f 1f 80 00 00 00 00 90 90 90 90 90 90
Mar 08 08:38:31 arch-laptop kernel: RSP: 0018:ffffb490c0127c40 EFLAGS: 00010046
Mar 08 08:38:31 arch-laptop kernel: RAX: 0000000000000000 RBX: ffff9b72c1326000 RCX: 0000000000000000
Mar 08 08:38:31 arch-laptop kernel: RDX: 0000000000000000 RSI: 0000000000000014 RDI: ffff9b72c1326000
Mar 08 08:38:31 arch-laptop kernel: RBP: ffff9b72c2936c10 R08: 0000000000000002 R09: 0000000000000000
Mar 08 08:38:31 arch-laptop kernel: R10: 0000000000000002 R11: ffffffff9ac9f090 R12: ffff9b72c2936c00
Mar 08 08:38:31 arch-laptop kernel: R13: ffff9b72c14f3200 R14: ffff9b72c14f3200 R15: ffff9b72c2936d60
Mar 08 08:38:31 arch-laptop kernel: FS:  0000000000000000(0000) GS:ffff9b73e7f40000(0000) knlGS:0000000000000000
Mar 08 08:38:31 arch-laptop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 08 08:38:31 arch-laptop kernel: CR2: 0000000000000000 CR3: 00000001a1810000 CR4: 00000000003506e0
Mar 08 08:38:31 arch-laptop kernel: Call Trace:
Mar 08 08:38:31 arch-laptop kernel:  <TASK>
Mar 08 08:38:31 arch-laptop kernel:  amd_iommu_attach_device+0x2e2/0x300
Mar 08 08:38:31 arch-laptop kernel:  __iommu_attach_device+0x1b/0x90
Mar 08 08:38:31 arch-laptop kernel:  __iommu_attach_group+0x55/0xb0
Mar 08 08:38:31 arch-laptop kernel:  iommu_attach_group+0x2a/0x40
Mar 08 08:38:31 arch-laptop kernel:  amd_iommu_init_device+0x16f/0x250
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  kfd_iommu_resume+0x50/0x1a0 [amdgpu 4f2637ea7cdfc13a9a95f7466e065e14842ba36e]
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  kgd2kfd_resume_iommu+0x16/0x30 [amdgpu 4f2637ea7cdfc13a9a95f7466e065e14842ba36e]
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  amdgpu_device_resume+0x1eb/0x310 [amdgpu 4f2637ea7cdfc13a9a95f7466e065e14842ba36e]
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  amdgpu_pmops_resume+0x36/0x70 [amdgpu 4f2637ea7cdfc13a9a95f7466e065e14842ba36e]
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  dpm_run_callback+0x4a/0x150
Mar 08 08:38:31 arch-laptop kernel:  device_resume+0xa2/0x1f0
Mar 08 08:38:31 arch-laptop kernel:  async_resume+0x1d/0x30
Mar 08 08:38:31 arch-laptop kernel:  async_run_entry_fn+0x34/0x130
Mar 08 08:38:31 arch-laptop kernel:  process_one_work+0x1c8/0x380
Mar 08 08:38:31 arch-laptop kernel:  worker_thread+0x51/0x390
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_worker_thread+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  kthread+0xde/0x110
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_kthread+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  ret_from_fork+0x2c/0x50
Mar 08 08:38:31 arch-laptop kernel:  </TASK>
Mar 08 08:38:31 arch-laptop kernel: ---[ end trace 0000000000000000 ]---
Mar 08 08:38:31 arch-laptop kernel: ------------[ cut here ]------------
Mar 08 08:38:31 arch-laptop kernel: WARNING: CPU: 5 PID: 9 at drivers/pci/ats.c:419 pci_disable_pasid+0x49/0x50
Mar 08 08:38:31 arch-laptop kernel: Modules linked in: rfcomm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c xt_addrtype iptable_filter br_netfilter bridge ccm overlay cmac algif_hash algif_skcipher af_alg bnep 8021q garp mrp stp llc snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_ctl_led iwlmvm snd_soc_core snd_hda_codec_realtek intel_rapl_msr snd_compress intel_rapl_common snd_hda_codec_generic mac80211 snd_hda_codec_hdmi ac97_bus edac_mce_amd snd_pcm_dmaengine snd_hda_intel snd_pci_ps kvm_amd snd_intel_dspcfg libarc4 snd_intel_sdw_acpi snd_rpl_pci_acp6x snd_hda_codec kvm btusb vfat snd_acp_pci btrtl irqbypass fat snd_hda_core snd_pci_acp6x uvcvideo crct10dif_pclmul btbcm snd_hwdep crc32_pclmul iwlwifi snd_pci_acp5x polyval_clmulni btintel snd_rn_pci_acp3x polyval_generic videobuf2_vmalloc joydev gf128mul snd_pcm
Mar 08 08:38:31 arch-laptop kernel:  snd_acp_config mousedev videobuf2_memops btmtk ghash_clmulni_intel videobuf2_v4l2 snd_timer sha512_ssse3 bluetooth cfg80211 snd_soc_acpi videodev aesni_intel sp5100_tco huawei_wmi snd hid_multitouch videobuf2_common ledtrig_audio crypto_simd cryptd mc rapl psmouse ecdh_generic pcspkr sparse_keymap snd_pci_acp3x soundcore i2c_piix4 wmi_bmof ccp rfkill k10temp i2c_hid_acpi i2c_hid acpi_cpufreq mac_hid pkcs8_key_parser crypto_user fuse dm_mod loop bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 amdgpu drm_ttm_helper ttm serio_raw drm_buddy atkbd gpu_sched libps2 vivaldi_fmap drm_display_helper crc32c_intel xhci_pci cec xhci_pci_renesas i8042 video serio wmi
Mar 08 08:38:31 arch-laptop kernel: CPU: 5 PID: 9 Comm: kworker/u32:0 Tainted: G        W          6.2.2-arch1-1 #1 ce7dcadfb89d51289a10c1f3b4caa2589e329214
Mar 08 08:38:31 arch-laptop kernel: Hardware name: HUAWEI KPL-W0X/KPL-W0X-PCB, BIOS 1.19 01/11/2019
Mar 08 08:38:31 arch-laptop kernel: Workqueue: events_unbound async_run_entry_fn
Mar 08 08:38:31 arch-laptop kernel: RIP: 0010:pci_disable_pasid+0x49/0x50
Mar 08 08:38:31 arch-laptop kernel: Code: 53 48 89 fb 85 f6 75 06 5b e9 67 a8 65 00 83 c6 06 31 d2 e8 a9 90 fc ff 80 a3 4b 08 00 00 fe 5b e9 50 a8 65 00 e9 4b a8 65 00 <0f> 0b e9 44 a8 65 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90
Mar 08 08:38:31 arch-laptop kernel: RSP: 0018:ffffb490c0127c60 EFLAGS: 00010046
Mar 08 08:38:31 arch-laptop kernel: RAX: 0000000000000000 RBX: ffff9b72c1326000 RCX: 0000000000000000
Mar 08 08:38:31 arch-laptop kernel: RDX: 0000000000000000 RSI: 0000000000000014 RDI: ffff9b72c1326000
Mar 08 08:38:31 arch-laptop kernel: RBP: ffff9b72c2936c10 R08: 0000000000000002 R09: 0000000000000000
Mar 08 08:38:31 arch-laptop kernel: R10: 0000000000000002 R11: ffffffff9ac9f090 R12: ffff9b72c2936c00
Mar 08 08:38:31 arch-laptop kernel: R13: ffff9b72c14f3200 R14: ffff9b72c14f3200 R15: ffff9b72c2936d60
Mar 08 08:38:31 arch-laptop kernel: FS:  0000000000000000(0000) GS:ffff9b73e7f40000(0000) knlGS:0000000000000000
Mar 08 08:38:31 arch-laptop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 08 08:38:31 arch-laptop kernel: CR2: 0000000000000000 CR3: 00000001a1810000 CR4: 00000000003506e0
Mar 08 08:38:31 arch-laptop kernel: Call Trace:
Mar 08 08:38:31 arch-laptop kernel:  <TASK>
Mar 08 08:38:31 arch-laptop kernel:  amd_iommu_attach_device+0x2ea/0x300
Mar 08 08:38:31 arch-laptop kernel:  __iommu_attach_device+0x1b/0x90
Mar 08 08:38:31 arch-laptop kernel:  __iommu_attach_group+0x55/0xb0
Mar 08 08:38:31 arch-laptop kernel:  iommu_attach_group+0x2a/0x40
Mar 08 08:38:31 arch-laptop kernel:  amd_iommu_init_device+0x16f/0x250
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  kfd_iommu_resume+0x50/0x1a0 [amdgpu 4f2637ea7cdfc13a9a95f7466e065e14842ba36e]
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  kgd2kfd_resume_iommu+0x16/0x30 [amdgpu 4f2637ea7cdfc13a9a95f7466e065e14842ba36e]
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  amdgpu_device_resume+0x1eb/0x310 [amdgpu 4f2637ea7cdfc13a9a95f7466e065e14842ba36e]
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  amdgpu_pmops_resume+0x36/0x70 [amdgpu 4f2637ea7cdfc13a9a95f7466e065e14842ba36e]
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  dpm_run_callback+0x4a/0x150
Mar 08 08:38:31 arch-laptop kernel:  device_resume+0xa2/0x1f0
Mar 08 08:38:31 arch-laptop kernel:  async_resume+0x1d/0x30
Mar 08 08:38:31 arch-laptop kernel:  async_run_entry_fn+0x34/0x130
Mar 08 08:38:31 arch-laptop kernel:  process_one_work+0x1c8/0x380
Mar 08 08:38:31 arch-laptop kernel:  worker_thread+0x51/0x390
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_worker_thread+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  kthread+0xde/0x110
Mar 08 08:38:31 arch-laptop kernel:  ? __pfx_kthread+0x10/0x10
Mar 08 08:38:31 arch-laptop kernel:  ret_from_fork+0x2c/0x50
Mar 08 08:38:31 arch-laptop kernel:  </TASK>
Mar 08 08:38:31 arch-laptop kernel: ---[ end trace 0000000000000000 ]---
Mar 08 08:38:31 arch-laptop kernel: kfd kfd: amdgpu: Failed to resume IOMMU for device 1002:15dd
Mar 08 08:38:31 arch-laptop kernel: amdgpu 0000:02:00.0: amdgpu: amdgpu_device_ip_resume failed (-6).
Mar 08 08:38:31 arch-laptop kernel: amdgpu 0000:02:00.0: PM: dpm_run_callback(): pci_pm_resume+0x0/0xf0 returns -6
Mar 08 08:38:31 arch-laptop kernel: amdgpu 0000:02:00.0: PM: failed to resume async: error -6
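
To find lines like the last two in a long journal (for example the output of "journalctl -k -b -1" saved to a file), a short filter is enough. This is only a sketch: the function name and the regex are my own and assume the "PM: ... returns -N" and "PM: ... error -N" wording seen in the excerpt above.

```python
import re

# Matches kernel PM failure lines such as
# "... kernel: amdgpu 0000:02:00.0: PM: failed to resume async: error -6"
PM_FAIL = re.compile(
    r"kernel: (?P<dev>\S+ \S+): PM: .*(?:returns|error) (?P<code>-\d+)$"
)


def pm_failures(lines):
    """Yield (device, error code) for each PM failure line in the given journal lines."""
    for line in lines:
        match = PM_FAIL.search(line.rstrip())
        if match:
            yield match.group("dev"), int(match.group("code"))


if __name__ == "__main__":
    sample = [
        "Mar 08 08:38:31 arch-laptop kernel: amdgpu 0000:02:00.0: PM: failed to resume async: error -6",
        "Mar 08 08:38:31 arch-laptop kernel:  ret_from_fork+0x2c/0x50",
    ]
    for dev, code in pm_failures(sample):
        print(dev, code)
```

Passing an open file object instead of the sample list works the same way, since the function only iterates over lines.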

If the IOMMU parameters don't help, try:

amdgpu.dpm=0 amdgpu.bapm=0 amdgpu.aspm=0 amdgpu.runpm=0 pcie_aspm=off

See https://wiki.archlinux.org/title/Kernel_parameters for how to set them.
NB: "amdgpu.dpm=0" might prevent the GPU from activating at all.
