#1 2024-03-11 06:53:13

sisama
Member
Registered: 2024-03-11
Posts: 4

Freeze/failure when resuming from sleep

I have been unable to suspend and resume my computer successfully. On resume the screen just stays blank, and although I can get to the SDDM greeter by pressing Ctrl+F2, the system freezes after I enter my login details and has to be hard-rebooted with the power key.

Host details:
```
[sisama@rzr-ntbk-archlinux-sing1 ~]$ neofetch
                   -`                    sisama@rzr-ntbk-archlinux-sing1
                  .o+`                   -------------------------------
                 `ooo/                   OS: Arch Linux x86_64
                `+oooo:                  Host: Blade 14 - RZ09-0370 1.04
               `+oooooo:                 Kernel: 6.7.9-arch1-1
               -+oooooo+:                Uptime: 14 mins
             `/:-:++oooo+:               Packages: 940 (pacman), 17 (flatpak)
            `/++++/+++++++:              Shell: bash 5.2.26
           `/++++++++++++++:             Resolution: 2560x1440
          `/+++ooooooooooooo/`           DE: Plasma 6.0.1
         ./ooosssso++osssssso+`          WM: kwin
        .oossssso-````/ossssss+`         Theme: [Plasma], Breeze [GTK2/3]
       -osssssso.      :ssssssso.        Icons: [Plasma], breeze [GTK2/3]
      :osssssss/        osssso+++.       Terminal: konsole
     /ossssssss/        +ssssooo/-       CPU: AMD Ryzen 9 5900HX with Radeon Graphics (16) @ 4.890GHz
   `/ossssso+/:-        -:/+osssso+-     GPU: AMD ATI Radeon Vega Series / Radeon Vega Mobile Series
  `+sso+:-`                 `.-/+oso:    Memory: 3890MiB / 15393MiB
`++:.                           `-/+/
.`                                 `/                           
                                                               
```

The system journal of the previous boot (`journalctl -b -1`): https://0x0.st/HhLM.txt
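
For anyone following along, a minimal sketch of narrowing the previous boot's journal down to GPU-related failures for a quick look. `journalctl -k -b -1` is the standard way to read the previous boot's kernel messages; the `gpu_errors` helper and its match patterns are illustrative assumptions, not something from this thread.

```shell
# Hypothetical helper: keep only amdgpu/drm lines that look like failures.
# Reads journal text on stdin, so it also works on a saved log file.
gpu_errors() {
    grep -E 'amdgpu|\[drm' | grep -Ei 'error|fail|timeout|timed out|refused'
}

# Typical use (kernel messages of the previous boot):
#   journalctl -k -b -1 --no-pager | gpu_errors
```

The filter is only meant for a first pass; the full journal is still the more useful thing to upload.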

Any pointers on how to mitigate this would be much appreciated!

#2 2024-03-11 08:32:00

seth
Member
From: Won't reply 2 private help req
Registered: 2012-09-03
Posts: 76,016

Re: Freeze/failure when resuming from sleep

Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: Refused to change power state from D0 to D3hot
…
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] reserve 0x400000 from 0xf41f800000 for PSP TMR
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] psp gfx command SETUP_TMR(0x5) failed and response status is (0x80000306)
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] failed to load ucode CP_MEC1(0x19) 
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0x80000203)
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] failed to load ucode VCN(0x36) 
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0x80000203)
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] failed to load ucode DMCUB(0x3C) 
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0x80000203)
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu: RAS: optional ras ta ucode is not available
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu: RAP: optional rap ta ucode is not available
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resuming...
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu: dpm has been disabled
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resumed successfully!
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] Wait for DMUB auto-load failed: 3
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] DMUB hardware initialized: version=0x01010028
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: ------------[ cut here ]------------
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: WARNING: CPU: 8 PID: 2316 at drivers/gpu/drm/amd/amdgpu/../display/dc/dcn20/dcn20_hubbub.c:566 hubbub2_get_dchub_ref_freq+0xa0/0xc0 [amdgpu]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: Modules linked in: rfcomm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device ccm cmac algif_hash algif_skcipher af_alg bnep xt_hl ip6t_rt ip6t_REJECT nf_reject_ipv6 xt_LOG nf_log_syslog xt_comment xt_multiport xt_limit xt_addrtype xt_tcpudp xt_conntrack snd_acp_legacy_mach snd_acp_mach nf_conntrack snd_soc_nau8821 snd_soc_dmic snd_acp3x_rn nf_defrag_ipv6 snd_acp3x_pdm_dma intel_rapl_msr intel_rapl_common nf_defrag_ipv4 snd_sof_amd_acp63 libcrc32c snd_sof_amd_vangogh ipt_REJECT nf_reject_ipv4 snd_sof_amd_rembrandt ip6table_filter snd_sof_amd_renoir snd_sof_amd_acp ip6_tables snd_sof_pci kvm_amd iptable_filter snd_hda_codec_realtek snd_sof_xtensa_dsp amdgpu snd_sof snd_hda_codec_generic snd_sof_utils kvm ledtrig_audio iwlmvm snd_soc_core irqbypass crct10dif_pclmul snd_compress snd_hda_intel ac97_bus crc32_pclmul polyval_clmulni amdxcp snd_intel_dspcfg btusb snd_pcm_dmaengine drm_exec polyval_generic snd_intel_sdw_acpi mac80211 btrtl mousedev gf128mul uvcvideo gpu_sched snd_pci_ps joydev snd_hda_codec
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ghash_clmulni_intel btintel snd_rpl_pci_acp6x videobuf2_vmalloc sha512_ssse3 btbcm drm_buddy hid_multitouch uvc snd_acp_pci libarc4 i2c_algo_bit btmtk vfat sha256_ssse3 snd_hda_core videobuf2_memops ptp snd_acp_legacy_common drm_suballoc_helper fat pps_core videobuf2_v4l2 snd_hwdep snd_pci_acp6x sha1_ssse3 drm_ttm_helper ttm bluetooth aesni_intel snd_pci_acp5x videodev snd_pcm drm_display_helper snd_rn_pci_acp3x iwlwifi crypto_simd hid_generic dcdbas snd_timer snd_acp_config cec cryptd cfg80211 snd wmi_bmof dell_wmi_descriptor videobuf2_common ecdh_generic video snd_soc_acpi i2c_hid_acpi sp5100_tco mc usbhid snd_pci_acp3x ccp rfkill soundcore i2c_piix4 k10temp rapl pcspkr wmi i2c_hid mac_hid i2c_dev crypto_user fuse dm_mod loop nfnetlink ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 nvme nvme_core xhci_pci crc32c_intel nvme_auth xhci_pci_renesas
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: Unloaded tainted modules: nvidia(POE):3
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: CPU: 8 PID: 2316 Comm: kworker/u32:22 Tainted: P           OE      6.7.9-arch1-1 #1 ad54415bbff2f0801422a3b76df850f68e71ecab
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: Hardware name: Razer Blade 14 - RZ09-0370/PI411, BIOS 1.06 06/07/2021
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: Workqueue: events_unbound async_run_entry_fn
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: RIP: 0010:hubbub2_get_dchub_ref_freq+0xa0/0xc0 [amdgpu]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: Code: 83 c0 63 ff ff 3d 20 4e 00 00 77 22 89 5d 00 48 8b 44 24 08 65 48 2b 04 25 28 00 00 00 75 24 48 83 c4 10 5b 5d e9 0b d7 14 da <0f> 0b eb de 0f 0b eb da d1 eb 8d 83 c0 63 ff ff 3d 20 4e 00 00 76
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: RSP: 0018:ffffa575c1cabc40 EFLAGS: 00010246
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: RAX: 0000000000001000 RBX: 000000000000bb80 RCX: 0000000000000000
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: RDX: ffffa575c1cabc44 RSI: 0000000000001638 RDI: ffff8c1904d80000
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: RBP: ffff8c190492e3a0 R08: ffffa575c1cabc40 R09: 000000000000000c
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: R10: ffffa575ffecb100 R11: 000000000000000f R12: ffff8c190492e000
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: R13: ffff8c19016bde00 R14: ffff8c191ec0fc00 R15: ffff8c1901409568
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: FS:  0000000000000000(0000) GS:ffff8c1bde800000(0000) knlGS:0000000000000000
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: CR2: 0000000000000000 CR3: 000000018d820000 CR4: 0000000000f50ef0
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: PKRU: 55555554
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: Call Trace:
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  <TASK>
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? hubbub2_get_dchub_ref_freq+0xa0/0xc0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? __warn+0x81/0x130
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? hubbub2_get_dchub_ref_freq+0xa0/0xc0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? report_bug+0x171/0x1a0
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? handle_bug+0x3c/0x80
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? exc_invalid_op+0x17/0x70
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? asm_exc_invalid_op+0x1a/0x20
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? hubbub2_get_dchub_ref_freq+0xa0/0xc0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  dcn10_init_hw+0x185/0x4c0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  dc_set_power_state+0x61/0xa0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  dm_resume+0x11b/0x8c0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? _dev_info+0x79/0xa0
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_device_ip_resume_phase2+0x52/0xc0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_device_resume+0xa0/0x2d0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_pmops_resume+0x4a/0x80 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? __pfx_pci_pm_resume+0x10/0x10
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  dpm_run_callback+0x8c/0x1e0
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  __device_resume+0xb0/0x2d0
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  async_resume+0x1d/0x30
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  async_run_entry_fn+0x34/0x160
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  process_one_work+0x17b/0x350
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  worker_thread+0x30f/0x450
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? __pfx_worker_thread+0x10/0x10
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  kthread+0xe8/0x120
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? __pfx_kthread+0x10/0x10
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ret_from_fork+0x34/0x50
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ? __pfx_kthread+0x10/0x10
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  ret_from_fork_asm+0x1b/0x30
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel:  </TASK>
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: ---[ end trace 0000000000000000 ]---
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: [drm] enabling link 0 failed: 15
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:221 vmid:0 pasid:0, for process  pid 0 thread  pid 0)
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu:   in page starting at address 0x0000000000001000 from IH client 0x1b (UTCL2)
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm] kiq ring mec 2 pipe 1 q 0
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00000BBA
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu:          Faulty UTCL2 client ID: CPC (0x5)
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu:          MORE_FAULTS: 0x0
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu:          WALKER_ERROR: 0x5
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu:          PERMISSION_FAULTS: 0xb
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu:          MAPPING_ERROR: 0x1
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu:          RW: 0x0
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_0.2.1.0 test failed (-110)
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm:amdgpu_gfx_enable_kcq [amdgpu]] *ERROR* KCQ enable failed
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v9_0> failed -110
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu: amdgpu_device_ip_resume failed (-110).
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: PM: dpm_run_callback(): pci_pm_resume+0x0/0xf0 returns -110
Mar 11 02:17:13 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: PM: failed to resume async: error -110
…
Mar 11 02:17:23 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: [drm] *ERROR* [CRTC:73:crtc-0] flip_done timed out
Mar 11 02:17:23 rzr-ntbk-archlinux-sing1 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=1925, emitted seq=1927
Mar 11 02:17:23 rzr-ntbk-archlinux-sing1 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
Mar 11 02:17:23 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset begin!
Mar 11 02:17:24 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: [drm] *ERROR* Error queueing DMUB command: status=2
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: [drm] *ERROR* Error queueing DMUB command: status=2
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: [drm] *ERROR* Error queueing DMUB command: status=2
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: amdgpu 0000:04:00.0: [drm] *ERROR* Error queueing DMUB command: status=2
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: ------------[ cut here ]------------
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: WARNING: CPU: 8 PID: 2321 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:622 amdgpu_irq_put+0x46/0x70 [amdgpu]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Modules linked in: rfcomm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device ccm cmac algif_hash algif_skcipher af_alg bnep xt_hl ip6t_rt ip6t_REJECT nf_reject_ipv6 xt_LOG nf_log_syslog xt_comment xt_multiport xt_limit xt_addrtype xt_tcpudp xt_conntrack snd_acp_legacy_mach snd_acp_mach nf_conntrack snd_soc_nau8821 snd_soc_dmic snd_acp3x_rn nf_defrag_ipv6 snd_acp3x_pdm_dma intel_rapl_msr intel_rapl_common nf_defrag_ipv4 snd_sof_amd_acp63 libcrc32c snd_sof_amd_vangogh ipt_REJECT nf_reject_ipv4 snd_sof_amd_rembrandt ip6table_filter snd_sof_amd_renoir snd_sof_amd_acp ip6_tables snd_sof_pci kvm_amd iptable_filter snd_hda_codec_realtek snd_sof_xtensa_dsp amdgpu snd_sof snd_hda_codec_generic snd_sof_utils kvm ledtrig_audio iwlmvm snd_soc_core irqbypass crct10dif_pclmul snd_compress snd_hda_intel ac97_bus crc32_pclmul polyval_clmulni amdxcp snd_intel_dspcfg btusb snd_pcm_dmaengine drm_exec polyval_generic snd_intel_sdw_acpi mac80211 btrtl mousedev gf128mul uvcvideo gpu_sched snd_pci_ps joydev snd_hda_codec
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ghash_clmulni_intel btintel snd_rpl_pci_acp6x videobuf2_vmalloc sha512_ssse3 btbcm drm_buddy hid_multitouch uvc snd_acp_pci libarc4 i2c_algo_bit btmtk vfat sha256_ssse3 snd_hda_core videobuf2_memops ptp snd_acp_legacy_common drm_suballoc_helper fat pps_core videobuf2_v4l2 snd_hwdep snd_pci_acp6x sha1_ssse3 drm_ttm_helper ttm bluetooth aesni_intel snd_pci_acp5x videodev snd_pcm drm_display_helper snd_rn_pci_acp3x iwlwifi crypto_simd hid_generic dcdbas snd_timer snd_acp_config cec cryptd cfg80211 snd wmi_bmof dell_wmi_descriptor videobuf2_common ecdh_generic video snd_soc_acpi i2c_hid_acpi sp5100_tco mc usbhid snd_pci_acp3x ccp rfkill soundcore i2c_piix4 k10temp rapl pcspkr wmi i2c_hid mac_hid i2c_dev crypto_user fuse dm_mod loop nfnetlink ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 nvme nvme_core xhci_pci crc32c_intel nvme_auth xhci_pci_renesas
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Unloaded tainted modules: nvidia(POE):3
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: CPU: 8 PID: 2321 Comm: kworker/u32:26 Tainted: P        W  OE      6.7.9-arch1-1 #1 ad54415bbff2f0801422a3b76df850f68e71ecab
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Hardware name: Razer Blade 14 - RZ09-0370/PI411, BIOS 1.06 06/07/2021
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 ea a7 52 da e9 5a fd ff ff <0f> 0b b8 ea ff ff ff e9 d9 a7 52 da b8 ea ff ff ff e9 cf a7 52 da
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: RSP: 0018:ffffa575c3717c68 EFLAGS: 00010246
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: RAX: ffff8c1921bced10 RBX: ffff8c1904d80000 RCX: 0000000000000000
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: RDX: 0000000000000000 RSI: ffff8c1904da4410 RDI: ffff8c1904d80000
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: RBP: ffff8c1904d80000 R08: 000000000003a5c0 R09: 0000000000000006
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: R10: 0000000000000008 R11: 0000000000000000 R12: 0000000000001050
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: R13: ffff8c1904dba128 R14: ffff8c1927800c00 R15: 0000000000000000
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: FS:  0000000000000000(0000) GS:ffff8c1bde800000(0000) knlGS:0000000000000000
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: CR2: 00007480610d2010 CR3: 000000018d820000 CR4: 0000000000f50ef0
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: PKRU: 55555554
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Call Trace:
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  <TASK>
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? __warn+0x81/0x130
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? report_bug+0x171/0x1a0
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? handle_bug+0x3c/0x80
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? exc_invalid_op+0x17/0x70
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? asm_exc_invalid_op+0x1a/0x20
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  gfx_v9_0_hw_fini+0x35/0x740 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_device_ip_suspend_phase2+0x105/0x1a0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? amdgpu_device_ip_suspend_phase1+0x6f/0xe0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_device_pre_asic_reset+0xd3/0x2a0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_device_gpu_recover+0x476/0xc90 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? ___drm_dbg+0x60/0xd0
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_job_timedout+0x186/0x270 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  drm_sched_job_timedout+0x85/0x120 [gpu_sched fb54e3185d2218cc261be59aab22418ea255661c]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  process_one_work+0x17b/0x350
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  worker_thread+0x30f/0x450
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? __pfx_worker_thread+0x10/0x10
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  kthread+0xe8/0x120
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? __pfx_kthread+0x10/0x10
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ret_from_fork+0x34/0x50
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? __pfx_kthread+0x10/0x10
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ret_from_fork_asm+0x1b/0x30
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  </TASK>
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: ---[ end trace 0000000000000000 ]---
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: ------------[ cut here ]------------
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: WARNING: CPU: 8 PID: 2321 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:622 amdgpu_irq_put+0x46/0x70 [amdgpu]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Modules linked in: rfcomm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device ccm cmac algif_hash algif_skcipher af_alg bnep xt_hl ip6t_rt ip6t_REJECT nf_reject_ipv6 xt_LOG nf_log_syslog xt_comment xt_multiport xt_limit xt_addrtype xt_tcpudp xt_conntrack snd_acp_legacy_mach snd_acp_mach nf_conntrack snd_soc_nau8821 snd_soc_dmic snd_acp3x_rn nf_defrag_ipv6 snd_acp3x_pdm_dma intel_rapl_msr intel_rapl_common nf_defrag_ipv4 snd_sof_amd_acp63 libcrc32c snd_sof_amd_vangogh ipt_REJECT nf_reject_ipv4 snd_sof_amd_rembrandt ip6table_filter snd_sof_amd_renoir snd_sof_amd_acp ip6_tables snd_sof_pci kvm_amd iptable_filter snd_hda_codec_realtek snd_sof_xtensa_dsp amdgpu snd_sof snd_hda_codec_generic snd_sof_utils kvm ledtrig_audio iwlmvm snd_soc_core irqbypass crct10dif_pclmul snd_compress snd_hda_intel ac97_bus crc32_pclmul polyval_clmulni amdxcp snd_intel_dspcfg btusb snd_pcm_dmaengine drm_exec polyval_generic snd_intel_sdw_acpi mac80211 btrtl mousedev gf128mul uvcvideo gpu_sched snd_pci_ps joydev snd_hda_codec
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ghash_clmulni_intel btintel snd_rpl_pci_acp6x videobuf2_vmalloc sha512_ssse3 btbcm drm_buddy hid_multitouch uvc snd_acp_pci libarc4 i2c_algo_bit btmtk vfat sha256_ssse3 snd_hda_core videobuf2_memops ptp snd_acp_legacy_common drm_suballoc_helper fat pps_core videobuf2_v4l2 snd_hwdep snd_pci_acp6x sha1_ssse3 drm_ttm_helper ttm bluetooth aesni_intel snd_pci_acp5x videodev snd_pcm drm_display_helper snd_rn_pci_acp3x iwlwifi crypto_simd hid_generic dcdbas snd_timer snd_acp_config cec cryptd cfg80211 snd wmi_bmof dell_wmi_descriptor videobuf2_common ecdh_generic video snd_soc_acpi i2c_hid_acpi sp5100_tco mc usbhid snd_pci_acp3x ccp rfkill soundcore i2c_piix4 k10temp rapl pcspkr wmi i2c_hid mac_hid i2c_dev crypto_user fuse dm_mod loop nfnetlink ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 nvme nvme_core xhci_pci crc32c_intel nvme_auth xhci_pci_renesas
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Unloaded tainted modules: nvidia(POE):3
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: CPU: 8 PID: 2321 Comm: kworker/u32:26 Tainted: P        W  OE      6.7.9-arch1-1 #1 ad54415bbff2f0801422a3b76df850f68e71ecab
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Hardware name: Razer Blade 14 - RZ09-0370/PI411, BIOS 1.06 06/07/2021
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 ea a7 52 da e9 5a fd ff ff <0f> 0b b8 ea ff ff ff e9 d9 a7 52 da b8 ea ff ff ff e9 cf a7 52 da
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: RSP: 0018:ffffa575c3717c68 EFLAGS: 00010246
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: RAX: ffff8c1921bceb00 RBX: ffff8c1904d80000 RCX: 0000000000000000
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: RDX: 0000000000000000 RSI: ffff8c1904da4428 RDI: ffff8c1904d80000
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: RBP: ffff8c1904d80000 R08: 000000000003a5c0 R09: 0000000000000006
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: R10: 0000000000000008 R11: 0000000000000000 R12: 0000000000001050
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: R13: ffff8c1904dba128 R14: ffff8c1927800c00 R15: 0000000000000000
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: FS:  0000000000000000(0000) GS:ffff8c1bde800000(0000) knlGS:0000000000000000
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: CR2: 00007480610d2010 CR3: 000000018d820000 CR4: 0000000000f50ef0
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: PKRU: 55555554
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: Call Trace:
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  <TASK>
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? __warn+0x81/0x130
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? report_bug+0x171/0x1a0
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? handle_bug+0x3c/0x80
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? exc_invalid_op+0x17/0x70
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? asm_exc_invalid_op+0x1a/0x20
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  gfx_v9_0_hw_fini+0x46/0x740 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_device_ip_suspend_phase2+0x105/0x1a0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? amdgpu_device_ip_suspend_phase1+0x6f/0xe0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_device_pre_asic_reset+0xd3/0x2a0 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_device_gpu_recover+0x476/0xc90 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? ___drm_dbg+0x60/0xd0
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  amdgpu_job_timedout+0x186/0x270 [amdgpu 164728a6c26992f7aed1675a5d67c737dcdcdbdf]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  drm_sched_job_timedout+0x85/0x120 [gpu_sched fb54e3185d2218cc261be59aab22418ea255661c]
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  process_one_work+0x17b/0x350
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  worker_thread+0x30f/0x450
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? __pfx_worker_thread+0x10/0x10
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  kthread+0xe8/0x120
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? __pfx_kthread+0x10/0x10
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ret_from_fork+0x34/0x50
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ? __pfx_kthread+0x10/0x10
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  ret_from_fork_asm+0x1b/0x30
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel:  </TASK>
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: ---[ end trace 0000000000000000 ]---
Mar 11 02:17:25 rzr-ntbk-archlinux-sing1 kernel: [drm] psp gfx command DESTROY_TMR(0x7) failed and response status is (0xFFFF0007)

https://gitlab.freedesktop.org/drm/amd/-/issues/2403

How reliably does this reproduce, and does it still happen when you boot w/ "pcie_aspm=off"?
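
To rule out that the parameter was silently dropped by the boot loader, it may help to confirm it on the running system. A rough sketch, assuming a POSIX shell; `/proc/cmdline` and `/sys/module/pcie_aspm/parameters/policy` are standard kernel interfaces, while `has_param` is a made-up helper name.

```shell
# Hypothetical helper: succeed only if the exact parameter appears as a
# space-separated word in the given kernel command line.
has_param() {
    cmdline=$1
    param=$2
    case " $cmdline " in
        *" $param "*) return 0 ;;
        *) return 1 ;;
    esac
}

# Typical use on the running system:
#   has_param "$(cat /proc/cmdline)" pcie_aspm=off && echo "parameter is active"
#   cat /sys/module/pcie_aspm/parameters/policy   # active policy is shown in [brackets]
```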

#3 2024-03-12 07:39:03

sisama
Member
Registered: 2024-03-11
Posts: 4

Re: Freeze/failure when resuming from sleep

Thanks for looking into this Seth; I really appreciate it.

No luck with `pcie_aspm=off`. After waking from sleep, the system freezes and cycles the display on and off while the fans ramp up. It takes a long time to reach the greeter, which only appears if I press Ctrl+Alt+F2, and even then it is frozen: there is no way to enter text or place the cursor, although touchpad input is still visible.

journal output: https://0x0.st/Hh3l.txt

#4 2024-03-12 09:19:57

seth
Member
From: Won't reply 2 private help req
Registered: 2012-09-03
Posts: 76,016

Re: Freeze/failure when resuming from sleep

Remove all kernel parameters that were added as mitigation attempts (cstate? nomwait?)

This looks odd:

Mar 12 03:31:12 rzr-ntbk-archlinux-sing1 kernel: pci 0000:01:00.0: [10de:249d] type 00 class 0x030000
Mar 12 03:31:12 rzr-ntbk-archlinux-sing1 kernel: pci 0000:01:00.1: [10de:228b] type 00 class 0x040300
…
Mar 12 03:31:13 rzr-ntbk-archlinux-sing1 kernel: NVRM: No NVIDIA GPU found.

There /is/ an RTX 3070, the device isn't taken by e.g. vfio or pci_stub, and there are no signs of bbswitch powering it down either.
Are you using some legacy nvidia driver (eg. 390xx)?
Did you disable the GPU in the BIOS?
Did you try to update or reset the BIOS?
Is this on battery or external power supply?
Do you experience the same w/ s2idle?
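
Whether a driver such as vfio or pci_stub has claimed the NVIDIA device can be checked with `lspci`. A small sketch, assuming `pciutils` is installed; `bound_driver` is a hypothetical helper that only parses `lspci -k` style text.

```shell
# Hypothetical helper: print the driver name from `lspci -k` style output
# read on stdin; prints nothing when no driver is bound.
bound_driver() {
    sed -n 's/^[[:space:]]*Kernel driver in use: //p'
}

# Typical use (10de is the NVIDIA vendor ID seen in the log lines above):
#   lspci -nnk -d 10de: | bound_driver
```

Empty output would mean no kernel driver currently owns the device.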

https://gitlab.freedesktop.org/drm/amd/ … te_2181391 seems to resort to just using the nvidia GPU…

Edit: while somewhat nuts, "pcie_aspm=1 pcie_aspm.policy=performance amdgpu.aspm=1" might be worth a shot

Last edited by seth (2024-03-12 09:21:38)

#5 2024-03-13 01:07:52

sisama
Member
Registered: 2024-03-11
Posts: 4

Re: Freeze/failure when resuming from sleep

Yes, there is an RTX 3070, which I have completely powered down following https://wiki.archlinux.org/title/Hybrid … screte_GPU since I couldn't get PRIME to work and would much prefer longer battery life.

> Are you using some legacy nvidia driver (eg. 390xx)?
I installed the latest NVIDIA driver when attempting to get PRIME to work:
```
$ pacman -Q nvidia
nvidia 550.54.14-5
```
> Did you disable the GPU in the BIOS?
> Did you try to update or reset the BIOS?
No to both of these questions. My BIOS is quite basic and I couldn't find an option to do so.

> Is this on battery or external power supply?
It was on battery

> Do you experience the same w/ s2idle?
I guess `deep` is the mode currently selected, based on
```
$ cat /sys/power/mem_sleep
s2idle [deep]
```
I can try with s2idle.
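
A minimal sketch of reading and switching the suspend mode, assuming the usual sysfs interface shown above. The bracketed entry is the active mode and the others are merely available; writing to the file needs root and does not persist across reboots. `active_sleep_mode` is a hypothetical helper name.

```shell
# Hypothetical helper: extract the bracketed (active) entry from the
# contents of /sys/power/mem_sleep, e.g. "s2idle [deep]" -> "deep".
active_sleep_mode() {
    printf '%s\n' "$1" | sed -n 's/.*\[\([^]]*\)].*/\1/p'
}

# Typical use:
#   active_sleep_mode "$(cat /sys/power/mem_sleep)"
#   echo s2idle | sudo tee /sys/power/mem_sleep   # switch until the next reboot
```

If s2idle turns out to behave better, the `mem_sleep_default=s2idle` kernel parameter should make the choice persistent.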

#6 2024-03-13 01:14:03

sisama
Member
Registered: 2024-03-11
Posts: 4

Re: Freeze/failure when resuming from sleep

The behavior seems inconsistent: I was able to sleep and resume successfully twice on this boot. The one difference is that I changed the Sleep setting to "Suspend" from "Sleep, then Hibernate".
Journal - https://0x0.st/Hhkx.txt
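
To compare boots, a rough sketch that counts suspend attempts and completed resumes in a journal. It assumes the kernel logs `PM: suspend entry (...)` and `PM: suspend exit` lines, as recent kernels do; `suspend_summary` is a hypothetical helper name.

```shell
# Hypothetical helper: count suspend entries and exits in journal text
# read on stdin and print them on one line.
suspend_summary() {
    awk '/PM: suspend entry/ { e++ }
         /PM: suspend exit/  { x++ }
         END { printf "entries=%d exits=%d\n", e, x }'
}

# Typical use (kernel messages of the current boot):
#   journalctl -k -b 0 --no-pager | suspend_summary
```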
