Hi,
after upgrading from kernel version 5.11.16.arch1 to version 5.12.3.arch1, hibernation stopped working for me. Instead of powering off, my PC now turns off the screen and then stays in a semi-dead state that it cannot leave (no keyboard input possible and no display output), so I have to hard-reset it. However, the PC is restored to its previous state after booting. So storing RAM to swap works to some extent, but the subsequent hardware power-off does not.
Additionally, after such a hibernate -> hard reset -> restore cycle, my PC acts up: I get low FPS and stutter in games (CS:GO) until I do a proper reboot. So something is wrong even with the hibernation process itself (RAM to swap).
I downgraded my kernel to 5.11.16-arch1 and hibernation works flawlessly again.
Any ideas how I could track down what causes this issue and help fix it?
Thanks!
Hardware information:
- CPU: AMD Ryzen 9 5900X
- GPU: AMD RX 6900 XT
- MB: Gigabyte X570 AORUS PRO
- RAM: 16 GB G.SKILL
Last edited by tuttiarch (2021-05-14 14:43:15)
Have you tried this?
System does not power off when hibernating
Thanks, I tried it just now. It mitigates the issue but does not fix it. The truth is that both hibernate and suspend are broken, and using
HibernateMode=shutdown
resolves the issue for pure hibernation but not for suspend or the "hybrid-sleep" option (suspend to RAM and to disk).
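For reference, a minimal sketch of where that setting goes, assuming the stock systemd configuration file /etc/systemd/sleep.conf (a drop-in file under /etc/systemd/sleep.conf.d/ should work as well):

```ini
# /etc/systemd/sleep.conf (assumed location; adjust if you use a drop-in)
[Sleep]
# Power the machine off via a normal shutdown after the image is written
HibernateMode=shutdown
```

This only changes how the machine is powered down after the hibernation image has been written; it does not touch suspend to RAM.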
I am not sure but checking the logs it looks to me like AMDGPU is playing a role in this issue (I don't know how much of these messages are considered normal or are simple warnings):
May 14 18:13:08 hme kernel: ------------[ cut here ]------------
May 14 18:13:08 hme kernel: WARNING: CPU: 11 PID: 258 at drivers/gpu/drm/amd/amdgpu/../display/dc/dcn30/dcn30_hwseq.c:975 dcn30_set_hubp_blank+0x206/0x220 [amdgpu]
May 14 18:13:08 hme kernel: Modules linked in: intel_rapl_msr intel_rapl_common amdgpu snd_hda_codec_realtek snd_hda_codec_generic it87 edac_mce_amd hwmon_vid ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg gpu_sched snd_intel_sdw_acpi drm_ttm_helper vfat fat ttm snd_usb_audio snd_hda_codec kvm drm_kms_helper snd_usbmidi_lib snd_>
May 14 18:13:08 hme kernel: CPU: 11 PID: 258 Comm: kworker/u64:2 Tainted: G W 5.12.3-arch1-1 #1
May 14 18:13:08 hme kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS PRO/X570 AORUS PRO, BIOS F33g 03/25/2021
May 14 18:13:08 hme kernel: Workqueue: events_unbound async_run_entry_fn
May 14 18:13:08 hme kernel: RIP: 0010:dcn30_set_hubp_blank+0x206/0x220 [amdgpu]
May 14 18:13:08 hme kernel: Code: 00 00 48 85 db 74 af 48 8b bb e8 00 00 00 31 f6 48 8b 07 48 8b 40 50 e8 a8 fe 60 da 48 8b 9b d0 02 00 00 48 85 db 75 df eb 8c <0f> 0b e9 46 ff ff ff 0f 0b e9 be fe ff ff 0f 0b e9 42 fe ff ff 0f
May 14 18:13:08 hme kernel: RSP: 0018:ffffb48400af3788 EFLAGS: 00010246
May 14 18:13:08 hme kernel: RAX: 0000000000000000 RBX: ffff8b3bc9ca1548 RCX: 0000000000000003
May 14 18:13:08 hme kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff8b3b387a0000
May 14 18:13:08 hme kernel: RBP: ffff8b3bc9ca06c0 R08: ffffb48400af3774 R09: ffffb48400af36d0
May 14 18:13:08 hme kernel: R10: 0000000000000002 R11: 0000000000000000 R12: ffff8b3bc9ca1ef8
May 14 18:13:08 hme kernel: R13: ffff8b3bc9ca0000 R14: ffff8b3b3a470000 R15: ffff8b3b3a470000
May 14 18:13:08 hme kernel: FS: 0000000000000000(0000) GS:ffff8b3e2ecc0000(0000) knlGS:0000000000000000
May 14 18:13:08 hme kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 14 18:13:08 hme kernel: CR2: 0000557efbec4b60 CR3: 0000000135850000 CR4: 0000000000750ee0
May 14 18:13:08 hme kernel: PKRU: 55555554
May 14 18:13:08 hme kernel: Call Trace:
May 14 18:13:08 hme kernel: dcn20_post_unlock_program_front_end+0x140/0x200 [amdgpu]
May 14 18:13:08 hme kernel: dc_commit_state+0x448/0xa70 [amdgpu]
May 14 18:13:08 hme kernel: amdgpu_dm_atomic_commit_tail+0x5f7/0x2660 [amdgpu]
May 14 18:13:08 hme kernel: ? kernel_init_free_pages+0x4d/0x70
May 14 18:13:08 hme kernel: ? get_page_from_freelist+0x1858/0x1920
May 14 18:13:08 hme kernel: ? dcn20_populate_dml_pipes_from_context+0xa95/0xc10 [amdgpu]
May 14 18:13:08 hme kernel: commit_tail+0x94/0x130 [drm_kms_helper]
May 14 18:13:08 hme kernel: drm_atomic_helper_commit+0x113/0x140 [drm_kms_helper]
May 14 18:13:08 hme kernel: drm_atomic_helper_disable_all+0x175/0x190 [drm_kms_helper]
May 14 18:13:08 hme kernel: drm_atomic_helper_suspend+0xaf/0x1f0 [drm_kms_helper]
May 14 18:13:08 hme kernel: dm_suspend+0x48/0x1d0 [amdgpu]
May 14 18:13:08 hme kernel: ? smuio_v11_0_6_update_rom_clock_gating+0x2c/0x70 [amdgpu]
May 14 18:13:08 hme kernel: ? nv_common_set_clockgating_state+0x9a/0xb0 [amdgpu]
May 14 18:13:08 hme kernel: amdgpu_device_ip_suspend_phase1+0x75/0xd0 [amdgpu]
May 14 18:13:08 hme kernel: amdgpu_device_suspend+0x4d/0xa0 [amdgpu]
May 14 18:13:08 hme kernel: amdgpu_pmops_freeze+0x1e/0x40 [amdgpu]
May 14 18:13:08 hme kernel: pci_pm_freeze+0x53/0xb0
May 14 18:13:08 hme kernel: ? pci_pm_poweroff+0xe0/0xe0
May 14 18:13:08 hme kernel: dpm_run_callback+0x4c/0x150
May 14 18:13:08 hme kernel: __device_suspend+0x11c/0x4d0
May 14 18:13:08 hme kernel: async_suspend+0x1b/0x90
May 14 18:13:08 hme kernel: async_run_entry_fn+0x37/0x140
May 14 18:13:08 hme kernel: process_one_work+0x214/0x3e0
May 14 18:13:08 hme kernel: worker_thread+0x4d/0x3d0
May 14 18:13:08 hme kernel: ? process_one_work+0x3e0/0x3e0
May 14 18:13:08 hme kernel: kthread+0x133/0x150
May 14 18:13:08 hme kernel: ? __kthread_bind_mask+0x60/0x60
May 14 18:13:08 hme kernel: ret_from_fork+0x22/0x30
May 14 18:13:08 hme kernel: ---[ end trace e723036490174740 ]---
May 14 18:13:08 hme kernel: [drm] free PSP TMR buffer
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: MODE1 reset
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: GPU smu mode1 reset
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: SMU is resuming...
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: dpm has been enabled
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: SMU is resumed successfully!
May 14 18:13:08 hme kernel: [drm] DMUB hardware initialized: version=0x02000018
[...]
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring sdma2 uses VM inv eng 14 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring sdma3 uses VM inv eng 15 on hub 0
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_dec_1 uses VM inv eng 5 on hub 1
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_enc_1.0 uses VM inv eng 6 on hub 1
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_enc_1.1 uses VM inv eng 7 on hub 1
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: ring jpeg_dec uses VM inv eng 8 on hub 1
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: failed send message: TransferTableSmu2Dram (18) param: 0x00000008 response 0xff
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: Failed to get overdrive table!
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: Failed to setup default OD settings!
May 14 18:13:08 hme kernel: [drm:amdgpu_device_ip_late_init [amdgpu]] *ERROR* late_init of IP block <smu> failed -5
May 14 18:13:08 hme kernel: PM: dpm_run_callback(): pci_pm_restore+0x0/0xe0 returns -5
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: PM: failed to restore async: error -5
May 14 18:13:08 hme kernel: PM: hibernation: Basic memory bitmaps freed
May 14 18:13:08 hme kernel: OOM killer enabled.
May 14 18:13:08 hme kernel: Restarting tasks ... done.
May 14 18:13:08 hme kernel: PM: hibernation: hibernation exit
These lines in the latter part are colored in red:
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: failed send message: TransferTableSmu2Dram (18) param: 0x00000008 response 0xff
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: Failed to get overdrive table!
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: Failed to setup default OD settings!
May 14 18:13:08 hme kernel: [drm:amdgpu_device_ip_late_init [amdgpu]] *ERROR* late_init of IP block <smu> failed -5
May 14 18:13:08 hme kernel: PM: dpm_run_callback(): pci_pm_restore+0x0/0xe0 returns -5
May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: PM: failed to restore async: error -5
And finally:
May 14 18:13:09 hme kernel: amdgpu 0000:0a:00.0: amdgpu: Msg issuing pre-check failed and SMU may be not in the right state!
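In case someone wants to pull the same kind of lines out of their own saved kernel log, here is a rough Python sketch. The patterns are simply substrings taken from the red lines quoted above, and the function name and sample list are made up for illustration, so treat it as a starting point rather than a reliable error detector:

```python
import re

# Assumed patterns: substrings copied from the red log lines above.
# They are not an official list of amdgpu or PM error messages.
ERROR_PATTERNS = re.compile(
    r"failed send message|Failed to|\*ERROR\*|returns -\d+|failed to restore"
)


def amdgpu_errors(lines):
    """Return the log lines that match one of the assumed error patterns."""
    return [line.rstrip("\n") for line in lines if ERROR_PATTERNS.search(line)]


# A few lines taken from the log above, used as sample input.
sample = [
    "May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: SMU is resumed successfully!",
    "May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: failed send message: TransferTableSmu2Dram (18) param: 0x00000008 response 0xff",
    "May 14 18:13:08 hme kernel: amdgpu 0000:0a:00.0: amdgpu: Failed to get overdrive table!",
    "May 14 18:13:08 hme kernel: PM: dpm_run_callback(): pci_pm_restore+0x0/0xe0 returns -5",
    "May 14 18:13:08 hme kernel: PM: hibernation: hibernation exit",
]

for line in amdgpu_errors(sample):
    print(line)
```

To run it against a real log, the sample list can be replaced with the lines of a saved log file, for example by opening the file and passing the file object to amdgpu_errors().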
With the new 5.12.4-arch1-2 kernel I still have the same issue. Help please?