You are not logged in.
Hello,
I did a pacman -Syu on Saturday, which pulled in a new kernel. Now, at a seemingly random time, the video and keyboard stop working. It might be whenever the automatic screen blank kicks in, not sure. I'll try to check that when I have a chance.
After this happens, the caps lock and num lock keys no longer cause those lights on the keyboard to turn on, and the screen is black. Switching to a new virtual terminal does not work. The system still accepts ssh connections normally. The Xorg process appears to still be running.
Looking through my kernel.log is a bit difficult because of many gigabytes of these new "ACPI BIOS Error (bug): Could not resolve symbol" messages, which I've heard are now "normal" (grumble...) but otherwise the only odd thing I've found so far is a kernel Oops, which *might* be happening at the time the issue manifests (it's hard to be sure so far, but it does look possibly in the right general range of time). Here it is:
May 2 00:25:11 xez kernel: Oops: 0000 [#1] PREEMPT SMP PTI
May 2 00:25:11 xez kernel: CPU: 2 PID: 101 Comm: kworker/2:1 Not tainted 5.17.5-arch1-1 #1 bff91b48f6c3cb8d3bfd68f772f9c0a96e684769
May 2 00:25:11 xez kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./Z97M OC Formula, BIOS P2.30 05/13/2015
May 2 00:25:11 xez kernel: Workqueue: pm pm_runtime_work
May 2 00:25:11 xez kernel: RIP: 0010:psp_hw_start+0x300/0x550 [amdgpu]
May 2 00:25:11 xez kernel: Code: 2d 07 00 0b 00 83 e0 fd 0f 84 a9 00 00 00 48 89 df e8 44 b9 ff ff 48 8b b3 38 01 00 00 48 8b 3b 49 89 c4 4c 8b ab 40 01 00 00 <48> 8b ae 30 01 00 00 e8 34 03 fa ff 48 c7 c7 10 01 1f c1 49 89 c0
May 2 00:25:11 xez kernel: RSP: 0018:ffffa75680407bb0 EFLAGS: 00010202
May 2 00:25:11 xez kernel: RAX: ffff910543644800 RBX: ffff91054c6d32f0 RCX: 0000000000000000
May 2 00:25:11 xez kernel: RDX: ffff910543058000 RSI: 0000000000000000 RDI: ffff91054c6c0000
May 2 00:25:11 xez kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
May 2 00:25:11 xez kernel: R10: 0000000000000000 R11: 0000000000000002 R12: ffff910543644800
May 2 00:25:11 xez kernel: R13: 0000000000000000 R14: ffff910543644800 R15: 0000000000000000
May 2 00:25:11 xez kernel: FS: 0000000000000000(0000) GS:ffff91085ec80000(0000) knlGS:0000000000000000
May 2 00:25:11 xez kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 2 00:25:11 xez kernel: CR2: 0000000000000130 CR3: 000000025f610006 CR4: 00000000001706e0
May 2 00:25:11 xez kernel: Call Trace:
May 2 00:25:11 xez kernel: <TASK>
May 2 00:25:11 xez kernel: psp_resume+0x76/0x226 [amdgpu cc7df1096636f0450114ea079639416c26399903]
May 2 00:25:11 xez kernel: amdgpu_device_fw_loading+0x78/0x140 [amdgpu cc7df1096636f0450114ea079639416c26399903]
May 2 00:25:11 xez kernel: amdgpu_device_resume+0x6e/0x220 [amdgpu cc7df1096636f0450114ea079639416c26399903]
May 2 00:25:11 xez kernel: amdgpu_pmops_runtime_resume+0x7e/0xe0 [amdgpu cc7df1096636f0450114ea079639416c26399903]
May 2 00:25:11 xez kernel: pci_pm_runtime_resume+0xaa/0xc0
May 2 00:25:11 xez kernel: ? pci_pm_freeze_noirq+0x100/0x100
May 2 00:25:11 xez kernel: __rpm_callback+0x44/0x150
May 2 00:25:11 xez kernel: ? pci_pm_freeze_noirq+0x100/0x100
May 2 00:25:11 xez kernel: rpm_callback+0x59/0x70
May 2 00:25:11 xez kernel: rpm_resume+0x561/0x800
May 2 00:25:11 xez kernel: __pm_runtime_resume+0x4a/0x80
May 2 00:25:11 xez kernel: rpm_get_suppliers+0x3c/0xc0
May 2 00:25:11 xez kernel: ? pci_pm_freeze_noirq+0x100/0x100
May 2 00:25:11 xez kernel: __rpm_callback+0xa2/0x150
May 2 00:25:11 xez kernel: ? pci_pm_freeze_noirq+0x100/0x100
May 2 00:25:11 xez kernel: rpm_callback+0x59/0x70
May 2 00:25:11 xez kernel: rpm_resume+0x561/0x800
May 2 00:25:11 xez kernel: pm_runtime_work+0x6c/0xa0
May 2 00:25:11 xez kernel: process_one_work+0x1e5/0x3b0
May 2 00:25:11 xez kernel: worker_thread+0x50/0x3a0
May 2 00:25:11 xez kernel: ? rescuer_thread+0x3a0/0x3a0
May 2 00:25:11 xez kernel: kthread+0xd8/0x100
May 2 00:25:11 xez kernel: ? kthread_complete_and_exit+0x20/0x20
May 2 00:25:11 xez kernel: ret_from_fork+0x22/0x30
May 2 00:25:11 xez kernel: </TASK>
May 2 00:25:11 xez kernel: CR2: 0000000000000130
May 2 00:25:11 xez kernel: ---[ end trace 0000000000000000 ]---
May 2 00:25:11 xez kernel: RIP: 0010:psp_hw_start+0x300/0x550 [amdgpu]
May 2 00:25:11 xez kernel: Code: 2d 07 00 0b 00 83 e0 fd 0f 84 a9 00 00 00 48 89 df e8 44 b9 ff ff 48 8b b3 38 01 00 00 48 8b 3b 49 89 c4 4c 8b ab 40 01 00 00 <48> 8b ae 30 01 00 00 e8 34 03 fa ff 48 c7 c7 10 01 1f c1 49 89 c0
May 2 00:25:11 xez kernel: RSP: 0018:ffffa75680407bb0 EFLAGS: 00010202
May 2 00:25:11 xez kernel: RAX: ffff910543644800 RBX: ffff91054c6d32f0 RCX: 0000000000000000
May 2 00:25:11 xez kernel: RDX: ffff910543058000 RSI: 0000000000000000 RDI: ffff91054c6c0000
May 2 00:25:11 xez kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
May 2 00:25:11 xez kernel: R10: 0000000000000000 R11: 0000000000000002 R12: ffff910543644800
May 2 00:25:11 xez kernel: R13: 0000000000000000 R14: ffff910543644800 R15: 0000000000000000
May 2 00:25:11 xez kernel: FS: 0000000000000000(0000) GS:ffff91085ec80000(0000) knlGS:0000000000000000
May 2 00:25:11 xez kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 2 00:25:11 xez kernel: CR2: 0000000000000130 CR3: 000000025f610006 CR4: 00000000001706e0If anybody has a clue what any of this means, I'd appreciate it!
Last edited by Xezlec (2022-05-05 02:05:06)
Offline
many gigabytes of these new "ACPI BIOS Error (bug): Could not resolve symbol" messages, which I've heard are now "normal"
No, and I'm not sure that whatever you're actually seeing there may not be related to your current situation.
However: Try passing "amdgpu.runpm=0" to the kernel (at least that's where the oops is and the various *pm features in amdgpu have a track record of acting up, "modinfo amdgpu | grep pm")
Offline
Oh, ok. In that case, here's the exact error message that keeps repeating:
May 3 18:34:00 xez kernel: ACPI Error: Aborting method \_GPE._L09 due to previous error (AE_NOT_FOUND) (20211217/psparse-529)
May 3 18:34:00 xez kernel: ACPI Error: AE_NOT_FOUND, while evaluating GPE method [_L09] (20211217/evgpe-511)
May 3 18:34:00 xez kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_GPE._L09.D1F0], AE_NOT_FOUND (20211217/psargs-330)amdgpu pm thing sounds like a good idea; will definitely see if disabling that stuff helps. Thanks!
Offline
Offline
Welp. My Google skills need work then. Glad to see it's getting addressed.
Regarding the keyboard/video failure, that hasn't happened since I added amdgpu.runpm=0 (and neither has the oops) so I will mark this as [SOLVED] unless it happens again. Thanks again!
Offline