You are not logged in.
Hello,
for some time now my GPU keeps failing while playing any game on steam. Most of the time it lets me play a little before a fail. The sound is still there, but I get no view in the monitor. I have tried to wait for nvidia and kernel updates, but those didnt solve anything. I have tried to downgrade linux kernel, linux-headers, nvidia, nvidia-dkms to versions prior the problem. This also did not help. I have tried to disable plasma-powerdevil, but this didnt help also. I am running NVIDIA 1660 super, with KDE, Wayland. When I look at the journalctl I see this in the logs:
spal. 17 23:16:26 Pysius kernel: [drm:__nv_drm_semsurf_wait_fence_work_cb [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000a00] Failed to register auto-value-update on pre-wait value for sync FD semaphore surface
spal. 17 23:16:26 Pysius kernel: nvidia-gpu 0000:0a:00.3: Unable to change power state from D3hot to D0, device inaccessible
spal. 17 23:16:26 Pysius kernel: xhci_hcd 0000:0a:00.2: Unable to change power state from D3hot to D0, device inaccessible
spal. 17 23:16:26 Pysius kernel: xhci_hcd 0000:0a:00.2: Unable to change power state from D3cold to D0, device inaccessible
spal. 17 23:16:26 Pysius kernel: xhci_hcd 0000:0a:00.2: Controller not ready at resume -19
spal. 17 23:16:26 Pysius kernel: xhci_hcd 0000:0a:00.2: PCI post-resume error -19!
spal. 17 23:16:26 Pysius kernel: xhci_hcd 0000:0a:00.2: HC died; cleaning up
spal. 17 23:16:31 Pysius kwin_wayland[965]: kwin_wayland_drm: Failed to create framebuffer: Invalid argument
spal. 17 23:16:31 Pysius kernel: [drm] [nvidia-drm] [GPU ID 0x00000a00] Framebuffer memory not appropriate for scanout
spal. 17 23:16:31 Pysius kernel: [drm] [nvidia-drm] [GPU ID 0x00000a00] Framebuffer memory not appropriate for scanout
spal. 17 23:16:36 Pysius kwin_wayland[965]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
spal. 17 23:16:41 Pysius kwin_wayland[965]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
spal. 17 23:16:46 Pysius kwin_wayland[965]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
spal. 17 23:16:49 Pysius kernel: [drm:nv_drm_master_drop [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000a00] nv_drm_atomic_helper_disable_all failed with error code -22 !
spal. 17 23:16:51 Pysius kwin_wayland[965]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
spal. 17 23:16:52 Pysius kernel: [drm:nv_drm_atomic_commit [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000a00] Flip event timeout on head 0
spal. 17 23:16:55 Pysius kernel: [drm:nv_drm_atomic_commit [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000a00] Flip event timeout on head 0
spal. 17 23:16:55 Pysius kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000c57d:0:0:0x0000000f
spal. 17 23:16:55 Pysius kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000c57e:0:0:0x0000000f
spal. 17 23:16:55 Pysius kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000c57e:1:0:0x0000000f
spal. 17 23:16:55 Pysius kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000c57e:2:0:0x0000000f
spal. 17 23:16:55 Pysius kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000c57e:3:0:0x0000000f
spal. 17 23:16:55 Pysius kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000c57e:4:0:0x0000000f
spal. 17 23:16:55 Pysius kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000c57e:5:0:0x0000000f
spal. 17 23:16:55 Pysius kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000c57e:6:0:0x0000000f
spal. 17 23:16:55 Pysius kernel: nvidia-modeset: ERROR: GPU:0: Failed to query display engine channel state: 0x0000c57e:7:0:0x0000000f
spal. 17 23:16:56 Pysius kwin_wayland[965]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
spal. 17 23:16:58 Pysius kernel: [drm:nv_drm_atomic_commit [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000a00] Flip event timeout on head 0
spal. 17 23:17:00 Pysius kernel: usb 8-2.3: USB disconnect, device number 3
spal. 17 23:17:01 Pysius kernel: [drm:__nv_drm_semsurf_wait_fence_work_cb [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000a00] Failed to register auto-value-update on pre-wait value for sync FD semaphore surface
spal. 17 23:17:00 Pysius plasmashell[1092]: qt.qpa.wayland: eglSwapBuffers failed with 0x300d, surface: 0x5e68e7dc87a0
spal. 17 23:17:01 Pysius plasmashell[1092]: qt.qpa.wayland: eglSwapBuffers failed with 0x300d, surface: 0x5e68e7dc87a0
spal. 17 23:17:01 Pysius kwin_wayland[965]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
spal. 17 23:17:01 Pysius kernel: [drm:nv_drm_atomic_commit [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000a00] Flip event timeout on head 0
Any suggestions how to proceed further with the investigation? I thought maybe installing X11 and seeing what happens, or even to install Windows to confirm it is not a hardware problem, but I thought that I will post here first.
Best regards
Offline
I have upgraded to the latest versions again and set nvidia-drm.fbdev=0 as seen in another similar post in here. What changed is that the fonts look messed up during the boot. The games still crash, but now I see another error:
spal. 18 17:47:04 Pysius kernel: NVRM: GPU at PCI:0000:0a:00: GPU-3e12cf44-828b-b5aa-593a-e4c906b2e37b
spal. 18 17:47:04 Pysius kernel: NVRM: Xid (PCI:0000:0a:00): 79, pid='<unknown>', name=<unknown>, GPU has fallen off the bus.
spal. 18 17:47:04 Pysius kernel: NVRM: GPU 0000:0a:00.0: GPU has fallen off the bus.
spal. 18 17:47:05 Pysius kernel: nvidia-gpu 0000:0a:00.3: Unable to change power state from D3hot to D0, device inaccessible
spal. 18 17:47:05 Pysius kernel: xhci_hcd 0000:0a:00.2: Unable to change power state from D3hot to D0, device inaccessible
spal. 18 17:47:05 Pysius kernel: xhci_hcd 0000:0a:00.2: Unable to change power state from D3cold to D0, device inaccessible
spal. 18 17:47:05 Pysius kernel: xhci_hcd 0000:0a:00.2: Controller not ready at resume -19
spal. 18 17:47:05 Pysius kernel: xhci_hcd 0000:0a:00.2: PCI post-resume error -19!
spal. 18 17:47:05 Pysius kernel: xhci_hcd 0000:0a:00.2: HC died; cleaning up
spal. 18 17:47:06 Pysius systemd[875]: Starting Virtual filesystem service - disk device monitor...
spal. 18 17:47:06 Pysius systemd[875]: Started Virtual filesystem service - disk device monitor.
spal. 18 17:47:06 Pysius systemd[875]: Created slice Slice /app/dbus-:1.2-org.gnome.DejaDup.
spal. 18 17:47:06 Pysius systemd[875]: Started dbus-:1.2-org.gnome.DejaDup@0.service.
spal. 18 17:47:06 Pysius deja-dup[2636]: Unknown key gtk-modules in /home/spurgis/.config/gtk-4.0/settings.ini
spal. 18 17:47:07 Pysius xdg-desktop-portal-kde[1103]: xdp-kde-settings: Namespace "org.gnome.desktop.a11y.interface" is not supported
spal. 18 17:47:07 Pysius deja-dup[2636]: Using GtkSettings:gtk-application-prefer-dark-theme with libadwaita is unsupported. Please use AdwStyleManager:color-scheme instead.
spal. 18 17:47:09 Pysius kwin_wayland[918]: kwin_wayland_drm: Failed to create framebuffer: Invalid argument
spal. 18 17:47:09 Pysius kernel: [drm] [nvidia-drm] [GPU ID 0x00000a00] Framebuffer memory not appropriate for scanout
spal. 18 17:47:09 Pysius kernel: [drm] [nvidia-drm] [GPU ID 0x00000a00] Framebuffer memory not appropriate for scanout
spal. 18 17:47:14 Pysius kwin_wayland[918]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
spal. 18 17:47:19 Pysius steam[1683]: [2024-10-18 17:47:19] Background update loop checking for update. . .
spal. 18 17:47:19 Pysius steam[1683]: [2024-10-18 17:47:19] Checking for available updates...
spal. 18 17:47:19 Pysius steam[1683]: [2024-10-18 17:47:19] Downloading manifest: https://client-update.akamai.steamstati … t_ubuntu12
spal. 18 17:47:19 Pysius steam[1683]: [2024-10-18 17:47:19] Manifest download: send request
spal. 18 17:47:19 Pysius kwin_wayland[918]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
spal. 18 17:47:19 Pysius steam[1683]: [2024-10-18 17:47:19] Manifest download: waiting for download to finish
spal. 18 17:47:20 Pysius steam[1683]: [2024-10-18 17:47:20] Manifest download: finished
spal. 18 17:47:20 Pysius steam[1683]: [2024-10-18 17:47:20] Download skipped by HTTP 304 Not Modified
spal. 18 17:47:20 Pysius steam[1683]: [2024-10-18 17:47:20] Nothing to do
spal. 18 17:47:24 Pysius kwin_wayland[918]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
spal. 18 17:47:29 Pysius kwin_wayland[918]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Offline