You are not logged in.

#1 2024-10-16 17:27:08

dimich
Member
From: Kharkiv, Ukraine
Registered: 2009-11-03
Posts: 283

[SOLVED] nvidia and linux >= 6.11.2: kernel errors and warnings

I noticed Nvidia "Flip event timeout on head" ERRORs in dmesg followed by WARNINGs with call trace:

[    8.192423] [drm] Initialized nvidia-drm 0.0.0 for 0000:01:00.0 on minor 0
[    8.192488] Console: switching to colour dummy device 80x25
[    8.192520] nvidia 0000:01:00.0: vgaarb: deactivate vga console
[    8.290799] fbcon: nvidia-drmdrmfb (fb0) is primary device
[    8.501935] Console: switching to colour frame buffer device 240x67
[   11.577139] [drm:nv_drm_atomic_commit [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000100] Flip event timeout on head 0
[   14.777189] [drm:nv_drm_atomic_commit [nvidia_drm]] *ERROR* [nvidia-drm] [GPU ID 0x00000100] Flip event timeout on head 1
[   14.782789] nvidia 0000:01:00.0: [drm] fb0: nvidia-drmdrmfb frame buffer device
[   14.920674] ------------[ cut here ]------------
[   14.920676] WARNING: CPU: 5 PID: 554 at /build/nvidia/src/nvidia/560.35.03/build/nvidia-drm/nvidia-drm-crtc.h:264 __nv_drm_handle_flip_event+0x1b0/0x1c0 [nvidia_drm]
[   14.920684] Modules linked in: nft_masq nft_chain_nat nft_ct nf_log_syslog nft_log nf_tables ip6table_filter ip6_tables vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) usbip_host usbip_core pkcs8_key_parser snd_sof_pci_intel_cnl snd_sof_intel_hda_generic soundwire_intel soundwire_cadence joydev mousedev snd_sof_intel_hda_common snd_sof_intel_hda_mlink nvidia_uvm(POE) nvidia_drm(POE) snd_sof_intel_hda nvidia_modeset(POE) snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils soundwire_generic_allocation soundwire_bus snd_soc_avs snd_soc_hda_codec snd_soc_skl intel_rapl_msr rtw88_8822bu intel_rapl_common rtw88_usb snd_soc_hdac_hda rtw88_8822b intel_uncore_frequency snd_hda_ext_core intel_uncore_frequency_common rtw88_core snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_acpi_intel_match mac80211 intel_tcc_cooling nvidia(POE) snd_soc_acpi x86_pkg_temp_thermal snd_soc_core snd_hda_codec_realtek intel_powerclamp libarc4 snd_hda_codec_generic snd_compress ac97_bus snd_hda_scodec_component snd_hda_codec_hdmi coretemp snd_pcm_dmaengine
[   14.920710]  cfg80211 snd_hda_intel kvm_intel hid_generic snd_intel_dspcfg snd_intel_sdw_acpi rfkill usbhid snd_hda_codec iTCO_wdt kvm snd_hda_core intel_pmc_bxt iTCO_vendor_support ee1004 snd_hwdep rapl mei_pxp mei_hdcp snd_pcm nf_nat intel_cstate intel_uncore r8169 nf_conntrack snd_timer psmouse gigabyte_wmi realtek drm_ttm_helper nf_defrag_ipv6 spi_nor snd i2c_i801 ttm mdio_devres nf_defrag_ipv4 i2c_smbus wmi_bmof soundcore mtd libcrc32c intel_wmi_thunderbolt libphy i2c_mux tun slhc intel_pmc_core mei_me intel_vsec mei pinctrl_cannonlake ecryptfs intel_pch_thermal pmt_telemetry pmt_class acpi_pad mac_hid cbc encrypted_keys trusted asn1_encoder tee overlay vfat fat nls_iso8859_1 nls_cp866 nls_utf8 cuse cp210x pl2303 usb_storage spidev ppdev parport_pc parport i2c_dev sg crypto_user loop nfnetlink ip_tables x_tables raid1 md_mod crct10dif_pclmul crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel serio_raw atkbd sha512_ssse3 nvme sha256_ssse3 libps2 vivaldi_fmap sha1_ssse3 nvme_core aesni_intel gf128mul
[   14.920750]  nvme_auth crypto_simd spi_intel_pci cryptd spi_intel xhci_pci xhci_pci_renesas i8042 video serio wmi dm_mod ext4 crc32c_generic crc32c_intel crc16 mbcache jbd2
[   14.920757] CPU: 5 UID: 0 PID: 554 Comm: nvidia-modeset/ Tainted: P           OE      6.11.3-arch1-1 #1 1400000003000000474e55000681d53aa6c7b79b
[   14.920760] Tainted: [P]=PROPRIETARY_MODULE, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
[   14.920761] Hardware name: Gigabyte Technology Co., Ltd. Z390 D/Z390 D, BIOS F3 11/05/2021
[   14.920762] RIP: 0010:__nv_drm_handle_flip_event+0x1b0/0x1c0 [nvidia_drm]
[   14.920766] Code: 4d 8d 7e 10 4c 89 62 18 e8 ad 52 00 00 49 8b 46 10 4c 89 f2 48 8d 70 f0 49 39 df 75 9f e9 d5 fe ff ff 4c 89 e7 e8 80 5c 78 c2 <0f> 0b e9 bd fe ff ff 66 0f 1f 84 00 00 00 00 00 90 90 90 90 90 90
[   14.920767] RSP: 0018:ffff9c2d015f7dc0 EFLAGS: 00010282
[   14.920769] RAX: ffff8f5aa6b20008 RBX: ffff8f5aa6b20008 RCX: ffff8f5a937622e0
[   14.920770] RDX: 0000000000000001 RSI: 0000000000000001 RDI: ffff8f5aa6b20018
[   14.920771] RBP: ffff8f5aa6b20000 R08: 0000000000000000 R09: 0000000000000014
[   14.920771] R10: 0000000000000001 R11: 0000000000000000 R12: ffff8f5aa6b20018
[   14.920772] R13: ffff8f5a84628808 R14: ffff8f5a93762000 R15: ffffffffc5c23010
[   14.920773] FS:  0000000000000000(0000) GS:ffff8f79bdc80000(0000) knlGS:0000000000000000
[   14.920774] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   14.920775] CR2: 00005aee59766a90 CR3: 0000000a4ca22002 CR4: 00000000003706f0
[   14.920776] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   14.920776] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   14.920777] Call Trace:
[   14.920779]  <TASK>
[   14.920779]  ? __nv_drm_handle_flip_event+0x1b0/0x1c0 [nvidia_drm 1400000003000000474e55007e5fca96e6499d48]
[   14.920783]  ? __warn.cold+0x8e/0xe8
[   14.920785]  ? __nv_drm_handle_flip_event+0x1b0/0x1c0 [nvidia_drm 1400000003000000474e55007e5fca96e6499d48]
[   14.920789]  ? report_bug+0xff/0x140
[   14.920792]  ? handle_bug+0x3c/0x80
[   14.920793]  ? exc_invalid_op+0x17/0x70
[   14.920794]  ? asm_exc_invalid_op+0x1a/0x20
[   14.920796]  ? __pfx__main_loop+0x10/0x10 [nvidia_modeset 1400000003000000474e5500a2f7fe5916f20077]
[   14.920816]  ? __nv_drm_handle_flip_event+0x1b0/0x1c0 [nvidia_drm 1400000003000000474e55007e5fca96e6499d48]
[   14.920820]  ? __pfx__main_loop+0x10/0x10 [nvidia_modeset 1400000003000000474e5500a2f7fe5916f20077]
[   14.920835]  nv_drm_event_callback+0x82/0x90 [nvidia_drm 1400000003000000474e55007e5fca96e6499d48]
[   14.920839]  nvKmsKapiHandleEventQueueChange+0xa0/0xd0 [nvidia_modeset 1400000003000000474e5500a2f7fe5916f20077]
[   14.920865]  _main_loop+0x95/0x150 [nvidia_modeset 1400000003000000474e5500a2f7fe5916f20077]
[   14.920882]  kthread+0xcf/0x100
[   14.920884]  ? __pfx_kthread+0x10/0x10
[   14.920886]  ret_from_fork+0x31/0x50
[   14.920888]  ? __pfx_kthread+0x10/0x10
[   14.920889]  ret_from_fork_asm+0x1a/0x30
[   14.920892]  </TASK>
[   14.920893] ---[ end trace 0000000000000000 ]---
[   14.921427] ------------[ cut here ]------------
[   14.921428] WARNING: CPU: 5 PID: 554 at /build/nvidia/src/nvidia/560.35.03/build/nvidia-drm/nvidia-drm-crtc.h:264 __nv_drm_handle_flip_event+0x1b0/0x1c0 [nvidia_drm]
[   14.921434] Modules linked in: nft_masq nft_chain_nat nft_ct nf_log_syslog nft_log nf_tables ip6table_filter ip6_tables vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) usbip_host usbip_core pkcs8_key_parser snd_sof_pci_intel_cnl snd_sof_intel_hda_generic soundwire_intel soundwire_cadence joydev mousedev snd_sof_intel_hda_common snd_sof_intel_hda_mlink nvidia_uvm(POE) nvidia_drm(POE) snd_sof_intel_hda nvidia_modeset(POE) snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils soundwire_generic_allocation soundwire_bus snd_soc_avs snd_soc_hda_codec snd_soc_skl intel_rapl_msr rtw88_8822bu intel_rapl_common rtw88_usb snd_soc_hdac_hda rtw88_8822b intel_uncore_frequency snd_hda_ext_core intel_uncore_frequency_common rtw88_core snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_acpi_intel_match mac80211 intel_tcc_cooling nvidia(POE) snd_soc_acpi x86_pkg_temp_thermal snd_soc_core snd_hda_codec_realtek intel_powerclamp libarc4 snd_hda_codec_generic snd_compress ac97_bus snd_hda_scodec_component snd_hda_codec_hdmi coretemp snd_pcm_dmaengine
[   14.921455]  cfg80211 snd_hda_intel kvm_intel hid_generic snd_intel_dspcfg snd_intel_sdw_acpi rfkill usbhid snd_hda_codec iTCO_wdt kvm snd_hda_core intel_pmc_bxt iTCO_vendor_support ee1004 snd_hwdep rapl mei_pxp mei_hdcp snd_pcm nf_nat intel_cstate intel_uncore r8169 nf_conntrack snd_timer psmouse gigabyte_wmi realtek drm_ttm_helper nf_defrag_ipv6 spi_nor snd i2c_i801 ttm mdio_devres nf_defrag_ipv4 i2c_smbus wmi_bmof soundcore mtd libcrc32c intel_wmi_thunderbolt libphy i2c_mux tun slhc intel_pmc_core mei_me intel_vsec mei pinctrl_cannonlake ecryptfs intel_pch_thermal pmt_telemetry pmt_class acpi_pad mac_hid cbc encrypted_keys trusted asn1_encoder tee overlay vfat fat nls_iso8859_1 nls_cp866 nls_utf8 cuse cp210x pl2303 usb_storage spidev ppdev parport_pc parport i2c_dev sg crypto_user loop nfnetlink ip_tables x_tables raid1 md_mod crct10dif_pclmul crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel serio_raw atkbd sha512_ssse3 nvme sha256_ssse3 libps2 vivaldi_fmap sha1_ssse3 nvme_core aesni_intel gf128mul
[   14.921486]  nvme_auth crypto_simd spi_intel_pci cryptd spi_intel xhci_pci xhci_pci_renesas i8042 video serio wmi dm_mod ext4 crc32c_generic crc32c_intel crc16 mbcache jbd2
[   14.921493] CPU: 5 UID: 0 PID: 554 Comm: nvidia-modeset/ Tainted: P        W  OE      6.11.3-arch1-1 #1 1400000003000000474e55000681d53aa6c7b79b
[   14.921496] Tainted: [P]=PROPRIETARY_MODULE, [W]=WARN, [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
[   14.921496] Hardware name: Gigabyte Technology Co., Ltd. Z390 D/Z390 D, BIOS F3 11/05/2021
[   14.921497] RIP: 0010:__nv_drm_handle_flip_event+0x1b0/0x1c0 [nvidia_drm]
[   14.921501] Code: 4d 8d 7e 10 4c 89 62 18 e8 ad 52 00 00 49 8b 46 10 4c 89 f2 48 8d 70 f0 49 39 df 75 9f e9 d5 fe ff ff 4c 89 e7 e8 80 5c 78 c2 <0f> 0b e9 bd fe ff ff 66 0f 1f 84 00 00 00 00 00 90 90 90 90 90 90
[   14.921502] RSP: 0018:ffff9c2d015f7dc0 EFLAGS: 00010282
[   14.921503] RAX: ffff8f5aa6b1a008 RBX: ffff8f5aa6b1a008 RCX: ffff8f5a937622e0
[   14.921504] RDX: 0000000000000001 RSI: 0000000000000000 RDI: ffff8f5aa6b1a018
[   14.921505] RBP: ffff8f5aa6b1a000 R08: 0000000000000000 R09: 0000000000000014
[   14.921506] R10: 0000000000000001 R11: 0000000000000000 R12: ffff8f5aa6b1a018
[   14.921507] R13: ffff8f5a84628808 R14: ffff8f5a93762000 R15: ffffffffc5c23010
[   14.921507] FS:  0000000000000000(0000) GS:ffff8f79bdc80000(0000) knlGS:0000000000000000
[   14.921508] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   14.921509] CR2: 00005aee59766a90 CR3: 0000000a4ca22002 CR4: 00000000003706f0
[   14.921510] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   14.921511] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   14.921512] Call Trace:
[   14.921512]  <TASK>
[   14.921513]  ? __nv_drm_handle_flip_event+0x1b0/0x1c0 [nvidia_drm 1400000003000000474e55007e5fca96e6499d48]
[   14.921517]  ? __warn.cold+0x8e/0xe8
[   14.921518]  ? __nv_drm_handle_flip_event+0x1b0/0x1c0 [nvidia_drm 1400000003000000474e55007e5fca96e6499d48]
[   14.921522]  ? report_bug+0xff/0x140
[   14.921524]  ? handle_bug+0x3c/0x80
[   14.921525]  ? exc_invalid_op+0x17/0x70
[   14.921526]  ? asm_exc_invalid_op+0x1a/0x20
[   14.921528]  ? __pfx__main_loop+0x10/0x10 [nvidia_modeset 1400000003000000474e5500a2f7fe5916f20077]
[   14.921545]  ? __nv_drm_handle_flip_event+0x1b0/0x1c0 [nvidia_drm 1400000003000000474e55007e5fca96e6499d48]
[   14.921549]  ? __pfx__main_loop+0x10/0x10 [nvidia_modeset 1400000003000000474e5500a2f7fe5916f20077]
[   14.921564]  nv_drm_event_callback+0x82/0x90 [nvidia_drm 1400000003000000474e55007e5fca96e6499d48]
[   14.921568]  nvKmsKapiHandleEventQueueChange+0xa0/0xd0 [nvidia_modeset 1400000003000000474e5500a2f7fe5916f20077]
[   14.921594]  _main_loop+0x95/0x150 [nvidia_modeset 1400000003000000474e5500a2f7fe5916f20077]
[   14.921610]  kthread+0xcf/0x100
[   14.921613]  ? __pfx_kthread+0x10/0x10
[   14.921614]  ret_from_fork+0x31/0x50
[   14.921616]  ? __pfx_kthread+0x10/0x10
[   14.921617]  ret_from_fork_asm+0x1a/0x30
[   14.921620]  </TASK>
[   14.921620] ---[ end trace 0000000000000000 ]---

(The only difference between two warnings besides registers content is '[W]=WARN' flag.)

Full dmesg output is here.

Video card: 01:00.0 VGA compatible controller: NVIDIA Corporation TU116 [GeForce GTX 1660 SUPER] (rev a1)

According to journal, it started to occur on Oct, 6, when linux upgraded to 6.11.2-arch1-1. However, the very first appearance is reported in the latest boot of kernel Linux version 6.11.1-arch1-1, and it hasn't occured in any previous 6.11.1-arch1-1 boots. Maybe nvidia package has already been upgraded but linux package hasn't. I can't figure out what exactly happened at that transition.

Kernel command line parameters related to nvidia modules are:

nvidia_drm.modeset=1
video=card0-HDMI-A-1:d

The parameters haven't changed recently.

Versions of currently installed packages:
linux 6.11.3.arch1-1
nvidia 560.35.03-14

The errors and warning also occur with nvidia-dkms 560.35.03-16

The errors and warnings don't occur with linux-lts 6.6.56-1 and corresponding nvidia-lts 1:560.35.03-15.

I've found possibly related topic on Nvidia forums, but it is almost 1 year old, for version 545 and without call trace in dmesg.

The issue doesn't seem to affect further system operation but makes me worry.

Last edited by dimich (2024-10-17 17:01:11)

Offline

#2 2024-10-16 21:28:18

seth
Member
Registered: 2012-09-03
Posts: 60,073

Re: [SOLVED] nvidia and linux >= 6.11.2: kernel errors and warnings

Just a hunch but try "nvidia_drm.fbdev=0", the default has recently changed.

Offline

#3 2024-10-17 17:00:41

dimich
Member
From: Kharkiv, Ukraine
Registered: 2009-11-03
Posts: 283

Re: [SOLVED] nvidia and linux >= 6.11.2: kernel errors and warnings

seth wrote:

Just a hunch but try "nvidia_drm.fbdev=0", the default has recently changed.

Indeed, this eliminates the errors and warnings. Thank you!

However, on my system efifb doesn't detect monitor resolution for some reason, and it resulted in a terrible virtual console resolution 1024x768:

efifb: framebuffer at 0x4000000000, using 3072k, total 3072k
efifb: mode is 1024x768x32, linelength=4096, pages=1

I fixed it by setting explicit resolution in one more kernel parameter:

video=efifb:1920x1080

So for now no errors in dmesg, virtual console resolution is fine:

efifb: framebuffer at 0x4000000000, using 8640k, total 8640k
efifb: mode is 1920x1080x32, linelength=8192, pages=1

and even scroll redraw works smoothly.

Last edited by dimich (2024-10-17 17:03:00)

Offline

Board footer

Powered by FluxBB