#1 2024-06-14 00:56:41

jslay
Member
Registered: 2024-05-20
Posts: 8

[Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

I am using the beta drivers 550 so that I can try and run Wayland for some development stuff I am doing.

However, every time the kernel is updated, I have to go through a cycle of trying to get the DKMS module to rebuild and work.

I have no idea what is going wrong. I initially thought it was the GCC version, but I've tried literally every version from the last 3 months. It seems like pure luck when I do happen to get it to build successfully.

What am I doing wrong here? I have updated the linux and linux-headers pkgs. I have up-to-date GCC 14.1.1, so what gives? What is the correct process to do this? It used to build and work just fine whenever the kernel was upgraded, until GCC 14 was released.

$ cat /var/lib/dkms/nvidia/555.52.04/build/make.log
DKMS make.log for nvidia-555.52.04 for kernel 6.9.4-arch1-1 (x86_64)
Thu Jun 13 06:52:54 PM MDT 2024
make[1]: Entering directory '/usr/lib/modules/6.9.4-arch1-1/build'
  SYMLINK /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-kernel.o
  SYMLINK /var/lib/dkms/nvidia/555.52.04/build/nvidia-modeset/nv-modeset-kernel.o
 CONFTEST: hash__remap_4k_pfn
 CONFTEST: set_pages_uc
 CONFTEST: list_is_first
 CONFTEST: set_memory_uc
 CONFTEST: set_memory_array_uc
 CONFTEST: set_pages_array_uc
 CONFTEST: ioremap_cache
 CONFTEST: ioremap_wc
 CONFTEST: ioremap_driver_hardened
 CONFTEST: ioremap_driver_hardened_wc
 CONFTEST: ioremap_cache_shared
 CONFTEST: pci_get_domain_bus_and_slot
 CONFTEST: get_num_physpages
 CONFTEST: pde_data
 CONFTEST: xen_ioemu_inject_msi
 CONFTEST: phys_to_dma
 CONFTEST: get_dma_ops
 CONFTEST: dma_attr_macros
 CONFTEST: dma_map_page_attrs
 CONFTEST: write_cr4
 CONFTEST: of_find_node_by_phandle
 CONFTEST: of_node_to_nid
 CONFTEST: pnv_pci_get_npu_dev
 CONFTEST: of_get_ibm_chip_id
 CONFTEST: pci_bus_address
 CONFTEST: pci_stop_and_remove_bus_device
 CONFTEST: pci_rebar_get_possible_sizes
 CONFTEST: wait_for_random_bytes
 CONFTEST: register_cpu_notifier
 CONFTEST: cpuhp_setup_state
 CONFTEST: dma_map_resource
 CONFTEST: get_backlight_device_by_name
 CONFTEST: timer_setup
 CONFTEST: pci_enable_msix_range
 CONFTEST: kernel_read_has_pointer_pos_arg
 CONFTEST: kernel_write_has_pointer_pos_arg
 CONFTEST: dma_direct_map_resource
 CONFTEST: tegra_get_platform
 CONFTEST: tegra_bpmp_send_receive
 CONFTEST: flush_cache_all
 CONFTEST: vmf_insert_pfn
 CONFTEST: jiffies_to_timespec
 CONFTEST: ktime_get_raw_ts64
 CONFTEST: ktime_get_real_ts64
 CONFTEST: full_name_hash
 CONFTEST: pci_enable_atomic_ops_to_root
 CONFTEST: vga_tryget
 CONFTEST: cc_platform_has
 CONFTEST: seq_read_iter
 CONFTEST: follow_pfn
 CONFTEST: drm_gem_object_get
 CONFTEST: drm_gem_object_put_unlocked
 CONFTEST: add_memory_driver_managed
 CONFTEST: device_property_read_u64
 CONFTEST: devm_of_platform_populate
 CONFTEST: of_dma_configure
 CONFTEST: of_property_count_elems_of_size
 CONFTEST: of_property_read_variable_u8_array
 CONFTEST: of_property_read_variable_u32_array
 CONFTEST: i2c_new_client_device
 CONFTEST: i2c_unregister_device
 CONFTEST: of_get_named_gpio
 CONFTEST: devm_gpio_request_one
 CONFTEST: gpio_direction_input
 CONFTEST: gpio_direction_output
 CONFTEST: gpio_get_value
 CONFTEST: gpio_set_value
 CONFTEST: gpio_to_irq
 CONFTEST: icc_get
 CONFTEST: icc_put
 CONFTEST: icc_set_bw
 CONFTEST: dma_buf_export_args
 CONFTEST: dma_buf_ops_has_kmap
 CONFTEST: dma_buf_ops_has_kmap_atomic
 CONFTEST: dma_buf_ops_has_map
 CONFTEST: dma_buf_ops_has_map_atomic
 CONFTEST: dma_buf_has_dynamic_attachment
 CONFTEST: dma_buf_attachment_has_peer2peer
 CONFTEST: dma_set_mask_and_coherent
 CONFTEST: devm_clk_bulk_get_all
 CONFTEST: get_task_ioprio
 CONFTEST: mdev_set_iommu_device
 CONFTEST: offline_and_remove_memory
 CONFTEST: stack_trace
 CONFTEST: crypto_tfm_ctx_aligned
 CONFTEST: wait_on_bit_lock_argument_count
 CONFTEST: radix_tree_empty
 CONFTEST: radix_tree_replace_slot
 CONFTEST: pnv_npu2_init_context
 CONFTEST: cpumask_of_node
 CONFTEST: ioasid_get
 CONFTEST: mm_pasid_drop
 CONFTEST: mmget_not_zero
 CONFTEST: mmgrab
 CONFTEST: iommu_sva_bind_device_has_drvdata_arg
 CONFTEST: vm_fault_to_errno
 CONFTEST: find_next_bit_wrap
 CONFTEST: iommu_is_dma_domain
 CONFTEST: acpi_video_backlight_use_native
 CONFTEST: drm_dev_unref
 CONFTEST: drm_reinit_primary_mode_group
 CONFTEST: get_user_pages_remote
 CONFTEST: get_user_pages
 CONFTEST: pin_user_pages_remote
 CONFTEST: pin_user_pages
 CONFTEST: drm_gem_object_lookup
 CONFTEST: drm_atomic_state_ref_counting
 CONFTEST: drm_driver_has_gem_prime_res_obj
 CONFTEST: drm_atomic_helper_connector_dpms
 CONFTEST: drm_connector_funcs_have_mode_in_name
 CONFTEST: drm_connector_has_vrr_capable_property
 CONFTEST: drm_framebuffer_get
 CONFTEST: drm_dev_put
 CONFTEST: drm_format_num_planes
 CONFTEST: drm_connector_for_each_possible_encoder
 CONFTEST: drm_rotation_available
 CONFTEST: drm_vma_offset_exact_lookup_locked
 CONFTEST: nvhost_dma_fence_unpack
 CONFTEST: dma_fence_set_error
 CONFTEST: fence_set_error
 CONFTEST: sync_file_get_fence
 CONFTEST: drm_aperture_remove_conflicting_pci_framebuffers
 CONFTEST: drm_fbdev_generic_setup
 CONFTEST: drm_connector_attach_hdr_output_metadata_property
 CONFTEST: drm_helper_crtc_enable_color_mgmt
 CONFTEST: drm_crtc_enable_color_mgmt
 CONFTEST: drm_atomic_helper_legacy_gamma_set
 CONFTEST: is_export_symbol_gpl_of_node_to_nid
 CONFTEST: is_export_symbol_gpl_sme_active
 CONFTEST: is_export_symbol_present_swiotlb_map_sg_attrs
 CONFTEST: is_export_symbol_present_swiotlb_dma_ops
 CONFTEST: is_export_symbol_present___close_fd
 CONFTEST: is_export_symbol_present_close_fd
 CONFTEST: is_export_symbol_present_get_unused_fd
 CONFTEST: is_export_symbol_present_get_unused_fd_flags
 CONFTEST: is_export_symbol_present_nvhost_get_default_device
 CONFTEST: is_export_symbol_present_nvhost_syncpt_unit_interface_get_byte_offset
 CONFTEST: is_export_symbol_present_nvhost_syncpt_unit_interface_get_aperture
 CONFTEST: is_export_symbol_present_tegra_dce_register_ipc_client
 CONFTEST: is_export_symbol_present_tegra_dce_unregister_ipc_client
 CONFTEST: is_export_symbol_present_tegra_dce_client_ipc_send_recv
 CONFTEST: is_export_symbol_present_dram_clk_to_mc_clk
 CONFTEST: is_export_symbol_present_get_dram_num_channels
 CONFTEST: is_export_symbol_present_tegra_dram_types
 CONFTEST: is_export_symbol_present_pxm_to_node
 CONFTEST: is_export_symbol_present_screen_info
 CONFTEST: is_export_symbol_gpl_screen_info
 CONFTEST: is_export_symbol_present_i2c_bus_status
 CONFTEST: is_export_symbol_present_tegra_fuse_control_read
 CONFTEST: is_export_symbol_present_tegra_get_platform
 CONFTEST: is_export_symbol_present_pci_find_host_bridge
 CONFTEST: is_export_symbol_present_tsec_comms_send_cmd
 CONFTEST: is_export_symbol_present_tsec_comms_set_init_cb
 CONFTEST: is_export_symbol_present_tsec_comms_clear_init_cb
 CONFTEST: is_export_symbol_present_tsec_comms_alloc_mem_from_gscco
 CONFTEST: is_export_symbol_present_tsec_comms_free_gscco_mem
 CONFTEST: is_export_symbol_present_memory_block_size_bytes
 CONFTEST: is_export_symbol_present_tegra_platform_is_fpga
 CONFTEST: is_export_symbol_present_tegra_platform_is_sim
 CONFTEST: crypto
 CONFTEST: is_export_symbol_present_follow_pte
 CONFTEST: is_export_symbol_present_int_active_memcg
 CONFTEST: is_export_symbol_present_migrate_vma_setup
 CONFTEST: dma_ops
 CONFTEST: swiotlb_dma_ops
 CONFTEST: noncoherent_swiotlb_dma_ops
 CONFTEST: vm_fault_has_address
 CONFTEST: vm_insert_pfn_prot
 CONFTEST: vmf_insert_pfn_prot
 CONFTEST: vm_ops_fault_removed_vma_arg
 CONFTEST: kmem_cache_has_kobj_remove_work
 CONFTEST: sysfs_slab_unlink
 CONFTEST: proc_ops
 CONFTEST: timespec64
 CONFTEST: vmalloc_has_pgprot_t_arg
 CONFTEST: mm_has_mmap_lock
 CONFTEST: pci_channel_state
 CONFTEST: pci_dev_has_ats_enabled
 CONFTEST: remove_memory_has_nid_arg
 CONFTEST: add_memory_driver_managed_has_mhp_flags_arg
 CONFTEST: num_registered_fb
 CONFTEST: pci_driver_has_driver_managed_dma
 CONFTEST: vm_area_struct_has_const_vm_flags
 CONFTEST: memory_failure_has_trapno_arg
 CONFTEST: foll_longterm_present
 CONFTEST: bus_type_has_iommu_ops
 CONFTEST: backing_dev_info
 CONFTEST: mm_context_t
 CONFTEST: vm_fault_t
 CONFTEST: mmu_notifier_ops_invalidate_range
 CONFTEST: mmu_notifier_ops_arch_invalidate_secondary_tlbs
 CONFTEST: migrate_vma_added_flags
 CONFTEST: migrate_device_range
 CONFTEST: handle_mm_fault_has_mm_arg
 CONFTEST: handle_mm_fault_has_pt_regs_arg
 CONFTEST: mempolicy_has_unified_nodes
 CONFTEST: mempolicy_has_home_node
 CONFTEST: mpol_preferred_many_present
 CONFTEST: mmu_interval_notifier
 CONFTEST: drm_bus_present
 CONFTEST: drm_bus_has_bus_type
 CONFTEST: drm_bus_has_get_irq
 CONFTEST: drm_bus_has_get_name
 CONFTEST: drm_driver_has_device_list
 CONFTEST: drm_driver_has_legacy_dev_list
 CONFTEST: drm_driver_has_set_busid
 CONFTEST: drm_crtc_state_has_connectors_changed
 CONFTEST: drm_init_function_args
 CONFTEST: drm_helper_mode_fill_fb_struct
 CONFTEST: drm_master_drop_has_from_release_arg
 CONFTEST: drm_driver_unload_has_int_return_type
 CONFTEST: drm_atomic_helper_crtc_destroy_state_has_crtc_arg
 CONFTEST: drm_atomic_helper_plane_destroy_state_has_plane_arg
 CONFTEST: drm_mode_object_find_has_file_priv_arg
 CONFTEST: dma_buf_owner
 CONFTEST: drm_connector_list_iter
 CONFTEST: drm_atomic_helper_swap_state_has_stall_arg
 CONFTEST: drm_driver_prime_flag_present
 CONFTEST: drm_gem_object_has_resv
 CONFTEST: drm_crtc_state_has_async_flip
 CONFTEST: drm_crtc_state_has_pageflip_flags
 CONFTEST: drm_crtc_state_has_vrr_enabled
 CONFTEST: drm_format_modifiers_present
 CONFTEST: drm_vma_node_is_allowed_has_tag_arg
 CONFTEST: drm_vma_offset_node_has_readonly
 CONFTEST: drm_display_mode_has_vrefresh
 CONFTEST: drm_driver_master_set_has_int_return_type
 CONFTEST: drm_driver_has_gem_free_object
 CONFTEST: drm_prime_pages_to_sg_has_drm_device_arg
 CONFTEST: drm_driver_has_gem_prime_callbacks
 CONFTEST: drm_crtc_atomic_check_has_atomic_state_arg
 CONFTEST: drm_gem_object_vmap_has_map_arg
 CONFTEST: drm_plane_atomic_check_has_atomic_state_arg
 CONFTEST: drm_device_has_pdev
 CONFTEST: drm_crtc_state_has_no_vblank
 CONFTEST: drm_mode_config_has_allow_fb_modifiers
 CONFTEST: drm_has_hdr_output_metadata
 CONFTEST: dma_resv_add_fence
 CONFTEST: dma_resv_reserve_fences
 CONFTEST: reservation_object_reserve_shared_has_num_fences_arg
 CONFTEST: drm_connector_has_override_edid
 CONFTEST: drm_master_has_leases
 CONFTEST: drm_file_get_master
 CONFTEST: drm_modeset_lock_all_end
 CONFTEST: drm_connector_lookup
 CONFTEST: drm_connector_put
 CONFTEST: drm_driver_has_dumb_destroy
 CONFTEST: fence_ops_use_64bit_seqno
 CONFTEST: drm_aperture_remove_conflicting_pci_framebuffers_has_driver_arg
 CONFTEST: drm_mode_create_dp_colorspace_property_has_supported_colorspaces_arg
 CONFTEST: drm_syncobj_features_present
 CONFTEST: drm_unlocked_ioctl_flag_present
 CONFTEST: dom0_kernel_present
 CONFTEST: nvidia_vgpu_kvm_build
 CONFTEST: nvidia_grid_build
 CONFTEST: nvidia_grid_csp_build
 CONFTEST: pm_runtime_available
 CONFTEST: pci_class_multimedia_hd_audio
 CONFTEST: drm_available
 CONFTEST: vfio_pci_core_available
 CONFTEST: mdev_available
 CONFTEST: cmd_uphy_display_port_init
 CONFTEST: cmd_uphy_display_port_off
 CONFTEST: memory_failure_mf_sw_simulated_defined
 CONFTEST: drm_atomic_available
 CONFTEST: is_export_symbol_gpl_refcount_inc
 CONFTEST: is_export_symbol_gpl_refcount_dec_and_test
 CONFTEST: drm_alpha_blending_available
 CONFTEST: is_export_symbol_present_drm_gem_prime_fd_to_handle
 CONFTEST: is_export_symbol_present_drm_gem_prime_handle_to_fd
 CONFTEST: ib_peer_memory_symbols
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-pci.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-dmabuf.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-nano-timer.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-acpi.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-cray.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-dma.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-i2c.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-mmap.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-p2p.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-pat.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-procfs.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-usermap.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-vm.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-vtophys.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/os-interface.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/os-mlock.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/os-pci.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/os-registry.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/os-usermap.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-modeset-interface.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-pci-table.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-kthread-q.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-memdbg.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-report-err.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-rsync.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-msi.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps-imex.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv_uvm_interface.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_aead.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_aead.c:41:5: warning: no previous prototype for ‘libspdm_aead_prealloc’ [-Wmissing-prototypes]
   41 | int libspdm_aead_prealloc(void **context, char const *alg)
      |     ^~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_aead.c:171:5: warning: no previous prototype for ‘libspdm_aead_prealloced’ [-Wmissing-prototypes]
  171 | int libspdm_aead_prealloced(void *context,
      |     ^~~~~~~~~~~~~~~~~~~~~~~
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_ecc.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_hkdf.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_rand.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_shash.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_rsa.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_aead_aes_gcm.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_sha.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_hmac_sha.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_hkdf_sha.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_ec.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_x509.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_rsa_ext.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nvlink_linux.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/nvlink_caps.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/linux_nvswitch.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps-imex.c:53:17: warning: no previous prototype for ‘nv_caps_imex_channel_get’ [-Wmissing-prototypes]
   53 | int NV_API_CALL nv_caps_imex_channel_get(int fd)
      |                 ^~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps-imex.c:90:17: warning: no previous prototype for ‘nv_caps_imex_channel_count’ [-Wmissing-prototypes]
   90 | int NV_API_CALL nv_caps_imex_channel_count(void)
      |                 ^~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps-imex.c:95:17: warning: no previous prototype for ‘nv_caps_imex_init’ [-Wmissing-prototypes]
   95 | int NV_API_CALL nv_caps_imex_init(void)
      |                 ^~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps-imex.c:146:18: warning: no previous prototype for ‘nv_caps_imex_exit’ [-Wmissing-prototypes]
  146 | void NV_API_CALL nv_caps_imex_exit(void)
      |                  ^~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-procfs.c:695:1: warning: no previous prototype for ‘exercise_error_forwarding_va’ [-Wmissing-prototypes]
  695 | exercise_error_forwarding_va(
      | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:395:6: warning: no previous prototype for ‘nv_init_ibmnpu_info’ [-Wmissing-prototypes]
  395 | void nv_init_ibmnpu_info(nv_state_t *nv)
      |      ^~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:399:6: warning: no previous prototype for ‘nv_destroy_ibmnpu_info’ [-Wmissing-prototypes]
  399 | void nv_destroy_ibmnpu_info(nv_state_t *nv)
      |      ^~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:403:5: warning: no previous prototype for ‘nv_init_ibmnpu_devices’ [-Wmissing-prototypes]
  403 | int nv_init_ibmnpu_devices(nv_state_t *nv)
      |     ^~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:408:6: warning: no previous prototype for ‘nv_unregister_ibmnpu_devices’ [-Wmissing-prototypes]
  408 | void nv_unregister_ibmnpu_devices(nv_state_t *nv)
      |      ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:428:5: warning: no previous prototype for ‘nv_get_ibmnpu_chip_id’ [-Wmissing-prototypes]
  428 | int nv_get_ibmnpu_chip_id(nv_state_t *nv)
      |     ^~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:437:6: warning: no previous prototype for ‘nv_ibmnpu_cache_flush_numa_region’ [-Wmissing-prototypes]
  437 | void nv_ibmnpu_cache_flush_numa_region(nv_state_t *nv)
      |      ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/os-interface.c:374:7: warning: no previous prototype for ‘os_mem_copy_custom’ [-Wmissing-prototypes]
  374 | void *os_mem_copy_custom(
      |       ^~~~~~~~~~~~~~~~~~
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/procfs_nvswitch.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia/i2c_nvswitch.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_ats_sva.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-dma.c:293:6: warning: no previous prototype for ‘nv_load_dma_map_scatterlist’ [-Wmissing-prototypes]
  293 | void nv_load_dma_map_scatterlist(
      |      ^~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-mmap.c:303:5: warning: conflicting types for ‘nv_encode_caching’ due to enum/integer mismatch; have ‘int(pgprot_t *, NvU32,  nv_memory_type_t)’ {aka ‘int(struct pgprot *, unsigned int,  nv_memory_type_t)’} [-Wenum-int-mismatch]
  303 | int nv_encode_caching(
      |     ^~~~~~~~~~~~~~~~~
In file included from /var/lib/dkms/nvidia/555.52.04/build/common/inc/nv-linux.h:1781,
                 from /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-mmap.c:27:
/var/lib/dkms/nvidia/555.52.04/build/common/inc/nv-proto.h:44:13: note: previous declaration of ‘nv_encode_caching’ with type ‘int(pgprot_t *, NvU32,  NvU32)’ {aka ‘int(struct pgprot *, unsigned int,  unsigned int)’}
   44 | int         nv_encode_caching           (pgprot_t *, NvU32, NvU32);
      |             ^~~~~~~~~~~~~~~~~
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_conf_computing.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-dma.c:489:23: warning: no previous prototype for ‘nv_dma_unmap_sgt’ [-Wmissing-prototypes]
  489 | NV_STATUS NV_API_CALL nv_dma_unmap_sgt(
      |                       ^~~~~~~~~~~~~~~~
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_sec2_test.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_maxwell_sec2.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_hopper_sec2.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_common.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_linux.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/nvstatus.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv.c:1236:23: warning: no previous prototype for ‘nv_get_num_dpaux_instances’ [-Wmissing-prototypes]
 1236 | NV_STATUS NV_API_CALL nv_get_num_dpaux_instances(nv_state_t *nv, NvU32 *num_instances)
      |                       ^~~~~~~~~~~~~~~~~~~~~~~~~~
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/nvCpuUuid.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/nv-kthread-q.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/nv-kthread-q-selftest.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_tools.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_global.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_gpu.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_gpu_isr.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps.c:272:5: warning: no previous prototype for ‘nv_cap_procfs_init’ [-Wmissing-prototypes]
  272 | int nv_cap_procfs_init(void)
      |     ^~~~~~~~~~~~~~~~~~
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_procfs.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_va_space.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_va_space_mm.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_gpu_semaphore.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_mem.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_rm_mem.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_channel.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_lock.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_hal.o
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_processors.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nvlink_linux.c:313:12: warning: no previous prototype for ‘nvlink_core_init’ [-Wmissing-prototypes]
  313 | int __init nvlink_core_init(void)
      |            ^~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nvlink_linux.c:389:6: warning: no previous prototype for ‘nvlink_core_exit’ [-Wmissing-prototypes]
  389 | void nvlink_core_exit(void)
      |      ^~~~~~~~~~~~~~~~
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_range_tree.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/linux_nvswitch.c:1707:1: warning: no previous prototype for ‘nvswitch_init’ [-Wmissing-prototypes]
 1707 | nvswitch_init
      | ^~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/linux_nvswitch.c:1792:1: warning: no previous prototype for ‘nvswitch_exit’ [-Wmissing-prototypes]
 1792 | nvswitch_exit
      | ^~~~~~~~~~~~~
  CC [M]  /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_rb_tree.o
In file included from ./include/linux/efi.h:23,
                 from /var/lib/dkms/nvidia/555.52.04/build/common/inc/nv-linux.h:217,
                 from /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_linux.h:40,
                 from /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_linux.c:24:
./include/linux/pstore.h:77:9: internal compiler error: Illegal instruction
   77 |         char                    *buf;
      |         ^~~~
0x1f84ca6 internal_error(char const*, ...)
	???:0
0x1fea95b line_maps::get_or_create_combined_loc(unsigned int, source_range, void*, unsigned int)
	???:0
0x2000db7 _cpp_lex_direct
	???:0
0x2008ea0 _cpp_lex_token
	???:0
0x7e8f10 c_lex_with_flags(tree_node**, unsigned int*, unsigned char*, int)
	???:0
0x7499a9 c_parser_declspecs(c_parser*, c_declspecs*, bool, bool, bool, bool, bool, bool, bool, c_lookahead_kind)
	???:0
0x748ddf c_parser_declarator(c_parser*, bool, c_dtr_syn, bool*)
	???:0
0x74aebf c_parser_declspecs(c_parser*, c_declspecs*, bool, bool, bool, bool, bool, bool, bool, c_lookahead_kind)
	???:0
0x76ecd5 c_parse_file()
	???:0
0x7e4b95 c_common_parse_file()
	???:0
Please submit a full bug report, with preprocessed source (by using -freport-bug).
Please include the complete backtrace with any bug report.
See <https://gitlab.archlinux.org/archlinux/packaging/packages/gcc/-/issues> for instructions.
make[3]: *** [scripts/Makefile.build:244: /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_linux.o] Error 1
make[3]: *** Waiting for unfinished jobs....
/var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/nv-kthread-q-selftest.c:84:6: warning: no previous prototype for ‘on_nvq_assert’ [-Wmissing-prototypes]
   84 | void on_nvq_assert(void)
      |      ^~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_tools.c:1323:6: warning: no previous prototype for ‘uvm_tools_record_access_counter’ [-Wmissing-prototypes]
 1323 | void uvm_tools_record_access_counter(uvm_va_space_t *va_space,
      |      ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_tools.c:2822:5: warning: no previous prototype for ‘uvm_tools_init’ [-Wmissing-prototypes]
 2822 | int uvm_tools_init(dev_t uvm_base_dev)
      |     ^~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_tools.c:2883:6: warning: no previous prototype for ‘uvm_tools_exit’ [-Wmissing-prototypes]
 2883 | void uvm_tools_exit(void)
      |      ^~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_gpu_semaphore.c:510:6: warning: no previous prototype for ‘tracking_semaphore_uses_mutex’ [-Wmissing-prototypes]
  510 | bool tracking_semaphore_uses_mutex(uvm_gpu_tracking_semaphore_t *tracking_semaphore)
      |      ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
make[2]: *** [/usr/lib/modules/6.9.4-arch1-1/build/Makefile:1919: /var/lib/dkms/nvidia/555.52.04/build] Error 2
make[1]: *** [Makefile:240: __sub-make] Error 2
make[1]: Leaving directory '/usr/lib/modules/6.9.4-arch1-1/build'
make: *** [Makefile:89: modules] Error 2

Last edited by jslay (2024-06-14 20:32:41)

#2 2024-06-14 06:20:04

seth
Member
From: Won't reply 2 private help req
Registered: 2012-09-03
Posts: 75,939

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

./include/linux/pstore.h:77:9: internal compiler error: Illegal instruction

We've been here before: https://bbs.archlinux.org/viewtopic.php … 0#p2173290

I have to go through a cycle of trying to get the DKMS module to rebuild and work.

If the behavior is non-deterministic, you should make sure you are not running out of memory (do you have a swapfile?) and run memtest86+ overnight.
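
A minimal sketch of how the OOM part of that check could be done from a shell. The helper name `count_oom_events` and the grep patterns are assumptions for illustration (they follow the usual wording of kernel OOM messages), not something taken from this thread.

```shell
# count_oom_events: read kernel log text on stdin and print how many lines
# look like OOM-killer activity. The patterns are an assumption based on
# the usual wording of kernel OOM messages.
count_oom_events() {
    # grep -c prints 0 and exits non-zero when nothing matches; "|| true"
    # keeps the function's exit status clean in that case.
    grep -ciE 'out of memory|oom-killer|oom_reaper' || true
}

# Possible usage on the affected machine (not run here):
#   journalctl -k -b | count_oom_events   # kernel messages of the current boot
#   swapon --show                         # lists active swap devices/files, if any
```

A count of 0 right after a failed build would suggest the failure was not caused by the OOM killer, although it says nothing about faulty RAM, which is what the memtest86+ run is for.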

#3 2024-06-14 09:43:51

jslay
Member
Registered: 2024-05-20
Posts: 8

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

As I have stated, I have tried every version of gcc that seems reasonable within the last 3 months. I have no other version of gcc installed but the one from pacman. Linking me to the same thread that says I have the wrong gcc version is of no help. The system is a 13900K with 64GB of RAM. It is not an OOM issue.

I have 0 issues with anything else. As I am a developer, I am compiling golang and java constantly on this machine with no issues. I highly doubt I have an issue with my memory.

The deterministic thing here is that it fails to compile 550 every time a kernel upgrade happens, and only this ever has issues. I have several other DKMS modules that compile with no issue. I can compile 545 with no issue.

#4 2024-06-14 10:14:00

seth
Member
From: Won't reply 2 private help req
Registered: 2012-09-03
Posts: 75,939

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

The comment in that thread was that you're off topic because you're hitting a completely different problem than the OP, one that matches the other thread linked there.
You also suggested that you're not getting this with gcc 13, which would make some sort of sense because it's a compiler internal error.

The sublink also points to the gcc bugtracker, where you'd therefore have to report that.

NB that "I've RAM so it's not OOM" is a non sequitur: if things go wrong, the system can allocate all your (remaining) RAM in a split second.

That's aside from the issues you can get with zram or unbacked zswap.

#5 2024-06-14 20:28:46

jslay
Member
Registered: 2024-05-20
Posts: 8

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

Yes, this is what I had originally stated in the other thread, but this was just one of those lucky runs apparently.

I cannot seem to get it to compile consistently with any version.

I guess this is not solvable. I've given up at this point. I cannot keep spending hours trying to find an issue with a beta driver anymore. I tried to report what I'm finding to keep others from going down the same rabbit holes, but this is not on me anymore. I will just go back to 545, as that compiles with 0 issues every single time (again, not a RAM issue).

My Wayland support for my software will just have to wait a few more months until this is solved by someone more knowledgeable.

Last edited by jslay (2024-06-14 20:31:37)

#6 2024-06-14 20:46:12

seth
Member
From: Won't reply 2 private help req
Registered: 2012-09-03
Posts: 75,939

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

Afaiu it's not only the 555xx beta driver but also the 550xx driver and AGAIN: this is NOT a bug with the driver, the bug would be in gcc.

Except it's actually different from the SIGSEGV in https://bbs.archlinux.org/viewtopic.php?id=296086 and a remarkably consistent SIGILL - except it also doesn't always happen, but when it does, it's always in this context (and unrelated to the GCC version)?

Does the CPU overheat?
Can you prevent it by limiting the parallel jobs (to leave some cores alone)?

Edit: or did you forget to install/load the microcode patches for your CPU?
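
One way the microcode question could be checked, as a sketch: the helper name `microcode_revision` is hypothetical, and it only assumes the usual `microcode : 0x...` lines found in /proc/cpuinfo on x86.

```shell
# microcode_revision: print the first "microcode" value found in
# cpuinfo-style text on stdin (the name of this helper is made up).
microcode_revision() {
    awk -F': *' '/^microcode/ { print $2; exit }'
}

# Possible usage on the affected machine (not run here):
#   microcode_revision < /proc/cpuinfo     # revision the CPU is currently running
#   journalctl -k -b | grep -i microcode   # kernel messages about microcode loading
```

The printed revision would still have to be compared by hand against whatever revision the BIOS or the intel-ucode package is supposed to provide.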

Last edited by seth (2024-06-14 21:14:33)

#7 2024-06-25 22:28:49

Wereii
Member
Registered: 2024-06-25
Posts: 8

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

I hope this is not necrobumping yet, but I have been hitting problems with nvidia dkms on a persistent basis too, and I also have a 13900K(F) with 64G of RAM (DDR5 in my case). It is a SIGILL in my current case as well, but as far as I remember even the specific error was not stable.

My fix for now has been to edit /var/lib/dkms/nvidia/<ver>/source/dkms.conf and in the line starting with

MAKE[0]=

replace

-j`nproc`

with -j8 or a smaller number, then run dkms install manually. This has always been enough to get the dkms build through.
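
The manual edit described above could be scripted roughly like this. It is only a sketch: the function name `limit_dkms_jobs` is made up, it assumes GNU sed (for `-i`), and it assumes the MAKE[0] line contains the literal text -j`nproc` as quoted in this post.

```shell
# limit_dkms_jobs CONF JOBS: on the line starting with MAKE[0]=, replace the
# literal -j`nproc` with -jJOBS. Assumes GNU sed and the backtick form
# quoted above; a dkms.conf that does not match is left unchanged.
limit_dkms_jobs() {
    conf=$1
    jobs=$2
    # Backticks inside single quotes are literal, so sed sees -j`nproc` as-is.
    sed -i '/^MAKE\[0\]=/ s/-j`nproc`/-j'"$jobs"'/' "$conf"
}

# Possible usage as root, with <ver> standing for the installed driver version:
#   limit_dkms_jobs /var/lib/dkms/nvidia/<ver>/source/dkms.conf 8
#   dkms install nvidia/<ver> -k "$(uname -r)"
```

The change would probably have to be reapplied after a driver package update, since the update is likely to ship a fresh dkms.conf.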

These errors occurred more often when I had a slight under-clock configured. Since then I have reduced it to almost nothing (any other benchmark, tool or game has been stable even with the previous values, except for nvidia dkms) and it has not happened again, except for today with nvidia/550.90.07 @ 6.9.6-arch1-1.

Last edited by Wereii (2024-06-25 22:30:11)

#8 2024-06-26 07:18:09

jslay
Member
Registered: 2024-05-20
Posts: 8

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

The newer drivers in the last week or so (both normal and beta) seem to have resolved my issue. I can consistently build them now.

I double-checked everything. I last updated my BIOS on June 7th, with an update released by the manufacturer on June 6th, so I am fairly certain I have had the latest microcode. I was experiencing instability from the recent issues that Intel had with these chips (usually the entire machine would hang when compiling shaders or something when trying to play a game), and I haven't experienced those issues since.

As to why I couldn't build the drivers consistently before this last week? Not sure. GCC was not updated.

I had to go back to beta (555.52.04-1 atm) as 550 was causing me further issues in Wayland once I had updated to KDE Plasma 6.1

I am not running any overclock beyond the XMP-1 profile for my RAM. CPU clocking has not been modified.

I am also on 6.9.6-arch1-1

gcc 14.1.1+r58+gfc9fb69ad62-1
gcc-libs 14.1.1+r58+gfc9fb69ad62-1
lib32-gcc-libs 14.1.1+r58+gfc9fb69ad62-1
lib32-nvidia-utils-beta 555.52.04-1
nvidia-beta-dkms 555.52.04-1
nvidia-utils-beta 555.52.04-2

Last edited by jslay (2024-06-26 07:26:35)


#9 2024-06-26 10:09:33

Wereii
Member
Registered: 2024-06-25
Posts: 8

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

I have removed any custom BIOS settings by resetting my MSI board to defaults. I might not have the latest BIOS, but I verified that intel-microcode is packed into the initramfs (lsinitcpio --early /boot/initramfs-linux.img contains GenuineIntel.bin).
Running the dkms build manually, the build starts to fail around 29-30 jobs (cores) with SIGILL.
The place where the SIGILL is raised seems unstable.
This is with intel-pstate powersave governor, the CPU package does not climb over 88C.
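One way to probe where the build stops failing is to retry it with decreasing job counts. The sketch below assumes the build command accepts a trailing -jN argument; first_stable_jobs and fake_build are hypothetical names, and fake_build only simulates a build that fails above 28 jobs.

```shell
# first_stable_jobs is a hypothetical helper: it tries a build command with
# decreasing -j values and prints the first count that exits successfully.
first_stable_jobs() {
    # $1 = starting job count, $2 = smallest count to try, $3 = step
    # remaining args = build command; "-j<count>" is appended as the last arg
    fsj_jobs=$1; fsj_min=$2; fsj_step=$3
    shift 3
    while [ "$fsj_jobs" -ge "$fsj_min" ]; do
        if "$@" "-j$fsj_jobs" >/dev/null 2>&1; then
            echo "$fsj_jobs"
            return 0
        fi
        fsj_jobs=$((fsj_jobs - fsj_step))
    done
    return 1
}

# Stand-in for a real build: "fails" above 28 jobs, purely for demonstration.
fake_build() {
    fb_n=${1#-j}
    [ "$fb_n" -le 28 ]
}

first_stable_jobs 32 4 4 fake_build
# Real use might look like: first_stable_jobs 32 4 4 make -C /path/to/tree
```

Because the failures are intermittent, a single successful run at some job count does not prove that count is stable.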

I will look into BIOS updates. I am also interested in whether I can cause these errors when compiling different projects, but either way my hunch for now is that when both E-Cores and P-Cores are engaged and some specific combination of instructions is executed, it shits itself, though rasdaemon shows no errors.


#10 2024-06-26 11:12:59

Wereii
Member
Registered: 2024-06-25
Posts: 8

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

Updated the BIOS to the latest non-beta (for my MSI PRO Z790-A WIFI that is 7E07vAB, released 2024-05-14) and the crash seems to be gone; even with all 32 cores the build finishes without errors. Even with a few tweaks (lowered PL1 and XMP enabled) it does not crash.

Edit: Never mind, the current nvidia-dkms update triggered a dkms build and that has failed the same way again.

Edit2:
Testing with phoronix-test-suite benchmark build-linux-kernel I can trigger the SIGILLs too. For now my hunch is power spikes: playing with the CPU frequency, I do seem to get some stability when reducing the max frequency by 500 MHz (to 5000 MHz).

Does anyone here know if power spikes (that don't cause total system crash) would get reported somehow (waving at rasdaemon)?

Last edited by Wereii (2024-07-01 10:56:25)


#11 2024-07-01 15:22:59

Wereii
Member
Registered: 2024-06-25
Posts: 8

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

After downgrading gcc and gcc-libs to 13.2.1 I still got the SIGILLs, but rasdaemon also caught an MCE event: Internal Parity Error on cpu 4. This was shortly followed by a kernel oops (something along the lines of cpu4 taking too long) and the system ground to a halt (though that might have been caused by downgrading gcc-libs, as some stuff seems to depend on it at runtime).
Either way, I can't seem to point out a single deciding factor on what is causing these errors bar having a faulty CPU or other HW.
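To see which CPUs report machine checks, the kernel log can be summarised per CPU. A minimal sketch, assuming MCE lines look like the "mce: [Hardware Error]: CPU <n>: ..." fragment quoted later in this thread; mce_cpus is a hypothetical helper name.

```shell
mce_cpus() {
    # Reads kernel log text on stdin and prints "<cpu> <count>" for every CPU
    # with at least one "mce: [Hardware Error]: CPU <n>:" line.
    sed -n 's/.*mce: \[Hardware Error\]: CPU \([0-9][0-9]*\):.*/\1/p' |
        sort -n | uniq -c | awk '{print $2, $1}'
}

# Demo on the single line fragment quoted in this thread.
printf '%s\n' 'mce: [Hardware Error]: CPU 8: Machine Check: 0 Bank 0: ...' | mce_cpus

# Against the live kernel log:
#   journalctl -k | mce_cpus
# rasdaemon's own view of recorded events:
#   ras-mc-ctl --summary
#   ras-mc-ctl --errors
```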

For now I am testing the beta BIOS for my mobo. The beta has the option to select Intel's default power options (instead of the manufacturer-specific ones, MSI's in my case).

Last edited by Wereii (2024-07-01 15:23:31)


#12 2024-07-02 23:09:42

ubergarm
Member
Registered: 2024-07-02
Posts: 3

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

@Wereii - thanks for this - you saved me today!

It was my first time doing a

pacman -Syu

since installing the hard drive in a new PC build. It actually froze up so hard that I had to hold the power button. It took a minute to fix up the missing vmlinuz-linux boot files, but after booting I couldn't startx. I reinstalled all the packages and still no dice. I knew the nvidia driver modules weren't loading and finally noticed that

pacman -Syu nvidia-dkms  # 555.58-2

was quietly throwing "Error! Bad return status for module build on kernel: 6.9.7-arch-1 (x86_64)." I tried a few times and only once did it throw a few mce errors like: "mce: [Hardware Error]: CPU 8: Machine Check: 0 Bank 0: ..."

MoBo: MSI Pro Z690-A WiFi
CPU: i9-14900K
GPU: 3090TI
RAM: 2x 32GB sticks of DDR5 5200

I first tried some BIOS settings like turning off XMP to underclock the RAM. Also updated the BIOS firmware to latest version released 2024-04-11. Still didn't help.

I saw your note above and I did a dirty trick of backing up `/usr/bin/nproc` and making a bash script called nproc that just did `echo 8` and was able to get pacman to install nvidia-dkms and the hooks properly rebuilt the initramfs and I was :gucci: again for xorg!
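A less invasive variant of the same trick is to leave /usr/bin/nproc alone and put a shim earlier in PATH. The sketch below uses a temporary directory. Whether a pacman hook or a sudo-launched dkms run would actually inherit that PATH is an assumption that would need checking on the system in question.

```shell
# Create a throwaway directory holding a fake nproc that always prints 8.
shim_dir=$(mktemp -d)
cat > "$shim_dir/nproc" <<'EOF'
#!/bin/sh
# Stand-in for nproc that always reports 8 processing units.
echo 8
EOF
chmod +x "$shim_dir/nproc"

# Only processes started with this PATH see the shim; the real nproc is untouched.
PATH="$shim_dir:$PATH" sh -c 'nproc'
```

Removing the temporary directory afterwards undoes the change completely.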

I'm gonna crank up the water cooling settings in BIOS and manually set the RAM configs then run memtest86+ as it sure feels like a bad stick of RAM in some ways, but might just be system instability when using all 32 cores... psure it also died a day ago with older gcc compiling llama.cpp with `-j$(nproc)` (had to dial it back to make -j8 also). Which suggests to me some kind of hardware/bios/temperature issue and not the underlying ARCH software packages. fwiw helldivers 2 crashes on me regularly (in windows 11 :oof:)...

Good luck sorting out your box!


#13 2024-07-03 01:57:05

ubergarm
Member
Registered: 2024-07-02
Posts: 3

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

@Wereii

Well, the solution in my case was to go into the motherboard BIOS and change the cooler type from water to box cooler. Doing this knocks the power limit down from some crazy high 4000+ Watts to the Intel-recommended 253W. (I have a 260 AIO water cooler). I even turned on XMP for the RAM.
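The power limits Linux sees can usually be read from the powercap sysfs interface. A rough sketch: it assumes the intel-rapl zone at /sys/class/powercap/intel-rapl:0 is present and that its values reflect what the BIOS configured, which may not hold on every board. show_power_limits is a hypothetical helper, and the demo runs against a fake directory rather than real hardware.

```shell
show_power_limits() {
    # $1 = powercap zone directory, e.g. /sys/class/powercap/intel-rapl:0
    for spl_name_file in "$1"/constraint_*_name; do
        [ -r "$spl_name_file" ] || continue
        spl_limit_file="${spl_name_file%_name}_power_limit_uw"
        [ -r "$spl_limit_file" ] || continue
        # powercap reports microwatts; integer division gives whole watts
        echo "$(cat "$spl_name_file") $(( $(cat "$spl_limit_file") / 1000000 ))W"
    done
}

# Demo against a fake zone directory (values chosen for illustration only).
demo_zone=$(mktemp -d)
echo long_term > "$demo_zone/constraint_0_name"
echo 253000000 > "$demo_zone/constraint_0_power_limit_uw"
show_power_limits "$demo_zone"

# On a real system (may need root):
#   show_power_limits /sys/class/powercap/intel-rapl:0
```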

I just tested, and under a CPU stress test the temperature now stays around 90 deg C (in my warm 30 deg C room). It used to peg at 100 deg C and throttle almost instantly.

The important part, I can now run `pacman -Syu nvidia-dkms` and it successfully compiles with nproc returning all 32 cores.

You can see the exact numbers you need in this guy's video: https://www.youtube.com/watch?v=s43Auv8ub7w

Cheers!


#14 2024-07-03 22:49:47

Wereii
Member
Registered: 2024-06-25
Posts: 8

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

Hey @ubergarm

glad I could help by sharing my struggles hah!

You have kind of confirmed my hunch that the PL1 (BIOS "cooler type") setting is what might be at play. It seems these instabilities are mainly caused by motherboard manufacturers setting higher power limits than Intel's recommended defaults, which does not bode well for the latest CPUs, which are already so power-hungry.

What still confuses me is that I had always set that "cooler type" setting in BIOS to box cooler (even though I do have an AIO water cooler, 4 kW seemed just insane) but I still had these instabilities.
Only now, after switching to the latest beta BIOS (which adds an option to use Intel's intended values for these configurations), does the crashing seem to be gone.

E: I didn't notice you had linked the JayzTwoCents video, which covers exactly this problem, so I kind of paraphrased it here.

Last edited by Wereii (2024-07-04 00:12:52)


#15 2024-07-05 18:53:51

ubergarm
Member
Registered: 2024-07-02
Posts: 3

Re: [Unsolvable] NVIDIA DKMS fails to compile every time I upgrade Kernel

Thanks for confirming, @Wereii and very happy to hear it sounds like your system is now stable with the Intel configs.

In further testing, simply setting the MSI BIOS to "Boxed Cooler" 253W was *not enough*, as compiling with all cores, e.g. `make -j32`, still borked about 20% of the time. I had to bump the Load Line Mode from the default of 9 up to 12, which in minimal testing increases temperature (some cores still peg out and throttle at 100 deg C), but knock on wood, the compiler stopped segfaulting and the system stopped occasionally hard locking.
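A figure like "about 20% of the time" can be estimated by repeating the build and counting non-zero exits. A minimal sketch: failure_rate is a hypothetical helper, and the demo uses `false` and `true` as stand-ins for a real build command.

```shell
failure_rate() {
    # $1 = number of runs; remaining args = command to run each time
    fr_runs=$1; shift
    fr_fail=0; fr_i=0
    while [ "$fr_i" -lt "$fr_runs" ]; do
        # Any non-zero exit status (including a compiler crash) counts as a failure.
        "$@" >/dev/null 2>&1 || fr_fail=$((fr_fail + 1))
        fr_i=$((fr_i + 1))
    done
    echo "$fr_fail/$fr_runs failed"
}

# Stand-in commands: one that always fails, one that always succeeds.
failure_rate 5 false
failure_rate 5 true

# Real use might look like:
#   failure_rate 10 sh -c 'make clean && make -j32'
```

With an intermittent fault, more runs give a more trustworthy number; a handful of clean runs is only weak evidence of stability.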

Here is the simplest explanation I've found for the MSI BIOS: https://www.msi.com/blog/improving-gami … -i9-14900k

Okay, good luck everyone with an Intel i9 13th/14th gen chip getting it to run stable so you can actually use all the cores you paid for simultaneously!
