I am using the beta drivers 550 so that I can try and run Wayland for some development stuff I am doing.
However, every time the kernel is updated, I have to go through a cycle of trying to get the DKMS module to rebuild and work.
I have no idea what is going wrong. I initially thought it was the GCC version, but I've tried literally every version from the last 3 months. It seems like pure luck when I do happen to get it to build successfully.
What am I doing wrong here? I have updated the linux and linux-headers packages. I have an up-to-date GCC 14.1.1, so what gives? What is the correct process to do this? It used to build and work just fine whenever the kernel was upgraded, until GCC 14 was released.
$ cat /var/lib/dkms/nvidia/555.52.04/build/make.log
DKMS make.log for nvidia-555.52.04 for kernel 6.9.4-arch1-1 (x86_64)
Thu Jun 13 06:52:54 PM MDT 2024
make[1]: Entering directory '/usr/lib/modules/6.9.4-arch1-1/build'
SYMLINK /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-kernel.o
SYMLINK /var/lib/dkms/nvidia/555.52.04/build/nvidia-modeset/nv-modeset-kernel.o
CONFTEST: hash__remap_4k_pfn
CONFTEST: set_pages_uc
CONFTEST: list_is_first
CONFTEST: set_memory_uc
CONFTEST: set_memory_array_uc
CONFTEST: set_pages_array_uc
CONFTEST: ioremap_cache
CONFTEST: ioremap_wc
CONFTEST: ioremap_driver_hardened
CONFTEST: ioremap_driver_hardened_wc
CONFTEST: ioremap_cache_shared
CONFTEST: pci_get_domain_bus_and_slot
CONFTEST: get_num_physpages
CONFTEST: pde_data
CONFTEST: xen_ioemu_inject_msi
CONFTEST: phys_to_dma
CONFTEST: get_dma_ops
CONFTEST: dma_attr_macros
CONFTEST: dma_map_page_attrs
CONFTEST: write_cr4
CONFTEST: of_find_node_by_phandle
CONFTEST: of_node_to_nid
CONFTEST: pnv_pci_get_npu_dev
CONFTEST: of_get_ibm_chip_id
CONFTEST: pci_bus_address
CONFTEST: pci_stop_and_remove_bus_device
CONFTEST: pci_rebar_get_possible_sizes
CONFTEST: wait_for_random_bytes
CONFTEST: register_cpu_notifier
CONFTEST: cpuhp_setup_state
CONFTEST: dma_map_resource
CONFTEST: get_backlight_device_by_name
CONFTEST: timer_setup
CONFTEST: pci_enable_msix_range
CONFTEST: kernel_read_has_pointer_pos_arg
CONFTEST: kernel_write_has_pointer_pos_arg
CONFTEST: dma_direct_map_resource
CONFTEST: tegra_get_platform
CONFTEST: tegra_bpmp_send_receive
CONFTEST: flush_cache_all
CONFTEST: vmf_insert_pfn
CONFTEST: jiffies_to_timespec
CONFTEST: ktime_get_raw_ts64
CONFTEST: ktime_get_real_ts64
CONFTEST: full_name_hash
CONFTEST: pci_enable_atomic_ops_to_root
CONFTEST: vga_tryget
CONFTEST: cc_platform_has
CONFTEST: seq_read_iter
CONFTEST: follow_pfn
CONFTEST: drm_gem_object_get
CONFTEST: drm_gem_object_put_unlocked
CONFTEST: add_memory_driver_managed
CONFTEST: device_property_read_u64
CONFTEST: devm_of_platform_populate
CONFTEST: of_dma_configure
CONFTEST: of_property_count_elems_of_size
CONFTEST: of_property_read_variable_u8_array
CONFTEST: of_property_read_variable_u32_array
CONFTEST: i2c_new_client_device
CONFTEST: i2c_unregister_device
CONFTEST: of_get_named_gpio
CONFTEST: devm_gpio_request_one
CONFTEST: gpio_direction_input
CONFTEST: gpio_direction_output
CONFTEST: gpio_get_value
CONFTEST: gpio_set_value
CONFTEST: gpio_to_irq
CONFTEST: icc_get
CONFTEST: icc_put
CONFTEST: icc_set_bw
CONFTEST: dma_buf_export_args
CONFTEST: dma_buf_ops_has_kmap
CONFTEST: dma_buf_ops_has_kmap_atomic
CONFTEST: dma_buf_ops_has_map
CONFTEST: dma_buf_ops_has_map_atomic
CONFTEST: dma_buf_has_dynamic_attachment
CONFTEST: dma_buf_attachment_has_peer2peer
CONFTEST: dma_set_mask_and_coherent
CONFTEST: devm_clk_bulk_get_all
CONFTEST: get_task_ioprio
CONFTEST: mdev_set_iommu_device
CONFTEST: offline_and_remove_memory
CONFTEST: stack_trace
CONFTEST: crypto_tfm_ctx_aligned
CONFTEST: wait_on_bit_lock_argument_count
CONFTEST: radix_tree_empty
CONFTEST: radix_tree_replace_slot
CONFTEST: pnv_npu2_init_context
CONFTEST: cpumask_of_node
CONFTEST: ioasid_get
CONFTEST: mm_pasid_drop
CONFTEST: mmget_not_zero
CONFTEST: mmgrab
CONFTEST: iommu_sva_bind_device_has_drvdata_arg
CONFTEST: vm_fault_to_errno
CONFTEST: find_next_bit_wrap
CONFTEST: iommu_is_dma_domain
CONFTEST: acpi_video_backlight_use_native
CONFTEST: drm_dev_unref
CONFTEST: drm_reinit_primary_mode_group
CONFTEST: get_user_pages_remote
CONFTEST: get_user_pages
CONFTEST: pin_user_pages_remote
CONFTEST: pin_user_pages
CONFTEST: drm_gem_object_lookup
CONFTEST: drm_atomic_state_ref_counting
CONFTEST: drm_driver_has_gem_prime_res_obj
CONFTEST: drm_atomic_helper_connector_dpms
CONFTEST: drm_connector_funcs_have_mode_in_name
CONFTEST: drm_connector_has_vrr_capable_property
CONFTEST: drm_framebuffer_get
CONFTEST: drm_dev_put
CONFTEST: drm_format_num_planes
CONFTEST: drm_connector_for_each_possible_encoder
CONFTEST: drm_rotation_available
CONFTEST: drm_vma_offset_exact_lookup_locked
CONFTEST: nvhost_dma_fence_unpack
CONFTEST: dma_fence_set_error
CONFTEST: fence_set_error
CONFTEST: sync_file_get_fence
CONFTEST: drm_aperture_remove_conflicting_pci_framebuffers
CONFTEST: drm_fbdev_generic_setup
CONFTEST: drm_connector_attach_hdr_output_metadata_property
CONFTEST: drm_helper_crtc_enable_color_mgmt
CONFTEST: drm_crtc_enable_color_mgmt
CONFTEST: drm_atomic_helper_legacy_gamma_set
CONFTEST: is_export_symbol_gpl_of_node_to_nid
CONFTEST: is_export_symbol_gpl_sme_active
CONFTEST: is_export_symbol_present_swiotlb_map_sg_attrs
CONFTEST: is_export_symbol_present_swiotlb_dma_ops
CONFTEST: is_export_symbol_present___close_fd
CONFTEST: is_export_symbol_present_close_fd
CONFTEST: is_export_symbol_present_get_unused_fd
CONFTEST: is_export_symbol_present_get_unused_fd_flags
CONFTEST: is_export_symbol_present_nvhost_get_default_device
CONFTEST: is_export_symbol_present_nvhost_syncpt_unit_interface_get_byte_offset
CONFTEST: is_export_symbol_present_nvhost_syncpt_unit_interface_get_aperture
CONFTEST: is_export_symbol_present_tegra_dce_register_ipc_client
CONFTEST: is_export_symbol_present_tegra_dce_unregister_ipc_client
CONFTEST: is_export_symbol_present_tegra_dce_client_ipc_send_recv
CONFTEST: is_export_symbol_present_dram_clk_to_mc_clk
CONFTEST: is_export_symbol_present_get_dram_num_channels
CONFTEST: is_export_symbol_present_tegra_dram_types
CONFTEST: is_export_symbol_present_pxm_to_node
CONFTEST: is_export_symbol_present_screen_info
CONFTEST: is_export_symbol_gpl_screen_info
CONFTEST: is_export_symbol_present_i2c_bus_status
CONFTEST: is_export_symbol_present_tegra_fuse_control_read
CONFTEST: is_export_symbol_present_tegra_get_platform
CONFTEST: is_export_symbol_present_pci_find_host_bridge
CONFTEST: is_export_symbol_present_tsec_comms_send_cmd
CONFTEST: is_export_symbol_present_tsec_comms_set_init_cb
CONFTEST: is_export_symbol_present_tsec_comms_clear_init_cb
CONFTEST: is_export_symbol_present_tsec_comms_alloc_mem_from_gscco
CONFTEST: is_export_symbol_present_tsec_comms_free_gscco_mem
CONFTEST: is_export_symbol_present_memory_block_size_bytes
CONFTEST: is_export_symbol_present_tegra_platform_is_fpga
CONFTEST: is_export_symbol_present_tegra_platform_is_sim
CONFTEST: crypto
CONFTEST: is_export_symbol_present_follow_pte
CONFTEST: is_export_symbol_present_int_active_memcg
CONFTEST: is_export_symbol_present_migrate_vma_setup
CONFTEST: dma_ops
CONFTEST: swiotlb_dma_ops
CONFTEST: noncoherent_swiotlb_dma_ops
CONFTEST: vm_fault_has_address
CONFTEST: vm_insert_pfn_prot
CONFTEST: vmf_insert_pfn_prot
CONFTEST: vm_ops_fault_removed_vma_arg
CONFTEST: kmem_cache_has_kobj_remove_work
CONFTEST: sysfs_slab_unlink
CONFTEST: proc_ops
CONFTEST: timespec64
CONFTEST: vmalloc_has_pgprot_t_arg
CONFTEST: mm_has_mmap_lock
CONFTEST: pci_channel_state
CONFTEST: pci_dev_has_ats_enabled
CONFTEST: remove_memory_has_nid_arg
CONFTEST: add_memory_driver_managed_has_mhp_flags_arg
CONFTEST: num_registered_fb
CONFTEST: pci_driver_has_driver_managed_dma
CONFTEST: vm_area_struct_has_const_vm_flags
CONFTEST: memory_failure_has_trapno_arg
CONFTEST: foll_longterm_present
CONFTEST: bus_type_has_iommu_ops
CONFTEST: backing_dev_info
CONFTEST: mm_context_t
CONFTEST: vm_fault_t
CONFTEST: mmu_notifier_ops_invalidate_range
CONFTEST: mmu_notifier_ops_arch_invalidate_secondary_tlbs
CONFTEST: migrate_vma_added_flags
CONFTEST: migrate_device_range
CONFTEST: handle_mm_fault_has_mm_arg
CONFTEST: handle_mm_fault_has_pt_regs_arg
CONFTEST: mempolicy_has_unified_nodes
CONFTEST: mempolicy_has_home_node
CONFTEST: mpol_preferred_many_present
CONFTEST: mmu_interval_notifier
CONFTEST: drm_bus_present
CONFTEST: drm_bus_has_bus_type
CONFTEST: drm_bus_has_get_irq
CONFTEST: drm_bus_has_get_name
CONFTEST: drm_driver_has_device_list
CONFTEST: drm_driver_has_legacy_dev_list
CONFTEST: drm_driver_has_set_busid
CONFTEST: drm_crtc_state_has_connectors_changed
CONFTEST: drm_init_function_args
CONFTEST: drm_helper_mode_fill_fb_struct
CONFTEST: drm_master_drop_has_from_release_arg
CONFTEST: drm_driver_unload_has_int_return_type
CONFTEST: drm_atomic_helper_crtc_destroy_state_has_crtc_arg
CONFTEST: drm_atomic_helper_plane_destroy_state_has_plane_arg
CONFTEST: drm_mode_object_find_has_file_priv_arg
CONFTEST: dma_buf_owner
CONFTEST: drm_connector_list_iter
CONFTEST: drm_atomic_helper_swap_state_has_stall_arg
CONFTEST: drm_driver_prime_flag_present
CONFTEST: drm_gem_object_has_resv
CONFTEST: drm_crtc_state_has_async_flip
CONFTEST: drm_crtc_state_has_pageflip_flags
CONFTEST: drm_crtc_state_has_vrr_enabled
CONFTEST: drm_format_modifiers_present
CONFTEST: drm_vma_node_is_allowed_has_tag_arg
CONFTEST: drm_vma_offset_node_has_readonly
CONFTEST: drm_display_mode_has_vrefresh
CONFTEST: drm_driver_master_set_has_int_return_type
CONFTEST: drm_driver_has_gem_free_object
CONFTEST: drm_prime_pages_to_sg_has_drm_device_arg
CONFTEST: drm_driver_has_gem_prime_callbacks
CONFTEST: drm_crtc_atomic_check_has_atomic_state_arg
CONFTEST: drm_gem_object_vmap_has_map_arg
CONFTEST: drm_plane_atomic_check_has_atomic_state_arg
CONFTEST: drm_device_has_pdev
CONFTEST: drm_crtc_state_has_no_vblank
CONFTEST: drm_mode_config_has_allow_fb_modifiers
CONFTEST: drm_has_hdr_output_metadata
CONFTEST: dma_resv_add_fence
CONFTEST: dma_resv_reserve_fences
CONFTEST: reservation_object_reserve_shared_has_num_fences_arg
CONFTEST: drm_connector_has_override_edid
CONFTEST: drm_master_has_leases
CONFTEST: drm_file_get_master
CONFTEST: drm_modeset_lock_all_end
CONFTEST: drm_connector_lookup
CONFTEST: drm_connector_put
CONFTEST: drm_driver_has_dumb_destroy
CONFTEST: fence_ops_use_64bit_seqno
CONFTEST: drm_aperture_remove_conflicting_pci_framebuffers_has_driver_arg
CONFTEST: drm_mode_create_dp_colorspace_property_has_supported_colorspaces_arg
CONFTEST: drm_syncobj_features_present
CONFTEST: drm_unlocked_ioctl_flag_present
CONFTEST: dom0_kernel_present
CONFTEST: nvidia_vgpu_kvm_build
CONFTEST: nvidia_grid_build
CONFTEST: nvidia_grid_csp_build
CONFTEST: pm_runtime_available
CONFTEST: pci_class_multimedia_hd_audio
CONFTEST: drm_available
CONFTEST: vfio_pci_core_available
CONFTEST: mdev_available
CONFTEST: cmd_uphy_display_port_init
CONFTEST: cmd_uphy_display_port_off
CONFTEST: memory_failure_mf_sw_simulated_defined
CONFTEST: drm_atomic_available
CONFTEST: is_export_symbol_gpl_refcount_inc
CONFTEST: is_export_symbol_gpl_refcount_dec_and_test
CONFTEST: drm_alpha_blending_available
CONFTEST: is_export_symbol_present_drm_gem_prime_fd_to_handle
CONFTEST: is_export_symbol_present_drm_gem_prime_handle_to_fd
CONFTEST: ib_peer_memory_symbols
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-pci.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-dmabuf.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-nano-timer.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-acpi.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-cray.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-dma.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-i2c.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-mmap.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-p2p.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-pat.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-procfs.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-usermap.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-vm.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-vtophys.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/os-interface.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/os-mlock.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/os-pci.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/os-registry.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/os-usermap.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-modeset-interface.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-pci-table.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-kthread-q.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-memdbg.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-report-err.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-rsync.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-msi.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps-imex.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv_uvm_interface.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_aead.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_aead.c:41:5: warning: no previous prototype for ‘libspdm_aead_prealloc’ [-Wmissing-prototypes]
41 | int libspdm_aead_prealloc(void **context, char const *alg)
| ^~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_aead.c:171:5: warning: no previous prototype for ‘libspdm_aead_prealloced’ [-Wmissing-prototypes]
171 | int libspdm_aead_prealloced(void *context,
| ^~~~~~~~~~~~~~~~~~~~~~~
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_ecc.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_hkdf.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_rand.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_shash.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_rsa.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_aead_aes_gcm.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_sha.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_hmac_sha.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_hkdf_sha.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_ec.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_x509.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/libspdm_rsa_ext.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nvlink_linux.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/nvlink_caps.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/linux_nvswitch.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps-imex.c:53:17: warning: no previous prototype for ‘nv_caps_imex_channel_get’ [-Wmissing-prototypes]
53 | int NV_API_CALL nv_caps_imex_channel_get(int fd)
| ^~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps-imex.c:90:17: warning: no previous prototype for ‘nv_caps_imex_channel_count’ [-Wmissing-prototypes]
90 | int NV_API_CALL nv_caps_imex_channel_count(void)
| ^~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps-imex.c:95:17: warning: no previous prototype for ‘nv_caps_imex_init’ [-Wmissing-prototypes]
95 | int NV_API_CALL nv_caps_imex_init(void)
| ^~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps-imex.c:146:18: warning: no previous prototype for ‘nv_caps_imex_exit’ [-Wmissing-prototypes]
146 | void NV_API_CALL nv_caps_imex_exit(void)
| ^~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-procfs.c:695:1: warning: no previous prototype for ‘exercise_error_forwarding_va’ [-Wmissing-prototypes]
695 | exercise_error_forwarding_va(
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:395:6: warning: no previous prototype for ‘nv_init_ibmnpu_info’ [-Wmissing-prototypes]
395 | void nv_init_ibmnpu_info(nv_state_t *nv)
| ^~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:399:6: warning: no previous prototype for ‘nv_destroy_ibmnpu_info’ [-Wmissing-prototypes]
399 | void nv_destroy_ibmnpu_info(nv_state_t *nv)
| ^~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:403:5: warning: no previous prototype for ‘nv_init_ibmnpu_devices’ [-Wmissing-prototypes]
403 | int nv_init_ibmnpu_devices(nv_state_t *nv)
| ^~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:408:6: warning: no previous prototype for ‘nv_unregister_ibmnpu_devices’ [-Wmissing-prototypes]
408 | void nv_unregister_ibmnpu_devices(nv_state_t *nv)
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:428:5: warning: no previous prototype for ‘nv_get_ibmnpu_chip_id’ [-Wmissing-prototypes]
428 | int nv_get_ibmnpu_chip_id(nv_state_t *nv)
| ^~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-ibmnpu.c:437:6: warning: no previous prototype for ‘nv_ibmnpu_cache_flush_numa_region’ [-Wmissing-prototypes]
437 | void nv_ibmnpu_cache_flush_numa_region(nv_state_t *nv)
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/os-interface.c:374:7: warning: no previous prototype for ‘os_mem_copy_custom’ [-Wmissing-prototypes]
374 | void *os_mem_copy_custom(
| ^~~~~~~~~~~~~~~~~~
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/procfs_nvswitch.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia/i2c_nvswitch.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_ats_sva.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-dma.c:293:6: warning: no previous prototype for ‘nv_load_dma_map_scatterlist’ [-Wmissing-prototypes]
293 | void nv_load_dma_map_scatterlist(
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-mmap.c:303:5: warning: conflicting types for ‘nv_encode_caching’ due to enum/integer mismatch; have ‘int(pgprot_t *, NvU32, nv_memory_type_t)’ {aka ‘int(struct pgprot *, unsigned int, nv_memory_type_t)’} [-Wenum-int-mismatch]
303 | int nv_encode_caching(
| ^~~~~~~~~~~~~~~~~
In file included from /var/lib/dkms/nvidia/555.52.04/build/common/inc/nv-linux.h:1781,
from /var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-mmap.c:27:
/var/lib/dkms/nvidia/555.52.04/build/common/inc/nv-proto.h:44:13: note: previous declaration of ‘nv_encode_caching’ with type ‘int(pgprot_t *, NvU32, NvU32)’ {aka ‘int(struct pgprot *, unsigned int, unsigned int)’}
44 | int nv_encode_caching (pgprot_t *, NvU32, NvU32);
| ^~~~~~~~~~~~~~~~~
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_conf_computing.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-dma.c:489:23: warning: no previous prototype for ‘nv_dma_unmap_sgt’ [-Wmissing-prototypes]
489 | NV_STATUS NV_API_CALL nv_dma_unmap_sgt(
| ^~~~~~~~~~~~~~~~
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_sec2_test.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_maxwell_sec2.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_hopper_sec2.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_common.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_linux.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/nvstatus.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv.c:1236:23: warning: no previous prototype for ‘nv_get_num_dpaux_instances’ [-Wmissing-prototypes]
1236 | NV_STATUS NV_API_CALL nv_get_num_dpaux_instances(nv_state_t *nv, NvU32 *num_instances)
| ^~~~~~~~~~~~~~~~~~~~~~~~~~
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/nvCpuUuid.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/nv-kthread-q.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/nv-kthread-q-selftest.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_tools.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_global.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_gpu.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_gpu_isr.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nv-caps.c:272:5: warning: no previous prototype for ‘nv_cap_procfs_init’ [-Wmissing-prototypes]
272 | int nv_cap_procfs_init(void)
| ^~~~~~~~~~~~~~~~~~
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_procfs.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_va_space.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_va_space_mm.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_gpu_semaphore.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_mem.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_rm_mem.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_channel.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_lock.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_hal.o
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_processors.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nvlink_linux.c:313:12: warning: no previous prototype for ‘nvlink_core_init’ [-Wmissing-prototypes]
313 | int __init nvlink_core_init(void)
| ^~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/nvlink_linux.c:389:6: warning: no previous prototype for ‘nvlink_core_exit’ [-Wmissing-prototypes]
389 | void nvlink_core_exit(void)
| ^~~~~~~~~~~~~~~~
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_range_tree.o
/var/lib/dkms/nvidia/555.52.04/build/nvidia/linux_nvswitch.c:1707:1: warning: no previous prototype for ‘nvswitch_init’ [-Wmissing-prototypes]
1707 | nvswitch_init
| ^~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia/linux_nvswitch.c:1792:1: warning: no previous prototype for ‘nvswitch_exit’ [-Wmissing-prototypes]
1792 | nvswitch_exit
| ^~~~~~~~~~~~~
CC [M] /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_rb_tree.o
In file included from ./include/linux/efi.h:23,
from /var/lib/dkms/nvidia/555.52.04/build/common/inc/nv-linux.h:217,
from /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_linux.h:40,
from /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_linux.c:24:
./include/linux/pstore.h:77:9: internal compiler error: Illegal instruction
77 | char *buf;
| ^~~~
0x1f84ca6 internal_error(char const*, ...)
???:0
0x1fea95b line_maps::get_or_create_combined_loc(unsigned int, source_range, void*, unsigned int)
???:0
0x2000db7 _cpp_lex_direct
???:0
0x2008ea0 _cpp_lex_token
???:0
0x7e8f10 c_lex_with_flags(tree_node**, unsigned int*, unsigned char*, int)
???:0
0x7499a9 c_parser_declspecs(c_parser*, c_declspecs*, bool, bool, bool, bool, bool, bool, bool, c_lookahead_kind)
???:0
0x748ddf c_parser_declarator(c_parser*, bool, c_dtr_syn, bool*)
???:0
0x74aebf c_parser_declspecs(c_parser*, c_declspecs*, bool, bool, bool, bool, bool, bool, bool, c_lookahead_kind)
???:0
0x76ecd5 c_parse_file()
???:0
0x7e4b95 c_common_parse_file()
???:0
Please submit a full bug report, with preprocessed source (by using -freport-bug).
Please include the complete backtrace with any bug report.
See <https://gitlab.archlinux.org/archlinux/packaging/packages/gcc/-/issues> for instructions.
make[3]: *** [scripts/Makefile.build:244: /var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_linux.o] Error 1
make[3]: *** Waiting for unfinished jobs....
/var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/nv-kthread-q-selftest.c:84:6: warning: no previous prototype for ‘on_nvq_assert’ [-Wmissing-prototypes]
84 | void on_nvq_assert(void)
| ^~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_tools.c:1323:6: warning: no previous prototype for ‘uvm_tools_record_access_counter’ [-Wmissing-prototypes]
1323 | void uvm_tools_record_access_counter(uvm_va_space_t *va_space,
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_tools.c:2822:5: warning: no previous prototype for ‘uvm_tools_init’ [-Wmissing-prototypes]
2822 | int uvm_tools_init(dev_t uvm_base_dev)
| ^~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_tools.c:2883:6: warning: no previous prototype for ‘uvm_tools_exit’ [-Wmissing-prototypes]
2883 | void uvm_tools_exit(void)
| ^~~~~~~~~~~~~~
/var/lib/dkms/nvidia/555.52.04/build/nvidia-uvm/uvm_gpu_semaphore.c:510:6: warning: no previous prototype for ‘tracking_semaphore_uses_mutex’ [-Wmissing-prototypes]
510 | bool tracking_semaphore_uses_mutex(uvm_gpu_tracking_semaphore_t *tracking_semaphore)
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
make[2]: *** [/usr/lib/modules/6.9.4-arch1-1/build/Makefile:1919: /var/lib/dkms/nvidia/555.52.04/build] Error 2
make[1]: *** [Makefile:240: __sub-make] Error 2
make[1]: Leaving directory '/usr/lib/modules/6.9.4-arch1-1/build'
make: *** [Makefile:89: modules] Error 2
Last edited by jslay (2024-06-14 20:32:41)
Offline
./include/linux/pstore.h:77:9: internal compiler error: Illegal instruction
We've been here before: https://bbs.archlinux.org/viewtopic.php … 0#p2173290
I have to go through a cycle of trying to get the DKMS module to rebuild and work.
If the behavior is non-deterministic, you should make sure you are not running out of memory (do you have a swapfile?) and run memtest86+ for a night.
Offline
As I have stated, I have tried every version of gcc that seems reasonable within the last 3 months. I have no other version of gcc installed but the one from pacman. Linking me to the same thread that says I have the wrong gcc version is of no help. The system is a 13900K with 64GB of RAM. It is not an OOM issue.
I have zero issues with anything else. As I am a developer, I am compiling golang and java constantly on this machine with no issues. I highly doubt I have an issue with my memory.
The deterministic thing here is that it fails to compile 550 every time a kernel upgrade happens, and only this ever has issues. I have several other DKMS modules that compile with no issue. I can compile 545 with no issue.
Offline
The comment in that thread was that you're off topic because you're hitting a completely different problem than the OP, one that matches the other thread linked there.
You also suggested that you're not getting this with gcc 13, which would make some sort of sense because it's an internal compiler error.
The sublink also points to the gcc bug tracker, where you'd therefore have to report that.
NB that "I've got RAM, so it's not OOM" is a non sequitur. If things go wrong, the system can allocate all your (remaining) RAM in a split second.
That's aside from the issues you can get with zram or unbacked zswap.
Offline
Yes, this is what I had originally stated in the other thread, but this was just one of those lucky runs apparently.
I cannot seem to get it to compile consistently with any version.
I guess this is not solvable. I’ve given up at this point. I cannot keep spending hours trying to find an issue with a beta driver anymore. I tried to report what I’m finding to keep others from going down the same rabbit holes, but this is not on me anymore. I will just go back to 545, as that compiles with zero issues every single time (again, not a RAM issue).
My Wayland support for my software will just have to wait a few more months until this is solved by someone more knowledgeable.
Last edited by jslay (2024-06-14 20:31:37)
Offline
Afaiu it's not only the 555xx beta driver but also the 550xx driver and AGAIN: this is NOT a bug with the driver, the bug would be in gcc.
Except it's actually different from the SIGSEGV in https://bbs.archlinux.org/viewtopic.php?id=296086 and a remarkably consistent SIGILL - except it also doesn't always happen, but when it does, it's always in this context (and unrelated to the GCC version)?
Does the CPU overheat?
Can you prevent it by limiting the parallel jobs (to leave some cores alone)?
Edit: or did you forget to install/load the microcode patches for your CPU?
Last edited by seth (2024-06-14 21:14:33)
Offline
I hope this is not necrobumping yet, but problems with nvidia dkms have been occurring for me on a persistent basis too, and I too have a 13900K(F) with 64G of RAM (DDR5 in my case). It's a SIGILL in my current case too, but as far as I remember even the specific error was not stable.
My fix for now has been to edit /var/lib/dkms/nvidia/<ver>/source/dkms.conf and, in the line starting with MAKE[0]=, replace -j`nproc` with -j8 or a smaller number, then run dkms install manually. This has always been enough to get the dkms build through.
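For anyone who wants to script that edit, here is a minimal sketch. It assumes the MAKE[0]= line really contains the literal text -j`nproc` as described above. The function name limit_dkms_jobs is made up for illustration, and the sed call keeps a .bak copy next to the file.

```shell
#!/bin/sh
# Sketch: cap the parallel job count in a dkms.conf.
# Assumption: the MAKE[0]= line contains the literal text -j`nproc`.
# limit_dkms_jobs is a hypothetical helper name, not part of dkms.
limit_dkms_jobs() {
    local conf="$1" jobs="${2:-8}"
    # Refuse anything that is not a plain positive integer.
    case "$jobs" in
        ''|*[!0-9]*) echo "jobs must be a number" >&2; return 1 ;;
    esac
    # Only touch the line starting with MAKE[0]=; a backup is written to "$conf.bak".
    sed -i.bak -E '/^MAKE\[0\]=/ s/-j`nproc`/-j'"$jobs"'/' "$conf"
}
```

Run it as root with the path to your dkms.conf and the job count, for example limit_dkms_jobs /var/lib/dkms/nvidia/<ver>/source/dkms.conf 8, then rerun the dkms install by hand. A driver update will probably bring back the original line, so the edit may need repeating.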
These errors occurred more often when I had a slight under-clock configured. I have since reduced it to almost nothing (every other benchmark, tool, or game was stable even with the previous values; only nvidia dkms was not), and it hasn't happened since, except for today with nvidia/550.90.07 @ 6.9.6-arch1-1.
Last edited by Wereii (2024-06-25 22:30:11)
Offline
The newer drivers in the last week or so (both normal and beta) seem to have resolved my issue. I can consistently build them now.
I double-checked everything. I last updated my BIOS on June 7th, with an update the manufacturer released on June 6th, so I am fairly certain I have had the latest microcode: I was experiencing instability from the recent issues Intel has had with these chips (usually the entire machine would hang when compiling shaders or something while trying to play a game), and I haven't experienced those issues since.
As to why I couldn't build the drivers consistently before this last week? Not sure. GCC was not updated.
I had to go back to beta (555.52.04-1 atm) as 550 was causing me further issues in Wayland once I had updated to KDE Plasma 6.1
I am not running any overclock beyond the XMP-1 profile for my RAM. CPU clocking has not been modified.
I am also on 6.9.6-arch1-1
gcc 14.1.1+r58+gfc9fb69ad62-1
gcc-libs 14.1.1+r58+gfc9fb69ad62-1
lib32-gcc-libs 14.1.1+r58+gfc9fb69ad62-1
lib32-nvidia-utils-beta 555.52.04-1
nvidia-beta-dkms 555.52.04-1
nvidia-utils-beta 555.52.04-2
Last edited by jslay (2024-06-26 07:26:35)
Offline
I have removed any custom BIOS settings by resetting my MSI board to defaults. I might not have the latest BIOS, but I verified that intel-microcode is packed into the initramfs (lsinitcpio --early /boot/initramfs-linux.img contains GenuineIntel.bin).
Running the dkms build manually, the build starts to fail around 29-30 jobs (cores) with SIGILL.
The place where the SIGILL is raised seems unstable.
This is with intel-pstate powersave governor, the CPU package does not climb over 88C.
I will look into BIOS updates, also interested if I can cause these errors when compiling different projects but either way my hunch for now is that when both E-Cores and P-Cores are engaged and some specific combination of instructions is executed it shits itself, though rasdaemon shows no errors.
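To narrow down a threshold like the 29-30 jobs mentioned above, a rough sketch along these lines might help. It is only an illustration: first_failing_jobs is a made-up helper, and build_cmd stands for whatever wrapper you write around your manual build (called with the job count as its only argument). Since the failures are not deterministic, one pass gives a hint at best, so repeat it a few times.

```shell
#!/bin/sh
# Sketch: report the lowest job count at which a build command fails.
# build_cmd is a hypothetical wrapper; it receives the job count as $1
# and must return non-zero when the build fails.
first_failing_jobs() {
    local build_cmd="$1" j="${2:-1}" hi="${3:-$(nproc)}"
    while [ "$j" -le "$hi" ]; do
        if ! "$build_cmd" "$j" >/dev/null 2>&1; then
            echo "$j"
            return 0
        fi
        j=$((j + 1))
    done
    # No job count in the range failed during this pass.
    echo "none"
}
```

For example, first_failing_jobs my_build 24 32 would step from -j24 up to -j32, where my_build is your own function that cleans the build directory and runs the build with the given job count.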
Offline
Updated BIOS to latest non-beta (for my MSI PRO Z790-A WIFI that is 7E07vAB released 2024-05-14) and the crash seems to be gone, even with all 32 cores the build finishes without errors. Even with a bit of tweaks (lowering pl1 and xmp enabled) it does not crash.
Edit: Never mind, the current nvidia-dkms update triggered a dkms build, and that has failed the same way again.
Edit2:
Testing with phoronix-test-suite benchmark build-linux-kernel I can trigger the SIGILLs too. For now my hunch is power spikes: playing with the CPU frequency, I do seem to get some stability when reducing the max frequency by 500 MHz (to 5000 MHz).
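One way to try such a frequency cap from Linux instead of the BIOS is the cpufreq sysfs interface. The sketch below assumes each policy directory under /sys/devices/system/cpu/cpufreq exposes a writable scaling_max_freq file in kHz. The helper name cap_max_freq_mhz is invented here, and this is not necessarily how the cap above was set.

```shell
#!/bin/sh
# Sketch: cap the maximum CPU frequency via cpufreq sysfs.
# Assumption: policy*/scaling_max_freq exists and takes a value in kHz.
# Writing to the real sysfs tree needs root; the second argument lets
# you point the function at another directory for a dry run.
cap_max_freq_mhz() {
    local mhz="$1" root="${2:-/sys/devices/system/cpu/cpufreq}" f
    for f in "$root"/policy*/scaling_max_freq; do
        # Skip the unexpanded glob when no policy directories exist.
        [ -e "$f" ] || continue
        echo "$((mhz * 1000))" > "$f"
    done
}
```

cap_max_freq_mhz 5000 would correspond to the 5000 MHz limit mentioned above. The setting does not survive a reboot.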
Does anyone here know if power spikes (that don't cause total system crash) would get reported somehow (waving at rasdaemon)?
Last edited by Wereii (2024-07-01 10:56:25)
Offline
Downgrading gcc and gcc-libs to 13.2.1, I still got the SIGILLs, but rasdaemon also caught an MCE event: Internal Parity Error on cpu 4. That was shortly followed by a kernel oops (something along the lines of cpu4 taking too long), and the system ground to a halt (though that might have been caused by downgrading gcc-libs, as some stuff seems to depend on it at runtime).
Either way, I can't seem to pin down a single deciding factor behind these errors, other than a faulty CPU or other HW.
For now I am testing the beta BIOS for my mobo. The beta makes it possible to select Intel's default power options (instead of the manufacturer-specific ones, MSI's in my case).
Last edited by Wereii (2024-07-01 15:23:31)
Offline
@Wereii - thanks for this - you saved me today!
It was my first time doing a `pacman -Syu` since installing the hard drive in a new PC build. It actually froze up so hard I had to hold the power button. Took a minute to fix up the missing vmlinuz-linux boot files, but after booting I couldn't startx. Reinstalled all the packages and still no dice. I knew the nvidia driver modules weren't loading and finally noticed that `pacman -Syu nvidia-dkms # 555.58-2` was quietly throwing "Error! Bad return status for module build on kernel: 6.9.7-arch-1 (x86_64)." I tried a few times and only once it threw a few MCE errors like: "mce: [Hardware Error]: CPU 8: Machine Check: 0 Bank 0: ..."
MoBo: MSI Pro Z690-A WiFi
CPU: i9-14900K
GPU: 3090TI
RAM: 2x 32GB sticks of DDR5 5200
I first tried some BIOS settings like turning off XMP to underclock the RAM. Also updated the BIOS firmware to latest version released 2024-04-11. Still didn't help.
I saw your note above and I did a dirty trick of backing up `/usr/bin/nproc` and making a bash script called nproc that just did `echo 8` and was able to get pacman to install nvidia-dkms and the hooks properly rebuilt the initramfs and I was :gucci: again for xorg!
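A slightly less invasive variant of the same trick is to leave `/usr/bin/nproc` alone and put a fake `nproc` earlier in `PATH`. This is a sketch only: `make_nproc_shim` is a made-up helper, and whether the shim is actually seen by the DKMS hook depends on how `PATH` is passed through sudo and pacman, which I have not verified.

```shell
# Create a throwaway directory holding a fake nproc that reports a fixed count.
# $1 = number of CPUs to report. Prints the directory path.
make_nproc_shim() {
    shim_dir="$(mktemp -d)"
    printf '#!/bin/sh\necho %s\n' "$1" > "$shim_dir/nproc"
    chmod +x "$shim_dir/nproc"
    echo "$shim_dir"
}

# Example: anything started with this PATH sees 8 CPUs via nproc.
#   shim="$(make_nproc_shim 8)"
#   sudo env PATH="$shim:$PATH" pacman -Syu nvidia-dkms
```

Deleting the shim directory afterwards restores normal behaviour, with no system file to put back.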
I'm gonna crank up the water cooling settings in BIOS and manually set the RAM configs then run memtest86+ as it sure feels like a bad stick of RAM in some ways, but might just be system instability when using all 32 cores... psure it also died a day ago with older gcc compiling llama.cpp with `-j$(nproc)` (had to dial it back to make -j8 also). Which suggests to me some kind of hardware/bios/temperature issue and not the underlying ARCH software packages. fwiw helldivers 2 crashes on me regularly (in windows 11 :oof:)...
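For dialing builds back without hardcoding a number everywhere, a tiny helper like this may do. `capped_jobs` is a hypothetical name; the `dkms ... -j` usage in the comment is based on my reading of the dkms man page, so check `man dkms` on your version before relying on it.

```shell
# Print the smaller of a cap and the detected CPU count.
# $1 = cap, $2 = CPU count (defaults to what nproc reports).
capped_jobs() {
    cap="$1"
    cpus="${2:-$(nproc)}"
    if [ "$cpus" -lt "$cap" ]; then
        echo "$cpus"
    else
        echo "$cap"
    fi
}

# Examples:
#   make -j"$(capped_jobs 8)"
#   dkms build <module>/<version> -j "$(capped_jobs 8)"
```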
Good luck sorting out your box!
Offline
@Wereii
Well, the solution in my case was to go into the motherboard BIOS and change the cooler type from water to box cooler. After doing this, it knocks down the power limit from some crazy high 4000+ Watts to the intel recommended 253W. (I have a 260 AIO water cooler). I even turned on XMP for the RAM.
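To double-check from Linux what power limits are actually in effect, the powercap sysfs interface might help. A rough sketch, assuming the intel_rapl driver is loaded and the package zone lives at the path below; `show_power_limits` is a made-up helper, and reading these files may need root on some kernels.

```shell
# Print each package power constraint as "<name> <watts>".
# $1 = RAPL package zone directory (assumed default: first package).
show_power_limits() {
    zone="${1:-/sys/class/powercap/intel-rapl:0}"
    for name_file in "$zone"/constraint_*_name; do
        [ -r "$name_file" ] || continue
        limit_file="${name_file%_name}_power_limit_uw"
        [ -r "$limit_file" ] || continue
        # Limits are stored in microwatts; integer-divide down to watts.
        echo "$(cat "$name_file") $(( $(cat "$limit_file") / 1000000 ))"
    done
}

# Example: show_power_limits
```

On a board left at the vendor's "water cooler" preset I would expect very large values here, and something close to 253 after switching presets, but I have not confirmed that on this hardware.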
I just tested and stress test CPU now stays around 90 deg C (in my warm 30 deg C room). It used to peg 100 deg C and throttle almost instantly.
The important part, I can now run `pacman -Syu nvidia-dkms` and it successfully compiles with nproc returning all 32 cores.
You can see the exact numbers you need in this guy's video: https://www.youtube.com/watch?v=s43Auv8ub7w
Cheers!
Offline
Hey @ubergarm
glad I could help by sharing my struggles hah!
You have kind of confirmed my hunch that the PL1 (BIOS "cooler type") setting is what might be at play. It seems these instabilities are mainly caused by motherboard manufacturers setting higher power limits than Intel's recommended defaults,
which does not bode well for the latest, itself so power-hungry, cpus.
What confuses me still, is that I had always set that "cooler type" setting in BIOS to box cooler (even though I do have AIO water cooler, 4 kW seemed just insane) but I still had these instabilities.
Until now as I switched to the latest beta BIOS (which adds an option to use Intel's intended values for these configurations) the crashing seems to be gone.
E: I didn't notice you linked the JayzTwoCents video which is exactly the problem linked in the url above, so I kind of paraphrased it here.
Last edited by Wereii (2024-07-04 00:12:52)
Offline
Thanks for confirming, @Wereii and very happy to hear it sounds like your system is now stable with the Intel configs.
In further testing, simply setting the MSI BIOS to "Boxed Cooler" 253W was *not enough* as compiling with all cores e.g. `make -j32` still borked about 20% of the time. I had to bump the Load Line Mode from default of 9 up to 12 which in minimal testing increases temperature (some cores still peg out and throttle at 100 deg C), but knock-on-wood the compiler stopped segfaulting / and even occasionally hard locking the system.
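To put a number on "borked about 20% of the time" after each BIOS change, a small repeat-and-count loop is enough. Just a sketch: `count_failures` is a hypothetical helper, and the build command in the example is a placeholder for whatever project you use as the stress test.

```shell
# Run a command N times and print how many runs failed.
# $1 = number of runs, remaining arguments = command to run.
count_failures() {
    runs="$1"
    shift
    failures=0
    i=0
    while [ "$i" -lt "$runs" ]; do
        # Discard output; only the exit status matters here.
        "$@" >/dev/null 2>&1 || failures=$(( failures + 1 ))
        i=$(( i + 1 ))
    done
    echo "$failures"
}

# Example: count_failures 10 sh -c 'make clean && make -j32'
```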
Here is the simplest explanation I've found for MSI BIOS: https://www.msi.com/blog/improving-gami … -i9-14900k
Okay, good luck everyone with an Intel i9 13th/14th gen chip getting it to run stable so you can actually use all the cores you paid for simultaneously!
Offline