You are not logged in.
I mainly suspect 2 problems:
1. nvidia-smi and others like nvtop report the card correctly but no process using it
Wed Apr 2 17:46:59 2025
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 570.133.07 Driver Version: 570.133.07 CUDA Version: 12.8 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 5070 Off | 00000000:01:00.0 Off | N/A |
| 0% 47C P8 7W / 250W | 0MiB / 12227MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| No running processes found |
+-----------------------------------------------------------------------------------------+
2.
kind of like this topic: https://bbs.archlinux.org/viewtopic.php?id=302266
> ls /sys/module/nvidia_drm
ls: cannot access '/sys/module/nvidia_drm': No such file or directory
but
> ls /sys/module | grep nvidia
nvidia
nvidia_uvm
and when I do
sudo modprobe nvidia_drm -vv
it will panic with
Apr 02 18:44:30 NAMED kernel: nvidia-modeset: Loading NVIDIA UNIX Open Kernel Mode Setting Driver for x86_64 570.133.07 Release Build (archlinux-builder@NAMED)
Apr 02 18:44:30 NAMED kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
Apr 02 18:44:30 NAMED kernel: [drm] Initialized nvidia-drm 0.0.0 for 0000:01:00.0 on minor 1
Apr 02 18:44:31 NAMED kernel: Console: switching to colour dummy device 80x25
Apr 02 18:44:31 NAMED kernel: nvidia 0000:01:00.0: vgaarb: deactivate vga console
Apr 02 18:44:31 NAMED kernel: fbcon: nvidia-drmdrmfb (fb0) is primary device
Apr 02 18:44:31 NAMED kernel: Console: switching to colour frame buffer device 240x67
Apr 02 18:44:31 NAMED kernel: nvidia 0000:01:00.0: [drm] fb0: nvidia-drmdrmfb frame buffer device
Apr 02 18:44:31 NAMED kernel: llvmpipe-19[4805]: segfault at 1500 ip 0000766369553d0d sp 00007662277fd600 error 6
Apr 02 18:44:31 NAMED kernel: llvmpipe-20[4806]: segfault at 1600 ip 0000766369553d0d sp 0000766226ffc600 error 6
Apr 02 18:44:31 NAMED kernel: llvmpipe-22[4808]: segfault at 1700 ip 0000766369553d0d sp 0000766225ffa600 error 6
Apr 02 18:44:31 NAMED kernel: likely on CPU 23 (core 13, socket 0)
Apr 02 18:44:31 NAMED kernel: likely on CPU 8 (core 10, socket 0)
Apr 02 18:44:31 NAMED kernel: Code: 67 c0 c5 d9 60 d8 c5 d9 68 c0 c5 f1 61 e3 c5 f1 69 cb c5 e9 61 d8 c5 e9 69 c0 c5 d9 6c d1 c5 d9 6d c9 c5 e1 6c e0 c5 e1 6d c0 <c5> f9 7f 11 c5 f9 7f 0c 01 c5 f9 7f 24 11 c5 f9 7f 04 31 48 89 ec
Apr 02 18:44:31 NAMED kernel:
Apr 02 18:44:31 NAMED kernel: Code: 67 c0 c5 d9 60 d8 c5 d9 68 c0 c5 f1 61 e3 c5 f1 69 cb c5 e9 61 d8 c5 e9 69 c0 c5 d9 6c d1 c5 d9 6d c9 c5 e1 6c e0 c5 e1 6d c0 <c5> f9 7f 11 c5 f9 7f 0c 01 c5 f9 7f 24 11 c5 f9 7f 04 31 48 89 ec
Apr 02 18:44:31 NAMED kernel: likely on CPU 7 (core 9, socket 0)
Apr 02 18:44:31 NAMED kernel: Code: 67 c0 c5 d9 60 d8 c5 d9 68 c0 c5 f1 61 e3 c5 f1 69 cb c5 e9 61 d8 c5 e9 69 c0 c5 d9 6c d1 c5 d9 6d c9 c5 e1 6c e0 c5 e1 6d c0 <c5> f9 7f 11 c5 f9 7f 0c 01 c5 f9 7f 24 11 c5 f9 7f 04 31 48 89 ec
Apr 02 18:44:32 NAMED kernel: xdg-desktop-por[5836]: segfault at 14 ip 0000761d6d0c288c sp 00007ffff4084990 error 6 in libwayland-client.so.0.23.1[588c,761d6d0c0000+6000] likely on CPU 7 (core 9, socket 0)
Apr 02 18:44:32 NAMED kernel: Code: c3 00 00 00 01 81 fb 00 00 f0 00 77 61 49 8b 04 24 48 c1 e8 03 39 d8 72 65 39 c3 74 21 49 8b 44 24 10 43 8d 54 2d 00 48 09 d1 <48> 89 0c d8 31 c0 48 83 c4 18 5b 41 5c 41 5d 5d c3 0f 1f 00 48 89
Apr 02 18:44:42 NAMED kernel: ata8: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Apr 02 18:44:42 NAMED kernel: ata8.00: configured for UDMA/133
Apr 02 18:44:42 NAMED kernel: ahci 0000:10:00.0: port does not support device sleep
Apr 02 18:44:42 NAMED kernel: sd 7:0:0:0: [sda] Starting disk
Apr 02 18:44:42 NAMED kernel: sd 7:0:0:0: [sda] Starting disk
Apr 02 18:44:42 NAMED kernel: sd 7:0:0:0: [sda] Starting disk
Apr 02 18:44:42 NAMED systemd-shutdown[1]: Syncing filesystems and block devices.
---
version info:
it happened both on kernel `6.13.8` and `6.14.0`.
it happened both on `nvidia-open-dkms`, `nvidia-open-dkms-git`, `nvidia-open-beta`.
config:
I disabled my bundled AMD gpu by `install amdgpu /usr/bin/false` in `/etc/modprobe.d/blacklist.conf`
disabled nouveau by configing `/etc/modprobe.d/nvidia.conf` with
blacklist nouveau
options nvidia_drm modeset=1 fbdev=1
/etc/mkinitcpio.conf:
MODULES=(nvidia nvidia_modeset nvidia_uvm nvidia_drm)
...
HOOKS=(base udev autodetect microcode modconf keyboard keymap block filesystems fsck plymouth)
I'm connected the monitor with the HDMI port on the graph card instead of the HDMI port on the motherboard
Last edited by unamed^^ (Yesterday 00:30:17)
Σ(-᷅_-᷄๑)
Offline
Please post your complete system journal for the boot:
sudo journalctl -b | curl -F 'file=@-' 0x0.st
Offline
Please post your complete system journal for the boot:
sudo journalctl -b | curl -F 'file=@-' 0x0.st
can be sth about this?
systemd-modules-load[515]: Module 'nvidia_uvm' is deny-listed (by kmod)
> lspci -nnk | grep nvidia
Kernel driver in use: nvidia
Kernel modules: nouveau, nvidia_drm, nvidia
PS: I'm using Hyprland on wayland
Last edited by unamed^^ (Yesterday 16:07:08)
Σ(-᷅_-᷄๑)
Offline
I disabled my bundled AMD gpu by `install amdgpu /usr/bin/false` in `/etc/modprobe.d/blacklist.conf`
That's not how this works.
https://wiki.hyprland.org/Configuring/M … gpu-to-use
Apr 03 11:01:45 NAMED uwsm_hyprland.desktop[4631]: 79:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Granite Ridge [Radeon Graphics] [1002:13c0] (rev c2) (prog-if 00 [VGA controller])
But amdgpu isn't loaded, the the result is
Apr 03 11:01:45 NAMED uwsm_hyprland.desktop[4631]: [LOG] [AQ] drm: Enumerated device /sys/devices/pci0000:00/0000:00:01.1/0000:01:00.0/simple-framebuffer.0/drm/card0
Apr 03 11:01:45 NAMED uwsm_hyprland.desktop[4631]: [LOG] [AQ] drm: Starting backend for /dev/dri/card0, with driver simpledrm
Apr 03 11:01:47 NAMED uwsm_hyprland.desktop[4631]: [LOG] DRM driver information: simpledrm v1.0.0 from 0 description DRM driver for simple-framebuffer platform devices
nvidia_uvm is cuda-stuff, idk whether it's supported w/ nvidia-open, but it will not be the cause for the software rasterizer.
Adding "nvidia_drm.modeset=1" to the https://wiki.archlinux.org/title/Kernel_parameters (and *ONLY* there) will block the simplydumb device, but I'd not rely on hyprland picking up the nvidia GPU automagically by that (you may try, though - however it seems you're using an output that's connected to the AMD GPU?)
Offline
That's not how this works.
ok I remove the blacklist of my AMD gpu and change the config in `~/.config/hypr/hyprland.conf`
# NV
env = LIBVA_DRIVER_NAME,nvidia
env = GBM_BACKEND,nvidia-drm
env = __GLX_VENDOR_LIBRARY_NAME,nvidia
env = AQ_DRM_DEVICES,/dev/dri/card0:/dev/dri/card1
env = NVD_BACKEND,direct
> ls -l /dev/dri/by-path
total 0
lrwxrwxrwx 1 root root 8 Apr 3 17:10 pci-0000:01:00.0-platform-simple-framebuffer.0-card -> ../card0
lrwxrwxrwx 1 root root 8 Apr 3 17:10 pci-0000:79:00.0-card -> ../card1
lrwxrwxrwx 1 root root 13 Apr 3 17:10 pci-0000:79:00.0-render -> ../renderD128
> lspci | grep -E 'VGA|3D'
01:00.0 VGA compatible controller: NVIDIA Corporation GN22 [GeForce RTX 5070] (rev a1)
79:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Granite Ridge [Radeon Graphics] (rev c2)
but it is still using software I assume
PS: I got
libva error: /usr/lib/dri/nvidia_drv_video.so init failed
in starting google chrome
Last edited by unamed^^ (Yesterday 22:21:19)
Σ(-᷅_-᷄๑)
Offline