#1 2022-10-18 14:28:49

hjmu
Member
Registered: 2022-10-11
Posts: 6

NVIDIA-SMI has failed because it couldn't communicate[Solved]

I could not get the nvidia driver working. I followed section 1.2 (Custom kernel), and nvidia-smi gives the following message:

@arch ~]$ nvidia-smi 
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

I installed Arch only about two weeks ago and have not been able to solve this. I referred to this post, but it concerns a different kernel. Here is my system information:

@arch ~]$ neofetch
                   -`                    myname@arch 
                  .o+`                   ------------- 
                 `ooo/                   OS: Arch Linux x86_64 
                `+oooo:                  Host: G7 7700 
               `+oooooo:                 Kernel: 6.0.2-zen1-1-zen 
               -+oooooo+:                Uptime: 22 mins 
             `/:-:++oooo+:               Packages: 1321 (pacman), 9 (flatpak) 
            `/++++/+++++++:              Shell: bash 5.1.16 
           `/++++++++++++++:             Resolution: 3840x2160 
          `/+++ooooooooooooo/`           DE: Plasma 5.26.1 
         ./ooosssso++osssssso+`          WM: KWin 
        .oossssso-````/ossssss+`         Theme: [Plasma], Breeze [GTK2/3] 
       -osssssso.      :ssssssso.        Icons: Adwaita [Plasma], Adwaita [GTK2 
      :osssssss/        osssso+++.       Terminal: gnome-terminal 
     /ossssssss/        +ssssooo/-       CPU: Intel i7-10750H (12) @ 5.000GHz 
   `/ossssso+/:-        -:/+osssso+-     GPU: NVIDIA GeForce RTX 2070 Mobile /  
  `+sso+:-`                 `.-/+oso:    GPU: Intel CometLake-H GT2 [UHD Graphi 
 `++:.                           `-/+/   Memory: 3806MiB / 15631MiB 
 .`                                 `/
@arch ~]$ lspci -k | grep -A 2 -E "(VGA|3D)"
00:02.0 VGA compatible controller: Intel Corporation CometLake-H GT2 [UHD Graphics] (rev 05)
	DeviceName: Onboard - Video
	Subsystem: Dell Device 09f9
--
01:00.0 VGA compatible controller: NVIDIA Corporation TU106M [GeForce RTX 2070 Mobile / Max-Q Refresh] (rev a1)
	Subsystem: Dell Device 09f9
	Kernel driver in use: nouveau

(So the card is still using nouveau.)

@arch ~]$ pacman -Qs nvidia
local/egl-Wayland 2:1.1.11-2
    EGLStream-based Wayland external platform
local/lib32-OpenCL-nvidia 520.56.06-1
    OpenCL implemention for NVIDIA (32-bit)
local/libvdpau 1.5-1
    Nvidia VDPAU library
local/nvidia-dkms 520.56.06-1
    NVIDIA drivers - module sources
local/nvidia-utils 520.56.06-1
    NVIDIA drivers utilities
local/opencl-nvidia 520.56.06-1
    OpenCL implemention for NVIDIA
local/xf86-video-nouveau 1.0.17-2 (xorg-drivers)
    Open Source 3D acceleration driver for nVidia cards

Last edited by hjmu (2022-10-19 04:10:39)

#2 2022-10-18 15:19:34

astralc
Member
Registered: 2022-09-17
Posts: 127

Re: NVIDIA-SMI has failed because it couldn't communicate[Solved]

nvidia-smi only works if you are using the 'nvidia' module; currently you are using nouveau.

Did you install the kernel headers (e.g. linux-zen-headers)? I see you installed nvidia-dkms, which needs the headers to build the module.
https://wiki.archlinux.org/title/Dynami … le_Support
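
A minimal sketch for checking which kernel driver is bound to the NVIDIA card, assuming `lspci -k` output in the format shown in post #1. The function name nvidia_driver_in_use is a hypothetical helper, not part of any package.

```shell
#!/bin/sh
# Hypothetical helper: read `lspci -k` output on stdin and print the
# kernel driver bound to any NVIDIA device.
nvidia_driver_in_use() {
    awk '
        # A device entry starts at column 0 with its hex bus address.
        /^[0-9a-f]/ { in_nvidia = ($0 ~ /NVIDIA/) }
        # Detail lines are indented; only report those under an NVIDIA entry.
        in_nvidia && /Kernel driver in use:/ { print $NF }
    '
}

# Usage on a real system:
#   lspci -k | nvidia_driver_in_use
```

If this prints nouveau (or nothing), nvidia-smi cannot talk to the proprietary module, which matches the error in post #1.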

#3 2022-10-19 04:05:17

hjmu
Member
Registered: 2022-10-11
Posts: 6

Re: NVIDIA-SMI has failed because it couldn't communicate[Solved]

Well, here comes something really interesting...
I installed linux-zen-headers yesterday, BEFORE I posted the previous question:

@arch ~]$ pacman -Qs linux-zen-headers
local/linux-zen-headers 6.0.2.zen1-1
    Headers and scripts for building modules for the Linux ZEN kernel

nvidia-smi still did not work, even after a shutdown and reboot. But today I woke up and found that it works:

@arch ~]$ nvidia-smi
Wed Oct 19 11:54:48 2022       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 520.56.06    Driver Version: 520.56.06    CUDA Version: 11.8     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ...  On   | 00000000:01:00.0 Off |                  N/A |
| N/A   48C    P8     5W /  N/A |      1MiB /  8192MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

That is like a miracle. But it comes with a side effect: I cannot use an external monitor because xrandr does not detect it:

@arch ~]$ xrandr
Screen 0: minimum 320 x 200, current 1920 x 1080, maximum 16384 x 16384
eDP-1 connected primary 1920x1080+0+0 (normal left inverted right x axis y axis) 382mm x 215mm
   1920x1080     60.05*+  60.01    59.97    59.96    59.93  
   1680x1050     59.95    59.88  
   1400x1050     59.98  
   1600x900      59.99    59.94    59.95    59.82  
   1280x1024     60.02  
   1400x900      59.96    59.88  
   1280x960      60.00  
   1440x810      60.00    59.97  
   1368x768      59.88    59.85  
   1280x800      59.99    59.97    59.81    59.91  
   1280x720      60.00    59.99    59.86    59.74  
   1024x768      60.04    60.00  
   960x720       60.00  
   928x696       60.05  
   896x672       60.01  
   1024x576      59.95    59.96    59.90    59.82  
   960x600       59.93    60.00  
   960x540       59.96    59.99    59.63    59.82  
   800x600       60.00    60.32    56.25  
   840x525       60.01    59.88  
   864x486       59.92    59.57  
   700x525       59.98  
   800x450       59.95    59.82  
   640x512       60.02  
   700x450       59.96    59.88  
   640x480       60.00    59.94  
   720x405       59.51    58.99  
   684x384       59.88    59.85  
   640x400       59.88    59.98  
   640x360       59.86    59.83    59.84    59.32  
   512x384       60.00  
   512x288       60.00    59.92  
   480x270       59.63    59.82  
   400x300       60.32    56.34  
   432x243       59.92    59.57  
   320x240       60.05  
   360x202       59.51    59.13  
   320x180       59.84    59.32

neofetch also reports a different resolution than it did yesterday:

@arch ~]$ neofetch
                   -`                    myname@arch 
                  .o+`                   ------------- 
                 `ooo/                   OS: Arch Linux x86_64 
                `+oooo:                  Host: G7 7700 
               `+oooooo:                 Kernel: 6.0.2-zen1-1-zen 
               -+oooooo+:                Uptime: 14 mins 
             `/:-:++oooo+:               Packages: 1321 (pacman), 10 (flatpak) 
            `/++++/+++++++:              Shell: bash 5.1.16 
           `/++++++++++++++:             Resolution: 1920x1080 
          `/+++ooooooooooooo/`           DE: Plasma 5.26.1 
         ./ooosssso++osssssso+`          WM: KWin 
        .oossssso-````/ossssss+`         Theme: [Plasma], Breeze [GTK2/3] 
       -osssssso.      :ssssssso.        Icons: Adwaita [Plasma], Adwaita [GTK2/3] 
      :osssssss/        osssso+++.       Terminal: gnome-terminal 
     /ossssssss/        +ssssooo/-       CPU: Intel i7-10750H (12) @ 5.000GHz 
   `/ossssso+/:-        -:/+osssso+-     GPU: NVIDIA GeForce RTX 2070 Mobile / Max-Q Refresh 
  `+sso+:-`                 `.-/+oso:    GPU: Intel CometLake-H GT2 [UHD Graphics] 
 `++:.                           `-/+/   Memory: 3264MiB / 15631MiB

lspci now shows the nvidia driver in use:

@arch ~]$ lspci -k | grep -A 2 -E "(VGA|3D)"
00:02.0 VGA compatible controller: Intel Corporation CometLake-H GT2 [UHD Graphics] (rev 05)
	DeviceName: Onboard - Video
	Subsystem: Dell Device 09f9
--
01:00.0 VGA compatible controller: NVIDIA Corporation TU106M [GeForce RTX 2070 Mobile / Max-Q Refresh] (rev a1)
	Subsystem: Dell Device 09f9
	Kernel driver in use: nvidia

So the old problem is solved, but a new one has appeared. Should I start a new thread and mark this one as solved, or can you help me with the new problem here?

#4 2022-10-19 06:36:38

seth
Member
From: Won't reply 2 private help req
Registered: 2012-09-03
Posts: 76,464

Re: NVIDIA-SMI has failed because it couldn't communicate[Solved]

There's no miracle: installing the headers allowed nvidia-dkms to build the module for that kernel, and you ended up using it after the reboot.
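
A rough way to check whether headers are present for a given kernel, as a sketch only. It assumes the headers package provides a `build` directory under the kernel's modules tree (here defaulting to /usr/lib/modules), and has_kernel_headers is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: succeed if a headers "build" directory exists for
# the given kernel release (default: the running kernel).
# The second argument overrides the modules root; /usr/lib/modules is an
# assumption about where the headers package installs.
has_kernel_headers() {
    release=${1:-$(uname -r)}
    root=${2:-/usr/lib/modules}
    [ -d "$root/$release/build" ]
}

# Usage on a real system:
#   has_kernel_headers && echo "headers found" || echo "headers missing"
#   dkms status    # lists which modules are built for which kernels
```

If the headers are missing, DKMS has nothing to build against, so no nvidia module exists for that kernel and nouveau stays bound to the card.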

The other output isn't available because this is an Optimus system and the nvidia driver doesn't seem to be available or used for X11.
See https://wiki.archlinux.org/title/PRIME and post your xorg log, https://wiki.archlinux.org/title/Xorg#General
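
To see at a glance which outputs X currently reports as connected, something like the following may help. It is a sketch that assumes xrandr output in the format shown in post #3, and connected_outputs is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: read xrandr output on stdin and print the name of
# every output whose second field is exactly "connected".
connected_outputs() {
    awk '$2 == "connected" { print $1 }'
}

# Usage on a real system:
#   xrandr | connected_outputs
#   xrandr --listproviders    # shows which providers X knows about
```

With the output from post #3 this would list only eDP-1, i.e. the internal panel and no external monitor.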
