You are not logged in.

#1 2022-04-05 16:40:03

Kokos Bot
Member
Registered: 2020-06-20
Posts: 9

modprobe nvidia fails, can't change power state ...

Hello,

I want to enable the secondary GPU of my HP ZBook Fury G8 laptop to use it with cuda and tensorflow, but I cannot load the kernel module.

The inbuild and the secondary GPU.

$ lspci -v
...
00:02.0 VGA compatible controller: Intel Corporation TigerLake-H GT1 [UHD Graphics] (rev 01) (prog-if 00 [VGA controller])
        DeviceName: Onboard IGD
        Subsystem: Hewlett-Packard Company Device 8870
        Flags: bus master, fast devsel, latency 0, IRQ 187, IOMMU group 2
        Memory at 614c000000 (64-bit, non-prefetchable) [size=16M]
        Memory at 4000000000 (64-bit, prefetchable) [size=256M]
        I/O ports at 4000 [size=64]
        Expansion ROM at 000c0000 [virtual] [disabled] [size=128K]
        Capabilities: <access denied>
        Kernel driver in use: i915
        Kernel modules: i915
...
01:00.0 VGA compatible controller: NVIDIA Corporation GA107GLM [RTX A2000 Mobile] (rev ff) (prog-if ff)
        !!! Unknown header type 7f
        Kernel modules: nouveau, nvidia_drm, nvidia
...

I updated my system and installed the nvidia driver.
After that I tried to load the kernel module with no success.

# modprobe nvidia
modprobe: ERROR: could not insert 'nvidia': No such device

This is the output in the kernel log:

Apr 05 17:53:12 mw-1 kernel: nvidia: module license 'NVIDIA' taints kernel.
Apr 05 17:53:12 mw-1 kernel: Disabling lock debugging due to kernel taint
Apr 05 17:53:12 mw-1 kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 508
Apr 05 17:53:12 mw-1 kernel: 
Apr 05 17:53:12 mw-1 kernel: nvidia 0000:01:00.0: can't change power state from D3cold to D0 (config space inaccessible)
Apr 05 17:53:12 mw-1 kernel: nvidia 0000:01:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
Apr 05 17:53:12 mw-1 kernel: NVRM: The NVIDIA GPU 0000:01:00.0
                             NVRM: (PCI ID: 10de:25b8) installed in this system has
                             NVRM: fallen off the bus and is not responding to commands.
Apr 05 17:53:12 mw-1 kernel: nvidia: probe of 0000:01:00.0 failed with error -1
Apr 05 17:53:12 mw-1 kernel: NVRM: The NVIDIA probe routine failed for 1 device(s).
Apr 05 17:53:12 mw-1 kernel: NVRM: None of the NVIDIA devices were initialized.
Apr 05 17:53:12 mw-1 kernel: nvidia-nvlink: Unregistered the Nvlink Core, major device number 508

The power state error occurs when I want to toggle the state with bbswitch, too.

I read the troubleshooting part of the wiki and read some forum posts here, but could not help myself.
In the nvidia forum I found this post, but the presented fix there did not work on my system. The solution fails because of the modprobe nvidia problem, too,

I would appreciate a hint
Greeting Kokos Bot

Offline

Board footer

Powered by FluxBB