You are not logged in.

#1 2022-01-22 12:17:04

jscob
Member
Registered: 2022-01-22
Posts: 1

NVIDIA driver failing to load for eGPU

Hi, I'm trying to get an RTX 3070 in a Razer Core X enclosure to work over thunderbolt using the proprietary graphics driver (nvidia-495.46-10) but running into issues when attempting to use it
The laptop I am using it with already has a mobile nvidia card, I am unsure how to disable this to rule it out, there doesn't appear to be any option in the bios
Thunderbolt enclosure is running latest firmware (33.00), laptop is an ASUS ROG Zephyrus M16 GU603HM with most recent firmware
To rule out any hardware issues I have tested the same setup in Windows

Journal output after connecting the enclosure and then running "nvidia-smi" after a few seconds:

Jan 22 12:18:17 laptop kernel: pcieport 0000:00:07.0: pciehp: Slot(0): Card present
Jan 22 12:18:17 laptop kernel: pcieport 0000:00:07.0: pciehp: Slot(0): Link Up
Jan 22 12:18:17 laptop kernel: asus_wmi: Unknown key 7b pressed
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: [8086:15da] type 01 class 0x060400
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: enabling Extended Tags
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: supports D1 D2
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: PME# supported from D0 D1 D2 D3hot D3cold
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: 8.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x4 link at 0000:00:07.0 (capable of 31.504 Gb/s with 8.0 GT/s PCIe x4 link)
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: Adding to iommu group 23
Jan 22 12:18:17 laptop kernel: pcieport 0000:00:07.0: ASPM: current common clock configuration is inconsistent, reconfiguring
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: bridge configuration invalid ([bus 00-00]), reconfiguring
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: [8086:15da] type 01 class 0x060400
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: enabling Extended Tags
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: supports D1 D2
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: PME# supported from D0 D1 D2 D3hot D3cold
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: Adding to iommu group 24
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: PCI bridge to [bus 04-2d]
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0:   bridge window [io  0x0000-0x0fff]
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0:   bridge window [mem 0x00000000-0x000fffff]
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0:   bridge window [mem 0x00000000-0x000fffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: bridge configuration invalid ([bus 00-00]), reconfiguring
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: [10de:2484] type 00 class 0x030000
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: reg 0x10: [mem 0x00000000-0x00ffffff]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: reg 0x14: [mem 0x00000000-0x0fffffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: reg 0x1c: [mem 0x00000000-0x01ffffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: reg 0x24: [io  0x0000-0x007f]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: reg 0x30: [mem 0x00000000-0x0007ffff pref]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: PME# supported from D0 D3hot
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: 8.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x4 link at 0000:00:07.0 (capable of 252.048 Gb/s with 16.0 GT/s PCIe x16 link)
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: vgaarb: VGA device added: decodes=io+mem,owns=none,locks=none
Jan 22 12:18:17 laptop kernel: i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=io+mem
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: Adding to iommu group 24
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.1: [10de:228b] type 00 class 0x040300
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.1: reg 0x10: [mem 0x00000000-0x00003fff]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.1: Adding to iommu group 24
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: PCI bridge to [bus 05-2d]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0:   bridge window [io  0x0000-0x0fff]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0:   bridge window [mem 0x00000000-0x000fffff]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0:   bridge window [mem 0x00000000-0x000fffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pci_bus 0000:05: busn_res: [bus 05-2d] end is updated to 05
Jan 22 12:18:17 laptop kernel: pci_bus 0000:04: busn_res: [bus 04-2d] end is updated to 05
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: BAR 15: assigned [mem 0x6210000000-0x622bffffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: BAR 14: assigned [mem 0x76000000-0x821fffff]
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: BAR 13: assigned [io  0x6000-0x6fff]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: BAR 15: assigned [mem 0x6210000000-0x622bffffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: BAR 14: assigned [mem 0x76000000-0x821fffff]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: BAR 13: assigned [io  0x6000-0x6fff]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: BAR 1: assigned [mem 0x6210000000-0x621fffffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: BAR 3: assigned [mem 0x6220000000-0x6221ffffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: BAR 0: assigned [mem 0x76000000-0x76ffffff]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: BAR 6: assigned [mem 0x77000000-0x7707ffff pref]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.1: BAR 0: assigned [mem 0x77080000-0x77083fff]
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.0: BAR 5: assigned [io  0x6000-0x607f]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0: PCI bridge to [bus 05]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0:   bridge window [io  0x6000-0x6fff]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0:   bridge window [mem 0x76000000-0x821fffff]
Jan 22 12:18:17 laptop kernel: pci 0000:04:01.0:   bridge window [mem 0x6210000000-0x622bffffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0: PCI bridge to [bus 04-05]
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0:   bridge window [io  0x6000-0x6fff]
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0:   bridge window [mem 0x76000000-0x821fffff]
Jan 22 12:18:17 laptop kernel: pci 0000:03:00.0:   bridge window [mem 0x6210000000-0x622bffffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pcieport 0000:00:07.0: PCI bridge to [bus 03-2d]
Jan 22 12:18:17 laptop kernel: pcieport 0000:00:07.0:   bridge window [io  0x6000-0x6fff]
Jan 22 12:18:17 laptop kernel: pcieport 0000:00:07.0:   bridge window [mem 0x76000000-0x821fffff]
Jan 22 12:18:17 laptop kernel: pcieport 0000:00:07.0:   bridge window [mem 0x6210000000-0x622bffffff 64bit pref]
Jan 22 12:18:17 laptop kernel: pcieport 0000:03:00.0: enabling device (0000 -> 0003)
Jan 22 12:18:17 laptop kernel: pcieport 0000:04:01.0: enabling device (0000 -> 0003)
Jan 22 12:18:17 laptop kernel: nvidia 0000:05:00.0: enabling device (0000 -> 0003)
Jan 22 12:18:17 laptop kernel: nvidia 0000:05:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
Jan 22 12:18:17 laptop kernel: pci 0000:05:00.1: D0 power state depends on 0000:05:00.0
Jan 22 12:18:17 laptop kernel: snd_hda_intel 0000:05:00.1: enabling device (0000 -> 0002)
Jan 22 12:18:17 laptop kernel: snd_hda_intel 0000:05:00.1: Disabling MSI
Jan 22 12:18:17 laptop kernel: snd_hda_intel 0000:05:00.1: Handle vga_switcheroo audio client
Jan 22 12:18:17 laptop kernel: input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:07.0/0000:03:00.0/0000:04:01.0/0000:05:00.1/sound/card2/input78
Jan 22 12:18:17 laptop kernel: input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:07.0/0000:03:00.0/0000:04:01.0/0000:05:00.1/sound/card2/input79
Jan 22 12:18:17 laptop kernel: input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:07.0/0000:03:00.0/0000:04:01.0/0000:05:00.1/sound/card2/input80
Jan 22 12:18:17 laptop kernel: input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:07.0/0000:03:00.0/0000:04:01.0/0000:05:00.1/sound/card2/input81
Jan 22 12:18:17 laptop kernel: input: HDA NVidia HDMI/DP,pcm=10 as /devices/pci0000:00/0000:00:07.0/0000:03:00.0/0000:04:01.0/0000:05:00.1/sound/card2/input82
Jan 22 12:18:17 laptop kernel: input: HDA NVidia HDMI/DP,pcm=11 as /devices/pci0000:00/0000:00:07.0/0000:03:00.0/0000:04:01.0/0000:05:00.1/sound/card2/input83
Jan 22 12:18:17 laptop kernel: input: HDA NVidia HDMI/DP,pcm=12 as /devices/pci0000:00/0000:00:07.0/0000:03:00.0/0000:04:01.0/0000:05:00.1/sound/card2/input84
Jan 22 12:18:23 laptop boltd[573]: [0064a029-dcfa-Core X                     ] parent is 40930d9d-524b...
Jan 22 12:18:23 laptop boltd[573]: [0064a029-dcfa-Core X                     ] connected: authorized (/sys/devices/pci0000:00/0000:00:0d.2/domain0/0-0/0-1)
Jan 22 12:18:23 laptop kernel: thunderbolt 0-1: new device found, vendor=0x127 device=0x1
Jan 22 12:18:23 laptop kernel: thunderbolt 0-1: Razer Core X
Jan 22 12:18:23 laptop boltd[573]: [0064a029-dcfa-Core X                     ] udev: device changed: authorized -> authorized
Jan 22 12:18:32 laptop kernel: NVRM: GPU 0000:05:00.0: RmInitAdapter failed! (0x26:0x56:1479)
Jan 22 12:18:32 laptop kernel: NVRM: GPU 0000:05:00.0: rm_init_adapter failed, device minor number 1
Jan 22 12:18:33 laptop kernel: NVRM: GPU 0000:05:00.0: RmInitAdapter failed! (0x26:0x56:1479)
Jan 22 12:18:33 laptop kernel: NVRM: GPU 0000:05:00.0: rm_init_adapter failed, device minor number 1

"lspci -k" output

01:00.0 VGA compatible controller: NVIDIA Corporation GA106M [GeForce RTX 3060 Mobile / Max-Q] (rev a1)
	DeviceName: VGA
	Subsystem: ASUSTeK Computer Inc. Device 130c
	Kernel driver in use: nvidia
	Kernel modules: nouveau, nvidia_drm, nvidia
01:00.1 Audio device: NVIDIA Corporation Device 228e (rev a1)
	Subsystem: ASUSTeK Computer Inc. Device 130c
	Kernel driver in use: snd_hda_intel
	Kernel modules: snd_hda_intel
<snip>
05:00.0 VGA compatible controller: NVIDIA Corporation GA104 [GeForce RTX 3070] (rev a1)
	Subsystem: NVIDIA Corporation GA104 [GeForce RTX 3070]
	Kernel driver in use: nvidia
	Kernel modules: nouveau, nvidia_drm, nvidia
05:00.1 Audio device: NVIDIA Corporation GA104 High Definition Audio Controller (rev a1)
	Subsystem: NVIDIA Corporation Device 146b
	Kernel driver in use: snd_hda_intel
	Kernel modules: snd_hda_intel

Any ideas appreciated

Last edited by jscob (2022-01-22 12:20:05)

Offline

#2 2022-09-22 09:24:36

minerscale
Member
Registered: 2022-09-22
Posts: 1

Re: NVIDIA driver failing to load for eGPU

Hi jscob,

Sorry for the necro, but I'm having the exact same problem right now and I would like to know if you worked it out. It just randomly started for me yesterday on a Razer Core X with a 3070ti.

Any insight is greatly appreciated

Update: Huzzah! I fixed it! I updated my BIOS. It randomly stopped working, I updated my BIOS, it fixed the problem. I have no idea what the problem was. I will never know. At least it's fixed now.

Last edited by minerscale (2022-09-22 10:32:56)

Offline

Board footer

Powered by FluxBB