You are not logged in.

#1 2024-03-07 23:46:25

b0ss_
Member
Registered: 2021-01-16
Posts: 83

[SOLVED!] Can't load AMD drivers in APU+GPU setup

Hi.

Yesterday I got myself an RX580 and I installed it into an M720s Thinkcentre.  I swapped the case and the power supply.

I tried to boot into my arch install but it didn't work. So, I thought that maybe I had to reinstall it, as it was configured to run on the APU, which in retrospect doesn't make a lot of sense.

The system hung at "Loading initial ramdisk". I tried a lot of stuff, and the kernel parameter "acpi=off" worked. But now, the "amdgpu" module was not being loaded. Not for the RX580 nor for the Ryzen 2200G. It still displays a picture, but a very slow one.

I've tried configuring the mkinitcpio.conf file to load amdgpu, amdgpu AND radeon, radeon alone, but nothing worked. Now it only "loads" the AMDGPU driver (it doesn't actually load it though). I tried to modprobe -f amdpu, modprobe amdgpu but neither worked.

I also tried installing the amdgpu-pro drivers, but they didn't work either. I also tried some kernel parameters, such as the ones in the wiki to enable SI and CIK support (although the RX580 arch is GCN4...). to no avail. Also tried "amdgpu.dc=0", no luck.

I also tried to load a different OS. Linux Mint, to be precise. It threw an error related to "VGACON", but I wasn't able to reproduce it on Arch. It didn't boot without compatibility kernel options.

Dmesg: https://termbin.com/egds

mkinitcpio.conf

MODULES=(amdgpu)
BINARIES=()
FILES=()
HOOKS=(base udev autodetect keyboard keymap modconf block filesystems fsck microcode)

/etc/default/grub

# GRUB boot loader configuration

GRUB_DEFAULT=0
GRUB_TIMEOUT=5
GRUB_DISTRIBUTOR="Arch"
GRUB_CMDLINE_LINUX_DEFAULT="loglevel=3 quiet acpi=off"
GRUB_CMDLINE_LINUX="zswap.enabled=0 rootfstype=ext4"

# Preload both GPT and MBR modules so that they are not missed
GRUB_PRELOAD_MODULES="part_gpt part_msdos"

# Uncomment to enable booting from LUKS encrypted devices
#GRUB_ENABLE_CRYPTODISK=y

# Set to 'countdown' or 'hidden' to change timeout behavior,
# press ESC key to display menu.
GRUB_TIMEOUT_STYLE=menu

# Uncomment to use basic console
GRUB_TERMINAL_INPUT=console

# Uncomment to disable graphical terminal
#GRUB_TERMINAL_OUTPUT=console

# The resolution used on graphical terminal
# note that you can use only modes which your graphic card supports via VBE
# you can see them in real GRUB with the command `videoinfo'
GRUB_GFXMODE=auto

# Uncomment to allow the kernel use the same resolution used by grub
GRUB_GFXPAYLOAD_LINUX=keep

# Uncomment if you want GRUB to pass to the Linux kernel the old parameter
# format "root=/dev/xxx" instead of "root=/dev/disk/by-uuid/xxx"
#GRUB_DISABLE_LINUX_UUID=true

# Uncomment to disable generation of recovery mode menu entries
GRUB_DISABLE_RECOVERY=true

# Uncomment and set to the desired menu colors.  Used by normal and wallpaper
# modes only.  Entries specified as foreground/background.
#GRUB_COLOR_NORMAL="light-blue/black"
#GRUB_COLOR_HIGHLIGHT="light-cyan/blue"

# Uncomment one of them for the gfx desired, a image background or a gfxtheme
#GRUB_BACKGROUND="/path/to/wallpaper"
#GRUB_THEME="/path/to/gfxtheme"

# Uncomment to get a beep at GRUB start
#GRUB_INIT_TUNE="480 440 1"

# Uncomment to make GRUB remember the last selection. This requires
# setting 'GRUB_DEFAULT=saved' above.
#GRUB_SAVEDEFAULT=true

# Uncomment to disable submenus in boot menu
#GRUB_DISABLE_SUBMENU=y

# Probing for other operating systems is disabled for security reasons. Read
# documentation on GRUB_DISABLE_OS_PROBER, if still want to enable this
# functionality install os-prober and uncomment to detect and include other
# operating systems.
#GRUB_DISABLE_OS_PROBER=false

This are my specs:

$ inxi -FxxxZ
System:
  Host: home Kernel: 6.7.8-arch1-1 arch: x86_64 bits: 64 compiler: gcc
    v: 13.2.1 clocksource: tsc
  Desktop: KDE Plasma v: 6.0.1 tk: Qt v: N/A wm: kwin_wayland vt: 2 dm: SDDM
    Distro: Arch Linux
Machine:
  Type: Desktop System: LENOVO product: 10MBS02D00 v: Lenovo Product
    serial: <superuser required> Chassis: type: 3 serial: <superuser required>
  Mobo: LENOVO model: 3141 v: SDK0K17763 WIN 1801944579810
    serial: <superuser required>
    part-nu: LENOVO_MT_10MB_BU_Lenovo_FM_Lenovo Product
    uuid: <superuser required> UEFI: LENOVO v: M25KT5AA date: 08/31/2020
CPU:
  Info: single core model: AMD Ryzen 3 PRO 2200G with Radeon Vega Graphics
    bits: 64 type: UP smt: <unsupported> arch: Zen rev: 0 cache: L1: 96 KiB
    L2: 512 KiB L3: 4 MiB
  Speed (MHz): 3493 min/max: N/A core: 1: 3493 bogomips: 6989
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
Graphics:
  Device-1: AMD Polaris 20 XL [Radeon RX 580 2048SP] driver: N/A arch: GCN-4
    pcie: speed: 8 GT/s lanes: 8 bus-ID: 01:00.0 chip-ID: 1002:6fdf
    class-ID: 0300
  Display: wayland server: X.org v: 1.21.1.11 with: Xwayland v: 23.2.4
    compositor: kwin_wayland driver: X: loaded: modesetting
    alternate: fbdev,vesa gpu: N/A display-ID: 0
  Monitor-1: Unknown-1 res: 1360x768 size: N/A modes: N/A
  API: EGL v: 1.5 platforms: device: 0 drv: swrast gbm: drv: kms_swrast
    surfaceless: drv: swrast wayland: drv: swrast x11: drv: swrast
  API: OpenGL v: 4.5 vendor: mesa v: 24.0.2-arch1.2 glx-v: 1.4
    direct-render: yes renderer: llvmpipe (LLVM 17.0.6 256 bits)
    device-ID: ffffffff:ffffffff display-ID: :0.0
  API: Vulkan Message: No Vulkan data available.
Audio:
  Device-1: AMD Ellesmere HDMI Audio [Radeon RX 470/480 / 570/580/590]
    driver: snd_hda_intel v: kernel pcie: speed: 8 GT/s lanes: 8 bus-ID: 01:00.1
    chip-ID: 1002:aaf0 class-ID: 0403
  Device-2: AMD ACP/ACP3X/ACP6x Audio Coprocessor driver: snd_pci_acp3x
    v: kernel pcie: speed: 8 GT/s lanes: 16 bus-ID: 0b:00.5 chip-ID: 1022:15e2
    class-ID: 0480
  Device-3: AMD Family 17h/19h HD Audio vendor: Lenovo driver: snd_hda_intel
    v: kernel pcie: speed: 8 GT/s lanes: 16 bus-ID: 0b:00.6 chip-ID: 1022:15e3
    class-ID: 0403
  API: ALSA v: k6.7.8-arch1-1 status: kernel-api
  Server-1: PipeWire v: 1.0.3 status: active with: 1: pipewire-pulse
    status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin
    4: pw-jack type: plugin
Network:
  Device-1: Qualcomm Atheros QCA6174 802.11ac Wireless Network Adapter
    vendor: Lenovo driver: ath10k_pci v: kernel pcie: speed: 2.5 GT/s lanes: 1
    bus-ID: 07:00.0 chip-ID: 168c:003e class-ID: 0280
  IF: wlan0 state: down mac: 52:34:12:2f:3b:94
  Device-2: Realtek RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet
    vendor: Lenovo RTL8111/8168/8411 driver: r8169 v: kernel pcie:
    speed: 2.5 GT/s lanes: 1 port: ec00 bus-ID: 0a:00.0 chip-ID: 10ec:8168
    class-ID: 0200
  IF: enp10s0f0 state: down mac: 00:d8:61:8d:0d:b3
  IF-ID-1: enp11s0f3u3 state: unknown speed: -1 duplex: half
    mac: 3e:89:c4:06:83:ab
Bluetooth:
  Device-1: Samsung Galaxy series misc. (tethering mode) driver: rndis_host
    v: kernel type: USB rev: 2.0 speed: 480 Mb/s lanes: 1 bus-ID: 4-3:4
    chip-ID: 04e8:6863 class-ID: 0a00 serial: R92TB0XPQRN
  Report: This feature requires one of these tools: hciconfig/bt-adapter
  Device-2: Qualcomm Atheros QCA61x4 Bluetooth 4.0 driver: btusb v: 0.8
    type: USB rev: 2.0 speed: 12 Mb/s lanes: 1 bus-ID: 4-4:3 chip-ID: 0cf3:e300
    class-ID: e001
  Report: ID: hci0 rfk-id: 0 state: down bt-service: disabled rfk-block:
    hardware: no software: no address: N/A
Drives:
  Local Storage: total: 1.13 TiB used: 442.95 GiB (38.3%)
  ID-1: /dev/sda vendor: Gigabyte model: GP-GSTFS31240GNTD size: 223.57 GiB
    speed: 6.0 Gb/s tech: SSD serial: SN234008923716 fw-rev: 61.5 scheme: GPT
  ID-2: /dev/sdb vendor: Western Digital model: WD10EZEX-60WN4A0
    size: 931.51 GiB speed: 6.0 Gb/s tech: HDD rpm: 7200 serial: WD-WCC6Y6TDFJ46
    fw-rev: 1A01 scheme: GPT
  ID-3: /dev/sdc vendor: Realtek model: Driver size: 7.8 MiB type: USB
    rev: 2.0 spd: 480 Mb/s lanes: 1 tech: N/A serial: Storage fw-rev: 1.00
Partition:
  ID-1: / size: 218.51 GiB used: 12.14 GiB (5.6%) fs: ext4 dev: /dev/sda2
  ID-2: /boot size: 511 MiB used: 220.8 MiB (43.2%) fs: vfat dev: /dev/sda1
  ID-3: /home size: 915.82 GiB used: 430.6 GiB (47.0%) fs: ext4
    dev: /dev/sdb1
Swap:
  ID-1: swap-1 type: zram size: 4 GiB used: 0 KiB (0.0%) priority: 100
    dev: /dev/zram0
Sensors:
  System Temperatures: cpu: 59.2 C mobo: N/A
  Fan Speeds (rpm): N/A
Info:
  Memory: total: 32 GiB available: 31.28 GiB used: 2.44 GiB (7.8%)
  Processes: 155 Power: uptime: 5m states: freeze,mem,disk suspend: s2idle
    wakeups: 0 hibernate: shutdown Init: systemd v: 255 default: graphical
  Packages: pm: pacman pkgs: 951 Compilers: gcc: 13.2.1 Shell: Bash
    v: 5.2.26 running-in: kitty inxi: 3.3.33

Btw: I have no idea how I'm running Plasma Wayland. My session usually just crashed back into lydm when I tried to log into Hyprland. Who knows!

Edit: forgot to post a couple things:

$ pacman -Qn | grep amd
amd-ucode 20240220.97b693d2-1
xf86-video-amdgpu 23.0.0-1
$ pacman -Qm | grep amd
(nothing)
$ pacman -Qn | grep radeon
radeontool 1.6.3-4
radeontop 1.4-2
vulkan-radeon 1:24.0.2-2
$ vulkaninfo
ERROR: [Loader Message] Code 0 : setup_loader_term_phys_devs:  Failed to detect any valid GPUs in the current config
ERROR at /usr/src/debug/vulkan-tools/Vulkan-Tools-1.3.269/vulkaninfo/./vulkaninfo.h:237:vkEnumeratePhysicalDevices failed with ERROR_INITIALIZATION_FAILED
$ sudo modprobe amdgpu
[sudo] contraseña para b0ss: 
modprobe: ERROR: could not insert 'amdgpu': No such device
$ lspci -k
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Root Complex
	Subsystem: Lenovo Raven/Raven2 Root Complex
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 IOMMU
	Subsystem: Lenovo Raven/Raven2 IOMMU
00:01.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
00:01.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 PCIe GPP Bridge [6:0]
	Subsystem: Lenovo Raven/Raven2 PCIe GPP Bridge [6:0]
	Kernel driver in use: pcieport
00:01.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 PCIe GPP Bridge [6:0]
	Subsystem: Lenovo Raven/Raven2 PCIe GPP Bridge [6:0]
	Kernel driver in use: pcieport
00:08.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
00:08.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Internal PCIe GPP Bridge 0 to Bus A
	Subsystem: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Internal PCIe GPP Bridge 0 to Bus A
	Kernel driver in use: pcieport
00:08.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Internal PCIe GPP Bridge 0 to Bus B
	Subsystem: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Internal PCIe GPP Bridge 0 to Bus B
	Kernel driver in use: pcieport
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 61)
	Subsystem: Lenovo FCH SMBus Controller
	Kernel driver in use: piix4_smbus
	Kernel modules: i2c_piix4, sp5100_tco
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
	Subsystem: Lenovo FCH LPC Bridge
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 3
	Kernel driver in use: k10temp
	Kernel modules: k10temp
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 5
00:18.6 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 6
00:18.7 Host bridge: Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 7
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Polaris 20 XL [Radeon RX 580 2048SP] (rev ef)
	Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Polaris 20 XL [Radeon RX 580 2048SP]
	Kernel modules: amdgpu
01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere HDMI Audio [Radeon RX 470/480 / 570/580/590]
	Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere HDMI Audio [Radeon RX 470/480 / 570/580/590]
	Kernel driver in use: snd_hda_intel
	Kernel modules: snd_hda_intel
02:00.0 USB controller: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset USB 3.1 xHCI Compliant Host Controller (rev 01)
	Subsystem: Lenovo 400 Series Chipset USB 3.1 xHCI Compliant Host Controller
	Kernel driver in use: xhci_hcd
	Kernel modules: xhci_pci
02:00.1 SATA controller: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset SATA Controller (rev 01)
	Subsystem: Lenovo 400 Series Chipset SATA Controller
	Kernel driver in use: ahci
02:00.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset PCIe Bridge (rev 01)
	Subsystem: Lenovo 400 Series Chipset PCIe Bridge
	Kernel driver in use: pcieport
03:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset PCIe Port (rev 01)
	DeviceName: Broadcom 5762
	Subsystem: Lenovo 400 Series Chipset PCIe Port
	Kernel driver in use: pcieport
03:01.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset PCIe Port (rev 01)
	Subsystem: Lenovo 400 Series Chipset PCIe Port
	Kernel driver in use: pcieport
03:04.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset PCIe Port (rev 01)
	Subsystem: Lenovo 400 Series Chipset PCIe Port
	Kernel driver in use: pcieport
03:05.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset PCIe Port (rev 01)
	Subsystem: ASMedia Technology Inc. 400 Series Chipset PCIe Port
	Kernel driver in use: pcieport
03:06.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset PCIe Port (rev 01)
	Subsystem: ASMedia Technology Inc. 400 Series Chipset PCIe Port
	Kernel driver in use: pcieport
03:07.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset PCIe Port (rev 01)
	Subsystem: ASMedia Technology Inc. 400 Series Chipset PCIe Port
	Kernel driver in use: pcieport
07:00.0 Network controller: Qualcomm Atheros QCA6174 802.11ac Wireless Network Adapter (rev 32)
	Subsystem: Lenovo QCA6174 802.11ac Wireless Network Adapter
	Kernel driver in use: ath10k_pci
	Kernel modules: ath10k_pci
08:00.0 PCI bridge: Integrated Technology Express, Inc. IT8893E PCIe to PCI Bridge (rev 41)
0a:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet Controller (rev 0e)
	Subsystem: Lenovo RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller
	Kernel driver in use: r8169
	Kernel modules: r8169
0a:00.1 Serial controller: Realtek Semiconductor Co., Ltd. RTL8111xP UART #1 (rev 0e)
	Subsystem: Lenovo RTL8111xP UART
	Kernel driver in use: serial
0a:00.2 Serial controller: Realtek Semiconductor Co., Ltd. RTL8111xP UART #2 (rev 0e)
	Subsystem: Lenovo RTL8111xP UART
	Kernel driver in use: serial
0a:00.3 IPMI Interface: Realtek Semiconductor Co., Ltd. RTL8111xP IPMI interface (rev 0e)
	Subsystem: Lenovo RTL8111xP IPMI interface
	Kernel modules: ipmi_si
0a:00.4 USB controller: Realtek Semiconductor Co., Ltd. RTL811x EHCI host controller (rev 0e)
	Subsystem: Lenovo RTL811x EHCI host controller
	Kernel driver in use: ehci-pci
0b:00.0 Non-Essential Instrumentation [1300]: Advanced Micro Devices, Inc. [AMD] Zeppelin/Raven/Raven2 PCIe Dummy Function (rev d5)
	Subsystem: Lenovo Zeppelin/Raven/Raven2 PCIe Dummy Function
0b:00.2 Encryption controller: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) Platform Security Processor
	Subsystem: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) Platform Security Processor
	Kernel driver in use: ccp
	Kernel modules: ccp
0b:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Raven USB 3.1
	Subsystem: Lenovo Raven USB 3.1
	Kernel driver in use: xhci_hcd
	Kernel modules: xhci_pci
0b:00.4 USB controller: Advanced Micro Devices, Inc. [AMD] Raven USB 3.1
	Subsystem: Lenovo Raven USB 3.1
	Kernel driver in use: xhci_hcd
	Kernel modules: xhci_pci
0b:00.5 Multimedia controller: Advanced Micro Devices, Inc. [AMD] ACP/ACP3X/ACP6x Audio Coprocessor
	Subsystem: Advanced Micro Devices, Inc. [AMD] ACP/ACP3X/ACP6x Audio Coprocessor
	Kernel driver in use: snd_pci_acp3x
	Kernel modules: snd_pci_acp3x, snd_rn_pci_acp3x, snd_pci_acp5x, snd_pci_acp6x, snd_acp_pci, snd_rpl_pci_acp6x, snd_pci_ps, snd_sof_amd_renoir, snd_sof_amd_rembrandt, snd_sof_amd_vangogh, snd_sof_amd_acp63
0b:00.6 Audio device: Advanced Micro Devices, Inc. [AMD] Family 17h/19h HD Audio Controller
	DeviceName: Realtek ALC898
	Subsystem: Lenovo Family 17h/19h HD Audio Controller
	Kernel driver in use: snd_hda_intel
	Kernel modules: snd_hda_intel
0c:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 61)
	Subsystem: Lenovo FCH SATA Controller [AHCI mode]
	Kernel driver in use: ahci
$ lsmod
Module                  Size  Used by
rndis_host             24576  0
cdc_ether              24576  1 rndis_host
usbnet                 65536  2 rndis_host,cdc_ether
mii                    16384  1 usbnet
snd_seq_dummy          12288  0
snd_hrtimer            12288  1
snd_seq               131072  7 snd_seq_dummy
snd_seq_device         16384  1 snd_seq
ath10k_pci             65536  0
ath10k_core           774144  1 ath10k_pci
ath                    40960  1 ath10k_core
intel_rapl_msr         20480  0
intel_rapl_common      40960  1 intel_rapl_msr
snd_sof_amd_acp63      16384  0
snd_sof_amd_vangogh    16384  0
snd_sof_amd_rembrandt    16384  0
kvm_amd               204800  0
snd_sof_amd_renoir     16384  0
snd_sof_amd_acp        69632  4 snd_sof_amd_rembrandt,snd_sof_amd_vangogh,snd_sof_amd_acp63,snd_sof_amd_renoir
snd_sof_pci            24576  4 snd_sof_amd_rembrandt,snd_sof_amd_vangogh,snd_sof_amd_acp63,snd_sof_amd_renoir
kvm                  1376256  1 kvm_amd
snd_sof_xtensa_dsp     16384  1 snd_sof_amd_acp
snd_sof               434176  2 snd_sof_amd_acp,snd_sof_pci
irqbypass              12288  1 kvm
crct10dif_pclmul       12288  1
snd_sof_utils          16384  1 snd_sof
crc32_pclmul           12288  0
snd_hda_codec_realtek   200704  1
snd_hda_codec_generic   114688  1 snd_hda_codec_realtek
polyval_clmulni        12288  0
snd_hda_codec_hdmi     94208  1
ledtrig_audio          12288  1 snd_hda_codec_generic
polyval_generic        12288  1 polyval_clmulni
snd_soc_core          462848  1 snd_sof
btusb                  86016  0
snd_hda_intel          65536  4
mac80211             1568768  1 ath10k_core
ccm                    20480  0
snd_compress           28672  1 snd_soc_core
snd_intel_dspcfg       40960  2 snd_hda_intel,snd_sof
algif_aead             12288  0
btrtl                  32768  1 btusb
crypto_null            16384  1 algif_aead
gf128mul               16384  1 polyval_generic
des3_ede_x86_64        45056  0
cbc                    12288  0
ac97_bus               12288  1 snd_soc_core
btintel                57344  1 btusb
snd_intel_sdw_acpi     16384  1 snd_intel_dspcfg
des_generic            12288  0
ghash_clmulni_intel    16384  0
snd_pcm_dmaengine      16384  1 snd_soc_core
libdes                 20480  2 des_generic,des3_ede_x86_64
sha512_ssse3           53248  0
algif_skcipher         12288  0
cmac                   12288  0
md4                    12288  0
btbcm                  24576  1 btusb
algif_hash             12288  0
af_alg                 36864  3 algif_hash,algif_skcipher,algif_aead
snd_pci_ps             28672  0
snd_rpl_pci_acp6x      16384  0
sha256_ssse3           32768  0
snd_hda_codec         225280  4 snd_hda_codec_generic,snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec_realtek
btmtk                  12288  1 btusb
joydev                 24576  0
mousedev               24576  0
snd_acp_pci            12288  0
libarc4                12288  1 mac80211
snd_hda_core          151552  5 snd_hda_codec_generic,snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec,snd_hda_codec_realtek
mac_hid                12288  0
sha1_ssse3             32768  0
snd_acp_legacy_common    16384  1 snd_acp_pci
aesni_intel           360448  0
snd_hwdep              20480  1 snd_hda_codec
bluetooth            1114112  6 btrtl,btmtk,btintel,btbcm,btusb
snd_pci_acp6x          20480  0
r8169                 114688  0
hid_generic            12288  0
crypto_simd            16384  1 aesni_intel
cfg80211             1343488  3 ath,mac80211,ath10k_core
snd_pcm               204800  13 snd_sof_amd_acp,snd_hda_codec_hdmi,snd_pci_acp6x,snd_hda_intel,snd_hda_codec,snd_sof,snd_compress,snd_soc_core,snd_sof_utils,snd_hda_core,snd_pci_ps,snd_pcm_dmaengine
snd_pci_acp5x          16384  0
snd_rn_pci_acp3x       24576  0
realtek                40960  1
snd_timer              53248  3 snd_seq,snd_hrtimer,snd_pcm
cryptd                 28672  2 crypto_simd,ghash_clmulni_intel
snd_acp_config         20480  9 snd_rn_pci_acp3x,snd_pci_acp6x,snd_pci_acp5x,snd_sof_amd_rembrandt,snd_sof_amd_vangogh,snd_sof_amd_acp63,snd_acp_pci,snd_pci_ps,snd_sof_amd_renoir
rapl                   20480  0
mdio_devres            12288  1 r8169
vfat                   20480  1
uas                    32768  0
snd_soc_acpi           16384  2 snd_sof_amd_acp,snd_acp_config
fat                   106496  1 vfat
ecdh_generic           16384  1 bluetooth
pcspkr                 12288  0
k10temp                16384  0
libphy                233472  3 r8169,mdio_devres,realtek
ipmi_devintf           16384  0
usb_storage            90112  1 uas
cdc_acm                49152  0
snd                   159744  22 snd_hda_codec_generic,snd_seq,snd_seq_device,snd_hda_codec_hdmi,snd_hwdep,snd_hda_intel,snd_hda_codec,snd_hda_codec_realtek,snd_sof,snd_timer,snd_compress,snd_soc_core,snd_pcm
rfkill                 40960  7 bluetooth,cfg80211
snd_pci_acp3x          16384  0
ipmi_msghandler        94208  1 ipmi_devintf
sp5100_tco             20480  0
ccp                   163840  1 kvm_amd
soundcore              16384  1 snd
i2c_piix4              32768  0
usbhid                 77824  0
pkcs8_key_parser       12288  0
i2c_dev                28672  0
crypto_user            20480  0
fuse                  212992  3
loop                   40960  0
dm_mod                225280  0
nfnetlink              20480  1
zram                   45056  2
ip_tables              36864  0
x_tables               69632  1 ip_tables
ext4                 1171456  2
crc32c_generic         12288  0
crc16                  12288  2 bluetooth,ext4
mbcache                16384  1 ext4
jbd2                  221184  1 ext4
crc32c_intel           16384  4
xhci_pci               28672  0
xhci_pci_renesas       24576  1 xhci_pci
amdxcp                 12288  0
i2c_algo_bit           20480  0
drm_ttm_helper         12288  0
ttm                   110592  1 drm_ttm_helper
drm_exec               12288  0
gpu_sched              65536  0
drm_suballoc_helper    12288  0
drm_buddy              20480  0
drm_display_helper    229376  0
cec                    86016  1 drm_display_helper

Last edited by b0ss_ (2024-03-11 00:22:01)

Offline

#2 2024-03-08 15:25:51

seth
Member
Registered: 2012-09-03
Posts: 64,222

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

https://bbs.archlinux.org/viewtopic.php … 3#p2155713

Although this machine did come with Win10 installed, I never used it. Do you think that would cause issues?

That counts as dualbooting, 3rd link below. Mandatory.
Disable it (it's NOT the BIOS setting!) and reboot windows and linux twice for voodo reasons.

It still displays a picture, but a very slow one.

[    0.124153] [drm] Initialized simpledrm 1.0.0 20200625 for simple-framebuffer.0 on minor 0
[    0.124786] simple-framebuffer simple-framebuffer.0: [drm] fb0: simpledrmdrmfb frame buffer device

modprobe: ERROR: could not insert 'amdgpu': No such device

That's a weird response from modprobe

01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Polaris 20 XL [Radeon RX 580 2048SP] (rev ef)
	Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Polaris 20 XL [Radeon RX 580 2048SP]
	Kernel modules: amdgpu

I don't have an AMD chip at all and I can perfectly fine load the module (could be because of the disabled acpi)

sudo LC_ALL=C modprobe -v amdgpu

Can you boot the multi-user.target (2nd link below) w/o acpi=off, but maybe with "nomodeset"?

Offline

#3 2024-03-08 17:46:49

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

seth wrote:

https://bbs.archlinux.org/viewtopic.php … 3#p2155713

Although this machine did come with Win10 installed, I never used it. Do you think that would cause issues?

That counts as dualbooting, 3rd link below. Mandatory.
Disable it (it's NOT the BIOS setting!) and reboot windows and linux twice for voodo reasons.

Hi Seth! Hope to find you well. I will try booting into WinPE and disable it from there. Should work, right?

seth wrote:

Can you boot the multi-user.target (2nd link below) w/o acpi=off, but maybe with "nomodeset"?

I will try, although I think I have tried the nomodeset kernel parameter before without acpi=off. Nevertheless, I'll give it a second chance.

Thanks!

Offline

#4 2024-03-08 18:37:11

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

Hey!

I booted into WinPE. Tried to disable fast boot by regedit and cmd, but it was already disabled. I also tried the kernel parameters you sent and they worked. I tried to modprobe amdgpu to no luck, though.

$ sudo modprobe amdgpu
modprobe: ERROR: could not insert 'amdgpu': Invalid argument

Although I think this is to be expected.

$ sudo journalctl -b | grep "AMD"
mar 08 15:27:27 home kernel: RAMDISK: [mem 0xcdb88000-0xcfbc2fff]
mar 08 15:27:27 home kernel: ACPI: SSDT 0x00000000DB831688 0007DC (v01 LENOVO TC-M25   000015A0 AMD  00000001)
mar 08 15:27:27 home kernel: ACPI: CRAT 0x00000000DB831E68 000810 (v01 LENOVO TC-M25   000015A0 AMD  00000001)
mar 08 15:27:27 home kernel: ACPI: CDIT 0x00000000DB832678 000029 (v01 LENOVO TC-M25   000015A0 AMD  00000001)
mar 08 15:27:27 home kernel: ACPI: IVRS 0x00000000DB835718 0000D0 (v02 LENOVO TC-M25   000015A0 AMD  00000000)
mar 08 15:27:27 home kernel: AMD-Vi: Using global IVHD EFR:0x4f77ef22294ada, EFR2:0x0
mar 08 15:27:27 home kernel: AMD Zen1 DIV0 bug detected. Disable SMT for full protection.
mar 08 15:27:27 home kernel: smpboot: CPU0: AMD Ryzen 3 PRO 2200G with Radeon Vega Graphics (family: 0x17, model: 0x11, stepping: 0x0)
mar 08 15:27:27 home kernel: Performance Events: Fam17h+ core perfctr, AMD PMU driver.
mar 08 15:27:27 home kernel: pci 0000:00:00.2: AMD-Vi: IOMMU performance counters supported
mar 08 15:27:27 home kernel: AMD-Vi: Extended features (0x4f77ef22294ada, 0x0): PPR NX GT IA GA PC GA_vAPIC
mar 08 15:27:27 home kernel: AMD-Vi: Interrupt remapping enabled
mar 08 15:27:27 home kernel: AMD-Vi: Virtual APIC enabled
mar 08 15:27:27 home kernel: perf/amd_iommu: Detected AMD IOMMU #0 (2 banks, 4 counters/bank).

lspci -k: https://termbin.com/ro70
lsmod: https://termbin.com/vo4z
journalctl -u: https://termbin.com/lbso

If anything else is needed, do please ask!

Offline

#5 2024-03-08 20:18:38

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

Good news: inside WinPE, I was able to determine that the GPU is indeed not dead. As I'm writting this, I'm running some GPU stress tests (after I succesfully installed the Adrenalin drivers from AMD) to make sure that the point of failure is not the GPU itself. Now I'm thinking about the "acpi_osi" kernel parameter and spoof Windows 10 (aka Windows 2020). I'll be back!

Edit: didn't work. Compiling linux-lts515 with linux-lts515-headers to try and see if it may be a kernel issue. Don't think it is, as I tried to boot into Linux Mint yesterday and it didn't work either.

sad

Last edited by b0ss_ (2024-03-08 20:34:53)

Offline

#6 2024-03-08 21:02:06

seth
Member
Registered: 2012-09-03
Posts: 64,222

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

sudo modprobe -v amdgpu

will get you more information about what fails.
Also

modprobe -c | grep amdgpu | grep -v alias

Offline

#7 2024-03-08 23:56:41

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

sudo modprobe -v amdgpu
insmod /lib/modules/6.7.9-arch1-1/kernel/drivers/gpu/drm/amdgpu/amdgpu.ko.zst si.support=1 cik_support=1
modprobe: ERROR: could not insert 'amdgpu': Invalid argument.

Before I booted into the current lts kernel (6.6.21-1-lts) and it gave me another but similar error:

insmod /lib/modules/6.6.21-1-lts/kernel/drivers/platform/x86/wmi.ko.zst
modprobe: ERROR: could not insert 'wmi': Invalid parameter.

It has to do with the "acpi=off" parameter, I know that because booting into multi-user.target with nomodeset I can easily modprobe wmi.

$ modprobe -c | grep amdgpu | grep -v alias
(nothing, I deleted si.support and cik.support from the mkinitcpio.conf as they don't really help me at all)

Offline

#8 2024-03-09 00:08:08

Scimmia
Fellow
Registered: 2012-09-01
Posts: 12,723

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

si.support is not si_support. Check /etc/modprobe.d/

Offline

#9 2024-03-09 00:24:31

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

Scimmia wrote:

si.support is not si_support. Check /etc/modprobe.d/

Damn I didn't realize this. I deleted the amdgpu.conf file, but I'll try it with the correct parameter. Although it might've been an issue of me rewriting the terminal output (I can't ssh into the pc for some reason).

No luck. I corrected both parameters, nothing. I deleted the amdgpu.conf file (again) and nothing.

Thanks!

Offline

#10 2024-03-09 01:00:52

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

I was looking around in journalctl and I found something interesting, something that could help me, maybe?

https://termbin.com/nde9

$ journalctl --since=today | grep amdgpu
(snip)
mar 08 15:38:47 home kernel: amdgpu 0000:01:00.0: amdgpu: SE 4, SH per SE 1, CU per SH 9, active_cu_number 32
mar 08 15:38:47 home kernel: amdgpu 0000:01:00.0: amdgpu: Using BACO for runtime pm
mar 08 15:38:47 home kernel: [drm] Initialized amdgpu 3.57.0 20150101 for 0000:01:00.0 on minor 1
mar 08 15:38:47 home kernel: amdgpu 0000:01:00.0: [drm] Cannot find any crtc or sizes
mar 08 15:57:10 home sudo[785]:     b0ss : TTY=tty2 ; PWD=/home/b0ss ; USER=root ; COMMAND=/sbin/modprobe amdgpu -f
mar 08 15:57:10 home kernel: amdgpu: bad vermagic: kernel tainted.
mar 08 15:57:10 home kernel: amdgpu: module verification failed: signature and/or required key missing - tainting kernel
mar 08 15:57:12 home kernel: amdgpu: module has bad taint, not creating trace events
mar 08 16:01:28 home sudo[828]:     b0ss : TTY=tty2 ; PWD=/home/b0ss ; USER=root ; COMMAND=/sbin/modprobe amdgpu
mar 08 16:28:54 home sudo[647]:     b0ss : TTY=tty1 ; PWD=/home/b0ss ; USER=root ; COMMAND=/usr/bin/modprobe -f amdgpu
mar 08 16:28:54 home kernel: amdgpu: bad vermagic: kernel tainted.
mar 08 16:28:54 home kernel: amdgpu: module verification failed: signature and/or required key missing - tainting kernel
mar 08 16:28:57 home kernel: amdgpu: module has bad taint, not creating trace events
(/snip)

Given the bad vermagic, the next thing I tried was to "modprobe --force-vermagic -f amdgpu". I stopped and read the manual and realized that the '-f' flag already does that hmm Now, how can we find the root of the issue at hand? Given the bad vermagic, something is probably wrong with the kernel/some critical file, right?

sad

Offline

#11 2024-03-09 08:13:45

seth
Member
Registered: 2012-09-03
Posts: 64,222

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

That means you've been trying to load the module from a different kernel.

mar 08 15:38:47 home kernel: amdgpu: ATOM BIOS: xxx-xxx-xxx

Have you edited that??
Please don't post random greps, but the complete journal of a boot.
The only thing that's maybe sensitive in there is a WAN routable IPv6 and your wifi MACs, not BIOS version numbers.

I deleted the amdgpu.conf file

There's a good chance it's still in your initramfs, so regenerate that.

Offline

#12 2024-03-09 18:15:21

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

seth wrote:

That means you've been trying to load the module from a different kernel.

Uh, how? I havent moved stuff. And I realized that the bad vermagic is probably related with me using the '-f' flag.

seth wrote:
mar 08 15:38:47 home kernel: amdgpu: ATOM BIOS: xxx-xxx-xxx

Have you edited that??
Please don't post random greps, but the complete journal of a boot.
The only thing that's maybe sensitive in there is a WAN routable IPv6 and your wifi MACs, not BIOS version numbers.

I have not edited anything. But you're right, I should post complete boot logs, sorry lad.

seth wrote:

I deleted the amdgpu.conf file

There's a good chance it's still in your initramfs, so regenerate that.

I did regenerate it though haha.

I think I should give a little more detail on what I've found about the problem. You see, when I boot without "acpi=off" (and no other kernel parameter apart from the default 'quiet loglevel=3'), I get stuck on the initial ramdisk. But, and this might be interesting, a little bit of image "corruption" (or artifacting) occurs. It's just a little tiny bit, like 10 to 20 pixels in a horizontal line. Could this be related to GPU idle states? I read on another post that some people on the NVIDIA side have issues on GRUB with the GFXPAYLOAD option, which after that I set to "text". Nothing has changed, though.

Do you have any ideas? Because I certainly don't. Thanks, mate.

Offline

#13 2024-03-09 19:12:09

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

I tried something else.

I activated the APU graphics on the BIOS (set to Auto, not just iGPU) and booted from that. The amdgpu module is loaded to the 2200G iGPU and the dedicated GPU can't be found on "lspci -nnk" even though it is connected to the motherboard, active and plugged into a monitor. It, of course, doesn't display anything. I did remove the "acpi=off" kernel command from the GRUB cmdline.

lspci -nnk: https://termbin.com/sjcc
sudo dmesg: https://termbin.com/cajj
sudo journalctl -b: https://termbin.com/wt09

Offline

#14 2024-03-09 21:34:31

seth
Member
Registered: 2012-09-03
Posts: 64,222

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

The journal has the amdgpu module loaded, try the LTS kernel - you might be hitting https://bbs.archlinux.org/viewtopic.php?id=293496

Offline

#15 2024-03-09 22:37:45

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

seth wrote:

The journal has the amdgpu module loaded, try the LTS kernel - you might be hitting https://bbs.archlinux.org/viewtopic.php?id=293496

It could be, but the "corruption" is very minimal: https://imgur.com/a/6sL7CMK that's literally all that happens after a few seconds.

Trying the LTS kernel was one of the very first things I did. It did not work. Nevertheless, I will download and boot into the 2023.09 Arch ISO. I don't think it'll work, as I tried to boot into Mint a couple days ago and that has a quite older kernel than we do.

Tried 2023.9: it gets stuck at "Triggering uevents...". I've had this happen before, when I was first re-installing Arch. The installer version I was using was 2023.10 iirc. Using "acpi=off" makes it work, but the same driver issue occurs, so it's possibly something else. I'll go even backwards, to 2021.11 and see what happens there.

Offline

#16 2024-03-09 22:54:50

seth
Member
Registered: 2012-09-03
Posts: 64,222

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

Since the amdgpu module principally loads (can't be blacklsited or have a bad vermagic or module option), what exactly happens now when you boot the system w/ acpi=off and try

modprobe -v amdgpu

Offline

#17 2024-03-09 23:09:06

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

seth wrote:

Since the amdgpu module principally loads (can't be blacklsited or have a bad vermagic or module option), what exactly happens now when you boot the system w/ acpi=off and try

modprobe -v amdgpu

I'll reflash the 2023.9 install and try again. I tried to 2021.11 and had very different results: https://imgur.com/a/jllgRbA the corruption here occurs ONLY when using cmdline options "loglevel=7 amdgpu.dc=0 amdgpu.dpm=0". If I don't use any kernel option, the image goes to black. Even with! acpi=off, it goes to black.

Will reflash and post the output!

Offline

#18 2024-03-09 23:31:07

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

Hey.

With 2023.09, the same issue we saw before exists:

root@archiso ~ # modprobe -v amdgpu
insmod /lib/modules/6.4.12-arch1-1/kernel/drivers/platform/x86/wmi.ko.zst 
modprobe: ERROR: could not insert 'amdgpu': No such device
root@archiso ~ # modprobe -v wmi
insmod /lib/modules/6.4.12-arch1-1/kernel/drivers/platform/x86/wmi.ko.zst 
modprobe: ERROR: could not insert 'wmi': No such device
root@archiso ~ # insmod /lib/modules/6.4.12-arch1-1/kernel/drivers/gpu/drm/amd/amdgpu/amdgpu.ko.zst
insmod: ERROR: could not insert module /lib/modules/6.4.12-arch1-1/kernel/drivers/gpu/drm/amd/amdgpu/amdgpu.ko.zst: Unknown symbol in module

The last one is new. I think I haven't seen that error before, but I may've forgotten. Also, what is wmi? Why when I try to load amdgpu, it tries to insert wmi? I think it fails because it's related to ACPI from what I saw here anddd here.

journalctl -b of 2023.09: https://termbin.com/749q
dmesg of 2023.09: https://termbin.com/vlp0

I don't see anything related with amdgpu... do you see any error, or does something sticks out to you?

Offline

#19 2024-03-09 23:41:14

seth
Member
Registered: 2012-09-03
Posts: 64,222

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

I suspect the module links into acpi and cannot be loaded w/o (last message) and the errors are mostly misleadign itr.

Can you get a journal (eg. via ssh or by rebooting w/ https://wiki.archlinux.org/title/Keyboa … el_(SysRq) ) from a boot w/o acpi=off?

Offline

#20 2024-03-10 00:17:01

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

seth wrote:

I suspect the module links into acpi and cannot be loaded w/o (last message) and the errors are mostly misleadign itr.

Can you get a journal (eg. via ssh or by rebooting w/ https://wiki.archlinux.org/title/Keyboa … el_(SysRq) ) from a boot w/o acpi=off?

The issue is that when I boot with the GPU as the display output, the system just dies. It doesn't respond to anything. I tried to "Alt+SysRq (PrintScreen in my kbd)+b" to reboot, and nothing. I created "99-sysctl.conf" file in /etc/sysctl, wrote "kernel.sysrq = 1" and closed. Reconfigured
the /etc/default/grub file and added "sysrq_always_enabled=1" to make it work 100%.

And I can't ssh into the system. I know it doesn't boot into the ly display manager, because when it does, the small speaker that is embedded to the mobo makes a little tiny sound. So no ssh.

I can't get the output to a website, so I'll have to post a picture of "loglevel=9 sysrq_always_enabled=1" w/o acpi=off: https://imgur.com/a/gVdrauk

What immediately sticks out to me, is that the RX 580 is a POLARIS20/Ellesmere card, not a POLARIS10. I read around that the RX580 is just a RX480 on steroids, so that might be why they're seen as the same card by the kernel.

Any ideas?

EDIT: forgot to post the journalctl w/o acpi=off: https://termbin.com/dlc3

Last edited by b0ss_ (2024-03-10 00:23:13)

Offline

#21 2024-03-10 10:17:53

seth
Member
Registered: 2012-09-03
Posts: 64,222

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

Journal has a Vega GPU.
Since the GPU is new, have you tested it in a different system?
Did you forget to attach the 8/6-pin dedicated power?
Can windows use the GPU in full capacity (ie. you can run some GL benchmark on it)?
Is it possible to boot the system w/ both GPUs enabled? (in general, even if at hand only w/ acpi=off, but they both show up in lspci)?

Offline

#22 2024-03-10 17:46:30

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

seth wrote:

Journal has a Vega GPU.

Yes, I can't boot with the GPU plugged to the monitor. If I set the video priority to auto in the BIOS menu and plug the video cable to the motherboard and use the integrated graphics, the dedicated GPU doesn't show up.

seth wrote:

Since the GPU is new, have you tested it in a different system?
Did you forget to attach the 8/6-pin dedicated power?
Can windows use the GPU in full capacity (ie. you can run some GL benchmark on it)?

1. This is the only system I have, aka I don't have any other mobo sad
2. Yes! As a matter of fact, I had to run to buy a new power supply lol. Tried to use it without the 8-pin but the power LED on the GPU was red, after I plugged the 8-pin, it turned white. Which obviously meant that the GPU wasn't receiving the necessary power to function. Without the 8pin, it doesn't show up on the lspci at all. Not even if I plugged the DP cable to it.
3. Yep, I booted into WinPE the othey day, installed the Adrenalin drivers and everything worked as expected (minus DX10 and onwards, since WinPE is a very stripped down version of Windows made to be used in recovery environments).

seth wrote:

Is it possible to boot the system w/ both GPUs enabled? (in general, even if at hand only w/ acpi=off, but they both show up in lspci)?

Technically, yes. With "acpi=off", they both show up. If I disable "acpi=off" and plug into the GPU, the system crashes. If I plug the DP into the mobo, the system just works, but it doesn't show the RX580 on lspci. Not even under /sys/kernel/debug/dri.

I've ran out of ideas.

Offline

#23 2024-03-10 20:34:23

seth
Member
Registered: 2012-09-03
Posts: 64,222

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

If I plug the DP into the mobo, the system just works, but it doesn't show the RX580 on lspci. Not even under /sys/kernel/debug/dri.

Post that journal, still.
Also the output of "lspci -tvnn" from the boot where both showed up.

1. Have you tried a different PCI slot (notably one that the mobo manual refers to as PEG)
2. Do you have a second monitor…?

Offline

#24 2024-03-10 21:27:49

b0ss_
Member
Registered: 2021-01-16
Posts: 83

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

seth wrote:

If I plug the DP into the mobo, the system just works, but it doesn't show the RX580 on lspci. Not even under /sys/kernel/debug/dri.

Post that journal, still.
Also the output of "lspci -tvnn" from the boot where both showed up.

1. Have you tried a different PCI slot (notably one that the mobo manual refers to as PEG)
2. Do you have a second monitor…?

Here's a journal where both show up (with acpi=off, as without, only the APU shows up): https://termbin.com/r1li

lspci -tvnn:

$ lspci -tvnn
-[0000:00]-+-00.0  Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Root Complex [1022:15d0]
           +-00.2  Advanced Micro Devices, Inc. [AMD] Raven/Raven2 IOMMU [1022:15d1]
           +-01.0  Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge [1022:1452]
           +-01.1-[01]--+-00.0  Advanced Micro Devices, Inc. [AMD/ATI] Polaris 20 XL [Radeon RX 580 2048SP] [1002:6fdf]
           |            \-00.1  Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere HDMI Audio [Radeon RX 470/480 / 570/580/590] [1002:aaf0]
           +-01.2-[02-09]--+-00.0  Advanced Micro Devices, Inc. [AMD] 400 Series Chipset USB 3.1 xHCI Compliant Host Controller [1022:43d5]
           |               +-00.1  Advanced Micro Devices, Inc. [AMD] 400 Series Chipset SATA Controller [1022:43c8]
           |               \-00.2-[03-09]--+-00.0-[04]--
           |                               +-01.0-[05]--
           |                               +-04.0-[06]--
           |                               +-06.0-[07-08]----00.0-[08]--
           |                               \-07.0-[09]--+-00.0  Realtek Semiconductor Co., Ltd. RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet Controller [10ec:8168]
           |                                            +-00.1  Realtek Semiconductor Co., Ltd. RTL8111xP UART #1 [10ec:816a]
           |                                            +-00.2  Realtek Semiconductor Co., Ltd. RTL8111xP UART #2 [10ec:816b]
           |                                            +-00.3  Realtek Semiconductor Co., Ltd. RTL8111xP IPMI interface [10ec:816c]
           |                                            \-00.4  Realtek Semiconductor Co., Ltd. RTL811x EHCI host controller [10ec:816d]
           +-08.0  Advanced Micro Devices, Inc. [AMD] Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge [1022:1452]
           +-08.1-[0a]--+-00.0  Advanced Micro Devices, Inc. [AMD] Zeppelin/Raven/Raven2 PCIe Dummy Function [1022:145a]
           |            +-00.2  Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) Platform Security Processor [1022:15df]
           |            +-00.3  Advanced Micro Devices, Inc. [AMD] Raven USB 3.1 [1022:15e0]
           |            +-00.4  Advanced Micro Devices, Inc. [AMD] Raven USB 3.1 [1022:15e1]
           |            +-00.5  Advanced Micro Devices, Inc. [AMD] ACP/ACP3X/ACP6x Audio Coprocessor [1022:15e2]
           |            \-00.6  Advanced Micro Devices, Inc. [AMD] Family 17h/19h HD Audio Controller [1022:15e3]
           +-08.2-[0b]----00.0  Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] [1022:7901]
           +-14.0  Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller [1022:790b]
           +-14.3  Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge [1022:790e]
           +-18.0  Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 0 [1022:15e8]
           +-18.1  Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 1 [1022:15e9]
           +-18.2  Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 2 [1022:15ea]
           +-18.3  Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 3 [1022:15eb]
           +-18.4  Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 4 [1022:15ec]
           +-18.5  Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 5 [1022:15ed]
           +-18.6  Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 6 [1022:15ee]
           \-18.7  Advanced Micro Devices, Inc. [AMD] Raven/Raven2 Device 24: Function 7 [1022:15ef]

1. This mobo only has this PCIE port. The documentation contains some basic information about the mobo, and the slot I'm using is the only one that can be used to insert a GPU.
2. I'm actually waiting for a monitor to arrive. I've been using an LG TV as a display for a couple of months now, but that hasn't been an issue. What I thought might have been a problem is that I use a HDMI to DP adapter. Nevertheless, the GPU has an HDMI connection that I tried and the results were the same.

Offline

#25 2024-03-10 21:49:11

seth
Member
Registered: 2012-09-03
Posts: 64,222

Re: [SOLVED!] Can't load AMD drivers in APU+GPU setup

The journal doesn't have 1002:15dd and it also doesn't show up in lspci, instead it turns into

+-08.1-[0a]--+-00.0  Advanced Micro Devices, Inc. [AMD] Zeppelin/Raven/Raven2 PCIe Dummy Function [1022:145a]

what likely explains why you can't plug an output there (and kills my idea to just plug an output into both)

Can you boot w/ "pci=noacpi" or "pci=nocrs"?

Offline

Board footer

Powered by FluxBB