My Wi-Fi driver keeps shutting off randomly on my laptop (Asus TUF A15) using the MediaTek MT7921 card. I have to reboot the system to get it working again, and even then it only works intermittently before failing again.
❯ lspci -k | grep -A4 -i network
pcilib: Error reading /sys/bus/pci/devices/0000:00:08.3/label: Operation not permitted
03:00.0 Network controller: MEDIATEK Corp. MT7921 802.11ax PCIe Wireless Network Adapter [Filogic 330]
Subsystem: AzureWave Device 4680
Kernel driver in use: mt7921e
Kernel modules: mt7921e
sudo journalctl -b | curl -F 'file=@-' 0x0.st
These are the services and I'm not sure if more than one thing is trying to control the network.
❯ systemctl list-units --type service --no-pager
UNIT LOAD ACTIVE SUB DESCRIPTION
avahi-daemon.service loaded active running Avahi mDNS/DNS-SD Stack
bluetooth.service loaded active running Bluetooth service
containerd.service loaded active running containerd container runtime
cups-browsed.service loaded active running Make remote CUPS printers available lo…
cups.service loaded active running CUPS Scheduler
dbus-broker.service loaded active running D-Bus System Message Bus
docker.service loaded active running Docker Application Container Engine
kmod-static-nodes.service loaded active exited Create List of Static Device Nodes
NetworkManager-wait-online.service loaded active exited Network Manager Wait Online
NetworkManager.service loaded active running Network Manager
plymouth-quit-wait.service loaded active exited Hold until boot process finishes up
plymouth-quit.service loaded active exited Terminate Plymouth Boot Screen
plymouth-read-write.service loaded active exited Tell Plymouth To Write Out Runtime Data
plymouth-start.service loaded active exited Show Plymouth Boot Screen
polkit.service loaded active running Authorization Manager
postgresql.service loaded active running PostgreSQL database server
power-profiles-daemon.service loaded active running Power Profiles daemon
rtkit-daemon.service loaded active running RealtimeKit Scheduling Policy Service
sddm.service loaded active running Simple Desktop Display Manager
systemd-backlight@backlight:nvidia_0.s… loaded active exited Load/Save Screen Backlight Brightness …
systemd-backlight@leds:asus::kbd_backl… loaded active exited Load/Save Screen Backlight Brightness …
systemd-journal-flush.service loaded active exited Flush Journal to Persistent Storage
systemd-journald.service loaded active running Journal Service
systemd-logind.service loaded active running User Login Management
systemd-modules-load.service loaded active exited Load Kernel Modules
systemd-random-seed.service loaded active exited Load/Save OS Random Seed
systemd-remount-fs.service loaded active exited Remount Root and Kernel File Systems
systemd-resolved.service loaded active running Network Name Resolution
systemd-sysctl.service loaded active exited Apply Kernel Variables
systemd-timesyncd.service loaded active running Network Time Synchronization
systemd-tmpfiles-setup-dev-early.servi… loaded active exited Create Static Device Nodes in /dev gra…
systemd-tmpfiles-setup-dev.service loaded active exited Create Static Device Nodes in /dev
systemd-tmpfiles-setup.service loaded active exited Create System Files and Directories
systemd-udev-load-credentials.service loaded active exited Load udev Rules from Credentials
systemd-udev-trigger.service loaded active exited Coldplug All udev Devices
systemd-udevd.service loaded active running Rule-based Manager for Device Events a…
systemd-update-utmp.service loaded active exited Record System Boot/Shutdown in UTMP
systemd-user-sessions.service loaded active exited Permit User Sessions
systemd-userdb-load-credentials.service loaded active exited Load JSON user/group Records from Cred…
systemd-userdbd.service loaded active running User Database Manager
systemd-vconsole-setup.service loaded active exited Virtual Console Setup
systemd-zram-setup@zram0.service loaded active exited Create swap on /dev/zram0
udisks2.service loaded active running Disk Manager
ufw.service loaded active exited CLI Netfilter Manager
upower.service loaded active running Daemon for power management
user-runtime-dir@1000.service loaded active exited User Runtime Directory /run/user/1000
user@1000.service loaded active running User Manager for UID 1000
wpa_supplicant.service loaded active running WPA supplicant
Legend: LOAD → Reflects whether the unit definition was properly loaded.
ACTIVE → The high-level unit activation state, i.e. generalization of SUB.
SUB → The low-level unit activation state, values depend on unit type.
48 loaded units listed. Pass --all to see loaded but inactive units, too.
To show all installed unit files use 'systemctl list-unit-files'.
I’ve updated the MediaTek firmware package, but it didn’t help:
❯ pacman -Qs linux-firmware-mediatek
local/linux-firmware-mediatek 20251111-1
Firmware files for Linux - Firmware for MediaTek and Ralink devices
I've tried a few things:
Boot parameter: pcie_aspm=off
Blacklisting mt7921e at boot, then manually loading it with:
sudo modprobe -v mt7921e
Sometimes the laptop even freezes and the Caps Lock LED starts blinking.
Last edited by rapzy (2025-11-18 04:01:32)
Offline
Nov 18 09:01:46 sarjyant sudo[1762]: sarjyant : TTY=pts/0 ; PWD=/home/sarjyant ; USER=root ; COMMAND=/usr/bin/modprobe -v mt7921e
Nov 18 09:01:46 sarjyant kernel: mt7921e 0000:03:00.0: enabling device (0000 -> 0002)
Nov 18 09:01:46 sarjyant kernel: mt7921e 0000:03:00.0: disabling ASPM L1
Nov 18 09:01:46 sarjyant kernel: mt7921e 0000:03:00.0: ASIC revision: 79610010
Nov 18 09:01:46 sarjyant kernel: mt7921e 0000:03:00.0: HW/SW Version: 0x8a108a10, Build Time: 20250625153620a
Nov 18 09:01:46 sarjyant kernel: mt7921e 0000:03:00.0: WM Firmware Version: ____010000, Build Time: 20250625153703
Nov 18 09:01:47 sarjyant NetworkManager[688]: <info> [1763435807.7088] rfkill1: found Wi-Fi radio killswitch (at /sys/devices/pci0000:00/0000:00:02.2/0000:03:00.0/ieee80211/phy0/rfkill1) (driver mt7921e)
Nov 18 09:01:47 sarjyant kernel: mt7921e 0000:03:00.0 wlp3s0: renamed from wlan0
The module gets loaded late, manually, but then there are no indications of crashes, signal breakdowns or network losses in the journal.
Looks like there's a stale AP
Nov 18 09:01:49 sarjyant wpa_supplicant[1803]: wlp3s0: SME: Trying to authenticate with c2:8b:a9:55:7c:3b (SSID='Cloco@Wlink-Static' freq=5745 MHz)
Nov 18 09:01:50 sarjyant wpa_supplicant[1803]: wlp3s0: Trying to associate with c2:8b:a9:55:7c:3b (SSID='Cloco@Wlink-Static' freq=5745 MHz)
Nov 18 09:01:50 sarjyant wpa_supplicant[1803]: wlp3s0: Added BSSID c2:8b:a9:55:7c:3b into ignore list, ignoring for 10 seconds
Nov 18 09:01:52 sarjyant wpa_supplicant[1803]: wlp3s0: SME: Trying to authenticate with ba:8b:a9:44:cf:7f (SSID='Cloco@Wlink-Static' freq=5320 MHz)
Nov 18 09:01:52 sarjyant wpa_supplicant[1803]: wlp3s0: Trying to associate with ba:8b:a9:44:cf:7f (SSID='Cloco@Wlink-Static' freq=5320 MHz)
Nov 18 09:01:52 sarjyant wpa_supplicant[1803]: wlp3s0: Associated with ba:8b:a9:44:cf:7f
Nov 18 09:01:52 sarjyant wpa_supplicant[1803]: wlp3s0: CTRL-EVENT-SUBNET-STATUS-UPDATE status=0
Nov 18 09:01:53 sarjyant wpa_supplicant[1803]: wlp3s0: WPA: Key negotiation completed with ba:8b:a9:44:cf:7f [PTK=CCMP GTK=CCMP]
Nov 18 09:01:53 sarjyant wpa_supplicant[1803]: wlp3s0: CTRL-EVENT-CONNECTED - Connection to ba:8b:a9:44:cf:7f completed [id=0 id_str=]
but that's dealt with and no further issues are apparent.
Please post a journal that actually covers a network breakdown or crash of any kind ("Caps Lock LED starts blinking" is a kernel panic, you'll not get journals from those w/o https://wiki.archlinux.org/title/Kdump )
FWIW, there's https://bbs.archlinux.org/viewtopic.php … 9#p2273249 (maybe doesn't only affect iwlwifi)
Offline
This journal contains the crash.
https://0x0.st/Kfl2.txt
Offline
Nov 19 09:12:21 sarjyant kernel: mt7921e 0000:03:00.0: HW/SW Version: 0x8a108a10, Build Time: 20250625153620a
Nov 19 09:12:22 sarjyant kernel: mt7921e 0000:03:00.0: WM Firmware Version: ____010000, Build Time: 20250625153703
Nov 19 09:12:22 sarjyant NetworkManager[699]: <info> [1763522842.9018] rfkill1: found Wi-Fi radio killswitch (at /sys/devices/pci0000:00/0000:00:02.2/0000:03:00.0/ieee80211/phy0/rfkill1) (driver mt7921e)
Nov 19 09:12:22 sarjyant kernel: mt7921e 0000:03:00.0 wlp3s0: renamed from wlan0
Nov 19 09:30:02 sarjyant kernel: mt7921e 0000:03:00.0: driver own failed
Nov 19 09:30:03 sarjyant kernel: mt7921e 0000:03:00.0: driver own failed
Nov 19 09:30:04 sarjyant kernel: mt7921e 0000:03:00.0: driver own failed
Nov 19 09:30:06 sarjyant kernel: mt7921e 0000:03:00.0: Timeout for driver own
https://bbs.archlinux.org/viewtopic.php … 4#p2267274
But the device initially works, and the firmware is different and so, it seems, is the chip.
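To see when the chip drops out relative to when the driver loaded, the "driver own" messages quoted above can be pulled out of the kernel log. This is only a rough sketch: the function name `driver_own_failures` is made up for illustration, and the pattern is based solely on the message text shown in this thread.

```shell
# Print mt7921e "driver own" failures from a kernel log read on stdin.
# The function name is hypothetical; the pattern only covers the two
# message variants quoted in this thread.
# Usage: journalctl -k -b | driver_own_failures
driver_own_failures() {
    grep -E 'mt7921e .*(driver own failed|Timeout for driver own)'
}
```

Comparing the first timestamp it prints with the "renamed from wlan0" line gives the time the card survived after loading.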
pci=noaer
Why is that there? Do you otherwise get a lot of noise on the bus?
Also please post
lspci -tvnn
mt7921e.disable_aspm=1
but there's already
Nov 19 09:12:20 sarjyant kernel: PCIe ASPM is disabled
Nov 19 09:12:20 sarjyant kernel: acpi PNP0A08:00: _OSC: not requesting OS control; OS requires [ExtendedConfig ASPM ClockPM MSI]
Nov 19 09:12:21 sarjyant kernel: mt7921e 0000:03:00.0: disabling ASPM L1
Offline
Why is that there? Do you otherwise get a lot of noise on the bus?
I was trying out different solutions I found on the internet.
❯ lspci -tvnn
-[0000:00]-+-00.0 Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Root Complex [1022:14b5]
+-00.2 Advanced Micro Devices, Inc. [AMD] Family 17h-19h IOMMU [1022:14b6]
+-01.0 Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Dummy Host Bridge [1022:14b7]
+-01.1-[01]--+-00.0 NVIDIA Corporation GA107 [GeForce RTX 2050] [10de:25ad]
| \-00.1 NVIDIA Corporation GA107 High Definition Audio Controller [10de:2291]
+-02.0 Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Dummy Host Bridge [1022:14b7]
+-02.1-[02]----00.0 Realtek Semiconductor Co., Ltd. RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet Controller [10ec:8168]
+-02.2-[03]----00.0 MEDIATEK Corp. MT7921 802.11ax PCIe Wireless Network Adapter [Filogic 330] [14c3:7961]
+-02.4-[04]----00.0 Sandisk Corp WD SN560/SN740/SN770/SN5000 NVMe SSD [15b7:5017]
+-03.0 Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Dummy Host Bridge [1022:14b7]
+-04.0 Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Dummy Host Bridge [1022:14b7]
+-08.0 Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Dummy Host Bridge [1022:14b7]
+-08.1-[05]--+-00.0 Advanced Micro Devices, Inc. [AMD/ATI] Dummy Function (absent graphics controller) [1002:145a]
| +-00.2 Advanced Micro Devices, Inc. [AMD] Family 19h PSP/CCP [1022:1649]
| +-00.3 Advanced Micro Devices, Inc. [AMD] Rembrandt USB4 XHCI controller #3 [1022:161d]
| +-00.4 Advanced Micro Devices, Inc. [AMD] Rembrandt USB4 XHCI controller #4 [1022:161e]
| +-00.5 Advanced Micro Devices, Inc. [AMD] Audio Coprocessor [1022:15e2]
| \-00.6 Advanced Micro Devices, Inc. [AMD] Family 17h/19h/1ah HD Audio Controller [1022:15e3]
+-08.3-[06]--+-00.0 Advanced Micro Devices, Inc. [AMD] Rembrandt USB4 XHCI controller #8 [1022:161f]
| +-00.3 Advanced Micro Devices, Inc. [AMD] Rembrandt USB4 XHCI controller #5 [1022:15d6]
| \-00.4 Advanced Micro Devices, Inc. [AMD] Rembrandt USB4 XHCI controller #6 [1022:15d7]
+-14.0 Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller [1022:790b]
+-14.3 Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge [1022:790e]
+-18.0 Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 0 [1022:1679]
+-18.1 Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 1 [1022:167a]
+-18.2 Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 2 [1022:167b]
+-18.3 Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 3 [1022:167c]
+-18.4 Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 4 [1022:167d]
+-18.5 Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 5 [1022:167e]
+-18.6 Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 6 [1022:167f]
\-18.7 Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 7 [1022:1680]
Offline
"pci=noaer" doesn't fix anything, it only suppresses errors - if you remove that, do you get a lot of PCI errors ahead of the chip failing?
The MT7921 is on the same bus as the nvme…
https://wiki.archlinux.org/title/Solid_ … leshooting
Instead of "pci=noaer" add "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off iommu=soft"
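After rebooting, it may be worth confirming that the parameters actually reached the running kernel. A minimal sketch, assuming a POSIX shell; the helper name `missing_params` is hypothetical and the parameter list is simply copied from the suggestion above:

```shell
# List which of the suggested kernel parameters are absent from a command line.
# The helper name is hypothetical; the parameter list is taken from this thread.
# Usage: missing_params "$(cat /proc/cmdline)"
missing_params() {
    cmdline=" $1 "   # pad with spaces so every parameter is space-delimited
    for p in nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off iommu=soft; do
        case "$cmdline" in
            *" $p "*) ;;                 # parameter present, print nothing
            *) printf '%s\n' "$p" ;;     # parameter missing, report it
        esac
    done
}
```

No output means all four parameters are on the command line that was passed in.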
Offline
do you get a lot of PCI errors ahead of the chip failing?
I get these every time I boot; I don't think they're related to the Wi-Fi failure.
Nov 20 08:58:32 sarjyant kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.GPP2.WWAN], AE_NOT_FOUND (20250404/dswload2-162)
Nov 20 08:58:32 sarjyant kernel: ACPI Error: AE_NOT_FOUND, During name lookup/catalog (20250404/psobject-220)
Nov 20 08:58:32 sarjyant kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.GPP5.RTL8], AE_NOT_FOUND (20250404/dswload2-162)
Nov 20 08:58:32 sarjyant kernel: ACPI Error: AE_NOT_FOUND, During name lookup/catalog (20250404/psobject-220)
Nov 20 08:58:32 sarjyant kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PCI0.GPP6.WLAN._S0W], AE_ALREADY_EXISTS (20250404/dswload2-326)
Nov 20 08:58:32 sarjyant kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20250404/psobject-220)
Nov 20 08:58:32 sarjyant kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.GPP7.DEV0], AE_NOT_FOUND (20250404/dswload2-162)
Nov 20 08:58:32 sarjyant kernel: ACPI Error: AE_NOT_FOUND, During name lookup/catalog (20250404/psobject-220)
Nov 20 08:58:32 sarjyant kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_TZ.THRM._SCP.CTYP], AE_NOT_FOUND (20250404/psargs-332)
Nov 20 08:58:32 sarjyant kernel: ACPI Error: Aborting method \_TZ.THRM._SCP due to previous error (AE_NOT_FOUND) (20250404/psparse-529)
Offline
Those are kinda normal - you'd be looking for problems that look more like
Nov 19 20:16:39 archlinux kernel: pcieport 0000:00:1c.7: AER: Correctable error message received from 0000:00:1c.7
Nov 19 20:16:39 archlinux kernel: pcieport 0000:00:1c.7: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
Nov 19 20:16:39 archlinux kernel: pcieport 0000:00:1c.7: device [8086:a397] error status/mask=00000001/00002000
Nov 19 20:16:39 archlinux kernel: pcieport 0000:00:1c.7: [ 0] RxErr (First)
Did you add the nvme mitigating parameters and does that have any impact so far?
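For picking such lines out of a long journal, a filter along these lines might help. It is a sketch only: the function name `aer_lines` is invented here, and the pattern is derived from the sample messages above, so it may miss other AER wording.

```shell
# Print kernel log lines that look like PCIe AER reports (read from stdin).
# The function name is hypothetical; the pattern is based on the sample
# messages in this thread and is not exhaustive.
# Usage: journalctl -k -b | aer_lines
aer_lines() {
    grep -E 'AER: |PCIe Bus Error'
}
```

Note that with "pci=noaer" on the kernel command line no such messages are logged at all, so this is only meaningful on a boot without that parameter.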
Offline
GRUB_CMDLINE_LINUX_DEFAULT="loglevel=3 quiet nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off iommu=soft"
These are my current parameters. I haven’t experienced any issues today, but I’m not sure whether that’s due to these settings. I’ll update after a few days of usage.
Offline
The issue still persists. I'm actually experiencing more frequent kernel panics on the laptop.
Last edited by rapzy (2025-11-21 08:53:19)
Offline
Please post your complete system journal for a boot where you lost the NIC and without "pci=noaer" (losing the wifi will not "freeze" the system, but losing the root partition would)
Can you provoke this from the arch installation iso (ie a usb boot where you don't rely on the nvme as root) and thus avoid any freezes?
Offline