My wifi has been pretty unstable recently and disconnects and reconnects all the time, which is extremely annoying. After some investigation I found out that the kernel also seems to throw some possibly related "correctable errors" in the journal:
Mar 22 18:08:40 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: [ 0] RxErr (First)
Mar 22 18:08:40 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: device [10ec:c821] error status/mask=00000001/0000e000
Mar 22 18:08:40 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
Mar 22 18:08:40 RandomArchUser kernel: pcieport 0000:00:1d.1: AER: Multiple Correctable error message received from 0000:02:00.0
(I should probably note that I ran journalctl with the -r option, so you have to read it from bottom to top.)
These four lines specifically are being repeated over and over again. I'm not really an expert, but I think this is probably related to my driver and network card? At least I find a matching identifier (8821ce) when I run lspci:
02:00.0 Network controller: Realtek Semiconductor Co., Ltd. RTL8821CE 802.11ac PCIe Wireless Network Adapter
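To put a number on how badly the log is flooded, the correctable errors can be counted per PCI device. This is only a rough sketch, assuming the kernel log format shown above; the helper name count_aer is made up for illustration and is not part of any tool.

```shell
#!/bin/sh
# count_aer: hypothetical helper. Reads kernel log lines on stdin and prints
# "<count> <pci address>" for every device that reported correctable
# PCIe bus errors.
count_aer() {
    grep 'PCIe Bus Error: severity=Correctable' |
        grep -oE '[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-7]' |
        sort | uniq -c | awk '{ print $1, $2 }'
}
```

It could be fed from the kernel log of the current boot, e.g. `journalctl -k -b | count_aer`.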
So I guess I found out a bit already, but I'm unsure how to use that information. Could anyone please help me? Please tell me if you need any other logs. Also, yes, I checked: no network services other than iwd and dhcpcd are running.
Thx in advance :)
EDIT:
Forgot to mention: in the logs, iwd also complains about invalid HE capabilities. Not sure what that means.
Last edited by NeoArcher (2025-03-22 18:35:23)
Random guess https://bbs.archlinux.org/viewtopic.php … 9#p2202679
Otherwise please post your complete system journal for the boot:
sudo journalctl -b | curl -F 'file=@-' 0x0.st
I tried it and at first it seemed to work, but then the connection started occasionally dropping again.
Btw I reinstalled Arch recently because I had a mess of DEs and WMs and thought a fresh install would be more convenient than trying to delete everything separately (trust me, it was a brutal mess), so I don't have that config on my new installation anymore. But since it didn't work permanently, I guess it doesn't really matter? Just wanted to add that.
Here is my log then: https://0x0.st/8jl0.txt
The wifi disconnected a couple of times in there, not sure when though, but searching for lines that include iwd and dhcpcd should probably do the job. (The log is flooded with the four lines I mentioned in the OP.)
The journal is basically just
Mar 25 22:41:58 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: [ 0] RxErr (First)
Mar 25 22:41:58 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
Mar 25 22:41:58 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: device [10ec:c821] error status/mask=00000001/0000e000
Mar 25 22:41:58 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: [ 0] RxErr (First)
Mar 25 22:41:59 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
Mar 25 22:41:59 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: device [10ec:c821] error status/mask=00000001/0000e000
Mar 25 22:41:59 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: [ 0] RxErr (First)
Mar 25 22:41:59 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: PCIe Bus Error: severity=Correctable, type=Physical Layer, (Receiver ID)
Mar 25 22:41:59 RandomArchUser kernel: rtw_8821ce 0000:02:00.0: device [10ec:c821] error status/mask=00000001/0000e000
Apparently ASPM isn't implemented?
Mar 25 20:24:43 RandomArchUser kernel: ACPI FADT declares the system doesn't support PCIe ASPM, so disable it
Do the PCI errors go away when re-adding
options rtw89_pci disable_clkreq=y disable_aspm_l1=y disable_aspm_l1ss=y
?
Edit: other than that the system is roaming, see https://wiki.archlinux.org/title/Iwd#iwd_keeps_roaming and make sure that's not the cause of the remaining "problems"
Last edited by seth (2025-03-25 22:27:32)
Nope, nothing changes with those options: still bad wifi, and the log is still flooded with PCI errors.
Last edited by NeoArcher (2025-03-27 08:35:12)
Please avoid bloating the thread with pointless full-quotes.
Please avoid bloating the thread with pointless full-quotes.
Edited my last post, thx for the reminder and sorry
Last edited by NeoArcher (2025-03-27 08:35:57)
No problem, have you tried the LTS kernel?
No problem, have you tried the LTS kernel?
Nope, didn't think of that.
Might be worth noting that my wifi issues, though not completely fixed, got a bit better: disconnections are less frequent now. Will try the LTS kernel over the next few days and report back :)
Alright here is my report ig
Had the same issues with the LTS kernel too, but I discovered that if I go closer to my wifi router, the wifi stops disconnecting. So I think the issue is that I am too far away from the router, and because of that iwd regularly roams, which probably causes the disconnections? Does that make sense or do I have to work on my logic?
In any case, the pci errors are still there no matter what kernel I use.
Does that make sense
Likely, see https://wiki.archlinux.org/title/Iwd#iwd_keeps_roaming to control the roaming strategy - or lock into a BSSID (the AP's MAC address)
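Whether the machine really sits at the edge of the access point's range can be checked by reading the current signal level. A small sketch, assuming the `iw` utility is installed and the interface is called wlan0 (both are assumptions, adjust as needed); signal_dbm and below_threshold are hypothetical helper names. The -70 dBm in the usage line is, as far as I know, iwd's default RoamThreshold for 2.4 GHz, but check iwd.config(5) before relying on it.

```shell
#!/bin/sh
# signal_dbm: hypothetical helper. Extracts the signal level (in dBm) from
# the output of `iw dev <iface> link` given on stdin.
signal_dbm() {
    awk '$1 == "signal:" { print $2; exit }'
}

# below_threshold: succeeds when the signal level (arg 1) is weaker than
# the threshold (arg 2); both are integers in dBm.
below_threshold() {
    [ "$1" -lt "$2" ]
}
```

For example: `sig=$(iw dev wlan0 link | signal_dbm); below_threshold "$sig" -70 && echo "weak signal, roaming is likely"`.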
Do the parameters actually apply?
systool -vm rtw89_pci
Can you configure anything ASPM/power related in the UEFI?
Is there maybe a parallel windows installation?
Do the parameters actually apply?
systool -vm rtw89_pci
Doesn't seem so, the output is:
Error opening module rtw89_pci
Ig that means I did something wrong? Or could it be related to the fact that I ran this command on a different kernel than the normal Linux kernel? (am using zen rn)
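The "Error opening module" message most likely just means that no module of that name is loaded, whatever the kernel flavour. One way to see which Realtek wifi modules are actually loaded is to filter the output of lsmod. A minimal sketch; the helper name rtw_modules is made up:

```shell
#!/bin/sh
# rtw_modules: hypothetical helper. Reads `lsmod` output on stdin and prints
# the names of all loaded modules whose name starts with "rtw".
rtw_modules() {
    awk '$1 ~ /^rtw/ { print $1 }'
}
```

Run as `lsmod | rtw_modules`, then pass one of the printed names to `systool -vm`.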
Can you configure anything ASPM/power related in the UEFI?
Can't access the UEFI at the moment, but I think I saved the password somewhere (yeah, I locked it for no reason). Will try to find it.
Is there maybe a parallel windows installation?
Nope, no multi boot, only vanilla x86_64 Arch Linux on the entire drive
Last edited by NeoArcher (2025-04-03 16:39:42)
F*** you're on rtw88 too, no idea how I end up reading the 89 there all the time (pleasenoglassespleasenoglassespleasenoglasses…)
options rtw88_pci disable_aspm=Y
You can also try adding "disable_msi=Y"
options rtw88_pci disable_aspm=Y disable_msi=Y
but ASPM is much more likely here.
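Such options lines normally live in a file under /etc/modprobe.d/ (the file name, e.g. rtw88.conf, is arbitrary as long as it ends in .conf) and take effect the next time the module is loaded. Whether they were picked up can be read back from sysfs. A rough sketch; show_params is a hypothetical helper, and its optional second argument only exists so it can be pointed at a scratch directory instead of /sys:

```shell
#!/bin/sh
# show_params: hypothetical helper. Prints "name=value" for every parameter
# of a loaded module. $1 is the module name, $2 an optional sysfs root
# (defaults to /sys).
show_params() {
    dir="${2:-/sys}/module/$1/parameters"
    if [ ! -d "$dir" ]; then
        echo "module $1 is not loaded or has no parameters" >&2
        return 1
    fi
    for f in "$dir"/*; do
        # Some parameters are not world-readable; skip those.
        [ -r "$f" ] && printf '%s=%s\n' "$(basename "$f")" "$(cat "$f")"
    done
    return 0
}
```

`show_params rtw88_pci` should then list disable_aspm=Y if the option was applied.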
F*** you're on rtw88 too, no idea how I end up reading the 89 there all the time (pleasenoglassespleasenoglassespleasenoglasses…)
Lol was my mistake too tbh, should've seen it myself xD And I really appreciate that you help me here!
options rtw88_pci disable_aspm=Y
You can also try adding "disable_msi=Y"
options rtw88_pci disable_aspm=Y disable_msi=Y
but ASPM is much more likely here.
Nope, neither got rid of the PCI errors, though both options are in effect now. Output of
systool -vm rtw88_pci
:
Module = "rtw88_pci"
Attributes:
coresize = "40960"
initsize = "0"
initstate = "live"
refcnt = "1"
srcversion = "77F73ABD6150FDA24AFBC22"
taint = ""
uevent = <store method only>
Parameters:
disable_aspm = "Y"
disable_msi = "Y"
Sections:
Last edited by NeoArcher (2025-04-05 08:10:13)
Is the NIC onboard or a (mini) card?
Is it properly seated?
Let's see what else is on the bus
lspci -tvnn
Is the NIC onboard or a (mini) card?
Not sure, where can I look that up? In the docs of my motherboard?
Is it properly seated?
I think so, I didn't mess around with the hardware.
Let's see what else is on the bus
lspci -tvnn
-[0000:00]-+-00.0 Intel Corporation Coffee Lake HOST and DRAM Controller [8086:3e34]
           +-02.0 Intel Corporation WhiskeyLake-U GT2 [UHD Graphics 620] [8086:3ea0]
           +-04.0 Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Thermal Subsystem [8086:1903]
           +-08.0 Intel Corporation Xeon E3-1200 v5/v6 / E3-1500 v5 / 6th/7th/8th Gen Core Processor Gaussian Mixture Model [8086:1911]
           +-12.0 Intel Corporation Cannon Point-LP Thermal Controller [8086:9df9]
           +-14.0 Intel Corporation Cannon Point-LP USB 3.1 xHCI Controller [8086:9ded]
           +-14.2 Intel Corporation Cannon Point-LP Shared SRAM [8086:9def]
           +-16.0 Intel Corporation Cannon Point-LP MEI Controller #1 [8086:9de0]
           +-17.0 Intel Corporation 82801 Mobile SATA Controller [RAID mode] [8086:282a]
           +-1d.0-[01]----00.0 Realtek Semiconductor Co., Ltd. RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet Controller [10ec:8168]
           +-1d.1-[02]----00.0 Realtek Semiconductor Co., Ltd. RTL8821CE 802.11ac PCIe Wireless Network Adapter [10ec:c821]
           +-1d.4-[03]----00.0 Samsung Electronics Co Ltd NVMe SSD Controller 980 (DRAM-less) [144d:a809]
           +-1f.0 Intel Corporation Cannon Point-LP LPC Controller [8086:9d84]
           +-1f.3 Intel Corporation Cannon Point-LP High Definition Audio Controller [8086:9dc8]
           +-1f.4 Intel Corporation Cannon Point-LP SMBus Controller [8086:9da3]
           \-1f.5 Intel Corporation Cannon Point-LP SPI Controller [8086:9da4]
In the docs of my motherboard?
They'll tell you where the NIC is located; the hardware implementation you'll likely have to check by looking at it.
So the nvme is on that bus …
https://wiki.archlinux.org/title/Solid_ … leshooting
nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off iommu=soft
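These are kernel parameters, so they go on the kernel command line via the boot loader configuration (how exactly depends on the boot loader in use). After rebooting, /proc/cmdline shows what the running kernel was actually booted with. A small sketch for checking that nothing got lost; missing_params is a made-up helper name:

```shell
#!/bin/sh
# missing_params: hypothetical helper. Reads a kernel command line on stdin
# and prints every wanted parameter (given as arguments) that is absent.
missing_params() {
    # Pad with spaces so that only whole parameters match.
    cmdline=" $(cat) "
    for p in "$@"; do
        case "$cmdline" in
            *" $p "*) ;;                 # present, nothing to report
            *) printf '%s\n' "$p" ;;     # absent
        esac
    done
}
```

`missing_params nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off iommu=soft < /proc/cmdline` prints nothing if all four are present.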
nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off iommu=soft
Nice, that worked, thanks a lot! But what did those parameters change? Just curious :)
See the link I posted - it's your nvme.
See whether you can drop some of them in the following order of undesired side-effects (battery drain)
- pcie_aspm=off # this affects all pcie devices
- pcie_port_pm=off # ditto
- iommu=soft # a bit of CPU overhead
options rtw88_pci disable_aspm=Y
When you test that one you have to shut down the computer. It's not enough to reboot. This is the advice from Realtek (Ping-Ke Shih).