You are not logged in.

#1 2017-09-04 21:35:49

laird.avery
Member
Registered: 2017-09-04
Posts: 1

XPS 9560: Flood of PCIe errors in log

I was having some issues with my wifi card, a common issue with this laptop. I tried the solution mentioned here:

sudo mv /usr/lib/firmware/ath10k/QCA6174/hw3.0/firmware-6.bin /usr/lib/firmware/ath10k/QCA6174/hw3.0/firmware-6.bin.bak

To try and load some different firmware. This works, as evident from the output of dmesg:

# dmesg | grep ath10k_pci
[    1.937268] ath10k_pci 0000:02:00.0: enabling device (0000 -> 0002)
[    1.938824] ath10k_pci 0000:02:00.0: pci irq msi oper_irq_mode 2 irq_mode 0 reset_mode 0
[    2.206418] ath10k_pci 0000:02:00.0: qca6174 hw3.2 target 0x05030000 chip_id 0x00340aff sub 1a56:1535
[    2.206419] ath10k_pci 0000:02:00.0: kconfig debug 0 debugfs 1 tracing 0 dfs 0 testmode 0
[    2.206847] ath10k_pci 0000:02:00.0: firmware ver WLAN.RM.2.0-00180-QCARMSWPZ-1 api 4 features wowlan,ignore-otp,no-4addr-pad crc32 75dee6c5
[    2.270929] ath10k_pci 0000:02:00.0: board_file api 2 bmi_id N/A crc32 07ee144e
[    2.886927] ath10k_pci 0000:02:00.0: htt-ver 3.26 wmi-op 4 htt-op 3 cal otp max-sta 32 raw 0 hwcrypto 1
[    3.055056] ath10k_pci 0000:02:00.0 wlp2s0: renamed from wlan0

This seemed to fix the wifi issue, so far. However, I'm now getting a bunch of PCIe errors:

# journalctl -p 3 -xe
Sep 04 14:23:34 avery kernel: pcieport 0000:00:1c.0:    [12] Replay Timer Timeout  
Sep 04 14:24:03 avery kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitter ID)
Sep 04 14:24:03 avery kernel: pcieport 0000:00:1c.0:   device [8086:a110] error status/mask=00001000/00002000
Sep 04 14:24:03 avery kernel: pcieport 0000:00:1c.0:    [12] Replay Timer Timeout  
Sep 04 14:24:06 avery kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitter ID)
Sep 04 14:24:06 avery kernel: pcieport 0000:00:1c.0:   device [8086:a110] error status/mask=00001000/00002000
Sep 04 14:24:06 avery kernel: pcieport 0000:00:1c.0:    [12] Replay Timer Timeout  
Sep 04 14:24:06 avery kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitter ID)
Sep 04 14:24:06 avery kernel: pcieport 0000:00:1c.0:   device [8086:a110] error status/mask=00001000/00002000
Sep 04 14:24:06 avery kernel: pcieport 0000:00:1c.0:    [12] Replay Timer Timeout  
Sep 04 14:24:06 avery kernel: pcieport 0000:00:1c.0: PCIe Bus Error: severity=Corrected, type=Data Link Layer, id=00e0(Transmitter ID)
Sep 04 14:24:06 avery kernel: pcieport 0000:00:1c.0:   device [8086:a110] error status/mask=00001000/00002000
Sep 04 14:24:06 avery kernel: pcieport 0000:00:1c.0:    [12] Replay Timer Timeout  
...
...

At first, I thought it was the wifi card complaining. But lspci shows that [8086:a110] is not the wifi card:

# lspci -nn
00:00.0 Host bridge [0600]: Intel Corporation Xeon E3-1200 v6/7th Gen Core Processor Host Bridge/DRAM Registers [8086:5910] (rev 05)
00:01.0 PCI bridge [0604]: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor PCIe Controller (x16) [8086:1901] (rev 05)
00:02.0 VGA compatible controller [0300]: Intel Corporation Device [8086:591b] (rev 04)
00:04.0 Signal processing controller [1180]: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Thermal Subsystem [8086:1903] (rev 05)
00:14.0 USB controller [0c03]: Intel Corporation Sunrise Point-H USB 3.0 xHCI Controller [8086:a12f] (rev 31)
00:14.2 Signal processing controller [1180]: Intel Corporation Sunrise Point-H Thermal subsystem [8086:a131] (rev 31)
00:15.0 Signal processing controller [1180]: Intel Corporation Sunrise Point-H Serial IO I2C Controller #0 [8086:a160] (rev 31)
00:15.1 Signal processing controller [1180]: Intel Corporation Sunrise Point-H Serial IO I2C Controller #1 [8086:a161] (rev 31)
00:16.0 Communication controller [0780]: Intel Corporation Sunrise Point-H CSME HECI #1 [8086:a13a] (rev 31)
00:17.0 SATA controller [0106]: Intel Corporation Sunrise Point-H SATA Controller [AHCI mode] [8086:a103] (rev 31)
00:1c.0 PCI bridge [0604]: Intel Corporation Sunrise Point-H PCI Express Root Port #1 [8086:a110] (rev f1)
...
...

It's the PCI bridge. So, it seems like it's an unrelated problem, but it only started after fiddling with the wifi issue. Does anyone have any insights about what this could be caused by?

Offline

#2 2017-09-04 22:10:46

Kaeptn
Member
Registered: 2017-09-03
Posts: 4

Re: XPS 9560: Flood of PCIe errors in log

Looks familiar to me. In my case the errors came from the M.2 NVMe module - Hynix in my case.
I don't have the XPS 9560 anymore though. Went back to Dell.

Offline

#3 2017-09-05 01:03:37

euclid211
Member
Registered: 2017-08-24
Posts: 6

Re: XPS 9560: Flood of PCIe errors in log

I also have an XPS 9560 with a 1G Hynix drive, and was getting similar errors. I haven't isolated the cause, but setting 'pci=nommconf' in the kernel options got rid of the errors.

Last edited by euclid211 (2017-09-05 01:07:56)

Offline

Board footer

Powered by FluxBB