You are not logged in.
System becomes unresponsive, running application gets killed, then thrown into tty, then SysRq to reboot system, On booting the following error appears in Red -->
../systemd/src/boot/random-seed.c:222@process_random_seed: Failed to read random seed file: Device error
../systemd/src/boot/boot.c:2545@image_start: Error opening root path: Device error
After that instead of booting into system, the system boots to UEFI mode and when i look at "Storage Device" section there, i cant see nvme disk there in PCIE section.
After few hours nvme shows up in UEFI "Storage Device" section and i am able to boot my system normally. This whole thing has happened 4 times till now.
Similar thing happened to me in 2024 -- https://bbs.archlinux.org/viewtopic.php?id=296362
But this time it has gotten more extreme where system won't boot for few hours, nvme entry missing in UEFI and the above errors and the following errors -->
Also after it happned to me the third time i started using the following parameters as this solved the problem previously, but then this happened again today (4th time)-->
1. nvme_core.default_ps_max_latency_us=0
2. pcie_aspm=off
3. iommu=off
nvme model --
KINGSTON OM8PCP3512F-AI1
The following command output are run in Fedora-live-boot media ( during the time system is un-bootable) -->
liveuser@localhost-live:~$ lsblk
NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINTS
loop0 7:0 0 2.4G 1 loop /run/rootfsbase
sda 8:0 0 931.5G 0 disk
└─sda1 8:1 0 931.5G 0 part
sdb 8:16 1 57.3G 0 disk
├─sdb1 8:17 1 57.3G 0 part
└─sdb2 8:18 1 32M 0 part
mmcblk0 179:0 0 14.4G 0 disk
└─mmcblk0p1 179:1 0 14.4G 0 part
zram0 251:0 0 8G 0 disk [SWAP]
nvme0n1 259:0 0 0B 0 disk
├─nvme0n1p1 259:1 0 1.5G 0 part
├─nvme0n1p2 259:2 0 12G 0 part
├─nvme0n1p3 259:3 0 451.4G 0 part
└─nvme0n1p4 259:4 0 12G 0 part
liveuser@localhost-live:~$ sudo btrfs check /dev/nvme0n1p3
Opening filesystem to check...
No valid Btrfs found on /dev/nvme0n1p3
ERROR: cannot open file system
liveuser@localhost-live:~$ sudo nvme list
Node Generic SN Model Namespace Usage Format FW Rev
--------------------- --------------------- -------------------- ---------------------------------------- ---------- -------------------------- ---------------- --------
liveuser@localhost-live:~$ sudo file -s /dev/nvme0n1p1
/dev/nvme0n1p1: ERROR: cannot read `/dev/nvme0n1p1' (Input/output error)
liveuser@localhost-live:~$ sudo file -s /dev/nvme0n1p2
/dev/nvme0n1p2: ERROR: cannot read `/dev/nvme0n1p2' (Input/output error)
liveuser@localhost-live:~$ sudo file -s /dev/nvme0n1p3
/dev/nvme0n1p3: ERROR: cannot read `/dev/nvme0n1p3' (Input/output error)
liveuser@localhost-live:~$ sudo file -s /dev/nvme0n1p4
/dev/nvme0n1p4: ERROR: cannot read `/dev/nvme0n1p4' (Input/output error)
00:00.0 Host bridge: Intel Corporation 8th Gen Core Processor Host Bridge/DRAM Registers (rev 07)
DeviceName: Onboard - Other
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0
Capabilities: <access denied>
Kernel driver in use: skl_uncore
00:01.0 PCI bridge: Intel Corporation 6th-10th Gen Core Processor PCIe Controller (x16) (rev 07) (prog-if 00 [Normal decode])
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 120
Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
I/O behind bridge: 4000-4fff [size=4K] [16-bit]
Memory behind bridge: a4000000-a50fffff [size=17M] [32-bit]
Prefetchable memory behind bridge: 90000000-a20fffff [size=289M] [32-bit]
Capabilities: <access denied>
Kernel driver in use: pcieport
00:02.0 VGA compatible controller: Intel Corporation CoffeeLake-H GT2 [UHD Graphics 630] (prog-if 00 [VGA controller])
DeviceName: Onboard - Video
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 145
Memory at a3000000 (64-bit, non-prefetchable) [size=16M]
Memory at 80000000 (64-bit, prefetchable) [size=256M]
I/O ports at 5000 [size=64]
Expansion ROM at 000c0000 [virtual] [disabled] [size=128K]
Capabilities: <access denied>
Kernel driver in use: i915
Kernel modules: i915
00:12.0 Signal processing controller: Intel Corporation Cannon Lake PCH Thermal Controller (rev 10)
DeviceName: Onboard - Other
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: fast devsel, IRQ 16
Memory at a5423000 (64-bit, non-prefetchable) [size=4K]
Capabilities: <access denied>
Kernel driver in use: intel_pch_thermal
Kernel modules: intel_pch_thermal
00:14.0 USB controller: Intel Corporation Cannon Lake PCH USB 3.1 xHCI Host Controller (rev 10) (prog-if 30 [XHCI])
DeviceName: Onboard - Other
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, medium devsel, latency 0, IRQ 124
Memory at a5400000 (64-bit, non-prefetchable) [size=64K]
Capabilities: <access denied>
Kernel driver in use: xhci_hcd
00:14.2 RAM memory: Intel Corporation Cannon Lake PCH Shared SRAM (rev 10)
DeviceName: Onboard - Other
Subsystem: Intel Corporation Device 7270
Flags: fast devsel
Memory at a541a000 (64-bit, non-prefetchable) [disabled] [size=8K]
Memory at a5422000 (64-bit, non-prefetchable) [disabled] [size=4K]
Capabilities: <access denied>
00:14.3 Network controller: Intel Corporation Cannon Lake PCH CNVi WiFi (rev 10)
DeviceName: Onboard - Ethernet
Subsystem: Intel Corporation Wireless-AC 9560
Flags: bus master, fast devsel, latency 0, IRQ 149
Memory at a5414000 (64-bit, non-prefetchable) [size=16K]
Capabilities: <access denied>
Kernel driver in use: iwlwifi
Kernel modules: iwlwifi
00:15.0 Serial bus controller: Intel Corporation Cannon Lake PCH Serial IO I2C Controller #0 (rev 10)
DeviceName: Onboard - Other
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 16
Memory at a2100000 (64-bit, non-prefetchable) [size=4K]
Capabilities: <access denied>
Kernel driver in use: intel-lpss
00:15.2 Serial bus controller: Intel Corporation Cannon Lake PCH Serial IO I2C Controller #2 (rev 10)
DeviceName: Onboard - Other
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 18
Memory at a2101000 (64-bit, non-prefetchable) [size=4K]
Capabilities: <access denied>
Kernel driver in use: intel-lpss
00:16.0 Communication controller: Intel Corporation Cannon Lake PCH HECI Controller (rev 10)
DeviceName: Onboard - Other
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 147
Memory at a541f000 (64-bit, non-prefetchable) [size=4K]
Capabilities: <access denied>
Kernel driver in use: mei_me
Kernel modules: mei_me
00:17.0 SATA controller: Intel Corporation Cannon Lake Mobile PCH SATA AHCI Controller (rev 10) (prog-if 01 [AHCI 1.0])
DeviceName: Onboard - SATA
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, 66MHz, medium devsel, latency 0, IRQ 123
Memory at a5418000 (32-bit, non-prefetchable) [size=8K]
Memory at a541e000 (32-bit, non-prefetchable) [size=256]
I/O ports at 5090 [size=8]
I/O ports at 5080 [size=4]
I/O ports at 5060 [size=32]
Memory at a541d000 (32-bit, non-prefetchable) [size=2K]
Capabilities: <access denied>
Kernel driver in use: ahci
00:1d.0 PCI bridge: Intel Corporation Cannon Lake PCH PCI Express Root Port #9 (rev f0) (prog-if 00 [Normal decode])
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 121
Bus: primary=00, secondary=02, subordinate=02, sec-latency=0
I/O behind bridge: [disabled] [16-bit]
Memory behind bridge: a5300000-a53fffff [size=1M] [32-bit]
Prefetchable memory behind bridge: [disabled] [64-bit]
Capabilities: <access denied>
Kernel driver in use: pcieport
00:1d.6 PCI bridge: Intel Corporation Cannon Lake PCH PCI Express Root Port #15 (rev f0) (prog-if 00 [Normal decode])
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 122
Bus: primary=00, secondary=03, subordinate=03, sec-latency=0
I/O behind bridge: 3000-3fff [size=4K] [16-bit]
Memory behind bridge: a5200000-a52fffff [size=1M] [32-bit]
Prefetchable memory behind bridge: [disabled] [64-bit]
Capabilities: <access denied>
Kernel driver in use: pcieport
00:1f.0 ISA bridge: Intel Corporation HM470 Chipset LPC/eSPI Controller (rev 10)
DeviceName: Onboard - Other
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, medium devsel, latency 0
00:1f.3 Audio device: Intel Corporation Cannon Lake PCH cAVS (rev 10)
DeviceName: Onboard - Sound
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 32, IRQ 148
Memory at a5410000 (64-bit, non-prefetchable) [size=16K]
Memory at a5100000 (64-bit, non-prefetchable) [size=1M]
Capabilities: <access denied>
Kernel driver in use: snd_hda_intel
Kernel modules: snd_hda_intel, snd_soc_avs, snd_sof_pci_intel_cnl
00:1f.4 SMBus: Intel Corporation Cannon Lake PCH SMBus Controller (rev 10)
DeviceName: Onboard - Other
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: medium devsel, IRQ 16
Memory at a541c000 (64-bit, non-prefetchable) [size=256]
I/O ports at efa0 [size=32]
Kernel driver in use: i801_smbus
Kernel modules: i2c_i801
00:1f.5 Serial bus controller: Intel Corporation Cannon Lake PCH SPI Controller (rev 10)
DeviceName: Onboard - Other
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: fast devsel
Memory at fe010000 (32-bit, non-prefetchable) [size=4K]
Kernel driver in use: intel-spi
Kernel modules: spi_intel_pci
01:00.0 VGA compatible controller: NVIDIA Corporation TU116M [GeForce GTX 1660 Ti Mobile] (rev a1) (prog-if 00 [VGA controller])
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 144
Memory at a4000000 (32-bit, non-prefetchable) [size=16M]
Memory at 90000000 (64-bit, prefetchable) [size=256M]
Memory at a0000000 (64-bit, prefetchable) [size=32M]
I/O ports at 4000 [size=128]
Expansion ROM at a5000000 [disabled] [size=512K]
Capabilities: <access denied>
Kernel driver in use: nouveau
Kernel modules: nouveau
01:00.1 Audio device: NVIDIA Corporation TU116 High Definition Audio Controller (rev a1)
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 17
Memory at a5080000 (32-bit, non-prefetchable) [size=16K]
Capabilities: <access denied>
Kernel driver in use: snd_hda_intel
Kernel modules: snd_hda_intel
01:00.2 USB controller: NVIDIA Corporation TU116 USB 3.1 Host Controller (rev a1) (prog-if 30 [XHCI])
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: fast devsel, IRQ 132
Memory at a2000000 (64-bit, prefetchable) [size=256K]
Memory at a2040000 (64-bit, prefetchable) [size=64K]
Capabilities: <access denied>
Kernel driver in use: xhci_hcd
01:00.3 Serial bus controller: NVIDIA Corporation TU116 USB Type-C UCSI Controller (rev a1)
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 150
Memory at a5084000 (32-bit, non-prefetchable) [size=4K]
Capabilities: <access denied>
Kernel driver in use: nvidia-gpu
Kernel modules: i2c_nvidia_gpu
02:00.0 Non-Volatile memory controller: Kingston Technology Company, Inc. OM8PCP Design-In PCIe 3 NVMe SSD (DRAM-less) (rev 01) (prog-if 02 [NVM Express])
Subsystem: Kingston Technology Company, Inc. OM8PCP Design-In PCIe 3 NVMe SSD (DRAM-less)
Flags: fast devsel, IRQ 16
Memory at a5300000 (64-bit, non-prefetchable) [size=16K]
Capabilities: <access denied>
Kernel driver in use: nvme
Kernel modules: nvme
03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet Controller (rev 15)
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 12b8
Flags: bus master, fast devsel, latency 0, IRQ 18
I/O ports at 3000 [size=256]
Memory at a5204000 (64-bit, non-prefetchable) [size=4K]
Memory at a5200000 (64-bit, non-prefetchable) [size=16K]
Capabilities: <access denied>
Kernel driver in use: r8169
Kernel modules: r8169
I have been redirecting my journal output to SD card for last 3 days, This journal contains 2 events of this bug out of 4 times this has happned. But i am no able to find the log entry that shows this. Earlier when my system was booting successfully, i was able to `--grep nvme` and `--grep btrfs` and outputs were shown. But now that i am accessing them through fedora-live-system using `journalctl --directory=....`, i am not able to grep for them.
command used for above pastebin -->
journalctl --directory=/run/media/liveuser/e84bf4dc-456d-4493-8dc0-8b656625ed6c/ | curl -F 'file=@-' 0x0.st
https://imgur.com/a/D3gFtbO Image showing error when booting the system, and nvme missing in UEFI mode
https://imgur.com/a/KQrRU9x Image showing error when booting Fedora-live-media
Last edited by phoenix324 (Today 08:07:14)
Offline
try to run smart tests from any live boot media to verify the health of the drive.
Disliking systemd intensely, but not satisfied with alternatives so focusing on taming systemd.
clean chroot building not flexible enough ?
Try clean chroot manager by graysky
Offline
It's actually a very direct hit for https://wiki.archlinux.org/title/Solid_ … ST_support (down to the kingston brand)
sudo journalctl -b --directory=/run/media/liveuser/e84bf4dc-456d-4493-8dc0-8b656625ed6c/ | curl -F 'file=@-' 0x0.st
The journal you posted lacks all system messages, so I guess fedora doesn't run the live user in a privileged group - I wanted to check whether the parameters apply, also add "pcie_port_pm=off" to the list.
Online
try to run smart tests from any live boot media to verify the health of the drive.
nvme not showing up at all. I tried booting into system 3 times but it failed, since then i am spending time in fedora-live-media and doing my Anki.
The journal you posted lacks all system messages, so I guess fedora doesn't run the live user in a privileged group
Wow you were right, ran the journal with sudo and log output showing the error. Here you go --> http://0x0.st/8gIs.txt
I wanted to check whether the parameters apply, also add "pcie_port_pm=off" to the list.
Will do once nvme starts showing up again. Its been quite some time since the error, will try to boot into system.
Last edited by phoenix324 (Today 15:19:20)
Offline
Sytem bootable now.
smartctl 7.5 2025-04-30 r5714 [x86_64-linux-6.14.9-arch1-1] (local build)
Copyright (C) 2002-25, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Number: KINGSTON OM8PCP3512F-AI1
Serial Number: 50026B7683ECE53F
Firmware Version: ECFK52.8
PCI Vendor/Subsystem ID: 0x2646
IEEE OUI Identifier: 0x0026b7
Total NVM Capacity: 512,110,190,592 [512 GB]
Unallocated NVM Capacity: 0
Controller ID: 1
NVMe Version: 1.3
Number of Namespaces: 1
Namespace 1 Size/Capacity: 512,110,190,592 [512 GB]
Namespace 1 Formatted LBA Size: 512
Namespace 1 IEEE EUI-64: 0026b7 683ece53f5
Local Time is: Sun Jun 8 21:04:19 2025 IST
Firmware Updates (0x12): 1 Slot, no Reset required
Optional Admin Commands (0x0017): Security Format Frmw_DL Self_Test
Optional NVM Commands (0x0054): DS_Mngmt Sav/Sel_Feat Timestmp
Log Page Attributes (0x0a): Cmd_Eff_Lg Telmtry_Lg
Maximum Data Transfer Size: 512 Pages
Warning Comp. Temp. Threshold: 85 Celsius
Critical Comp. Temp. Threshold: 90 Celsius
Supported Power States
St Op Max Active Idle RL RT WL WT Ent_Lat Ex_Lat
0 + 5.74W - - 0 0 0 0 0 0
1 + 5.21W - - 1 1 1 1 0 0
2 + 4.95W - - 2 2 2 2 0 0
3 - 0.0490W - - 3 3 3 3 2000 2000
4 - 0.0018W - - 4 4 4 4 25000 25000
Supported LBA Sizes (NSID 0x1)
Id Fmt Data Metadt Rel_Perf
0 + 512 0 2
1 - 4096 0 1
=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
SMART/Health Information (NVMe Log 0x02, NSID 0xffffffff)
Critical Warning: 0x00
Temperature: 44 Celsius
Available Spare: 100%
Available Spare Threshold: 10%
Percentage Used: 26%
Data Units Read: 78,172,519 [40.0 TB]
Data Units Written: 70,373,774 [36.0 TB]
Host Read Commands: 771,071,899
Host Write Commands: 1,248,656,470
Controller Busy Time: 6,377
Power Cycles: 14,509
Power On Hours: 12,301
Unsafe Shutdowns: 219
Media and Data Integrity Errors: 0
Error Information Log Entries: 11,358
Warning Comp. Temperature Time: 0
Critical Comp. Temperature Time: 0
Thermal Temp. 1 Transition Count: 17
Thermal Temp. 1 Total Time: 3
Error Information (NVMe Log 0x01, 16 of 63 entries)
Num ErrCount SQId CmdId Status PELoc LBA NSID VS Message
0 11358 0 0x0018 0x4004 0x028 0 0 - Invalid Field in Command
Self-test Log (NVMe Log 0x06, NSID 0xffffffff)
Self-test status: No self-test in progress
Num Test_Description Status Power_on_Hours Failing_LBA NSID Seg SCT Code
0 Extended Completed without error 12301 - - - - -
1 Short Aborted: Controller Reset 12268 - - - - -
2 Extended Completed without error 12247 - - - - -
3 Short Completed without error 12247 - - - - -
4 Short Completed without error 12247 - - - - -
5 Extended Completed without error 12246 - - - - -
6 Extended Completed without error 12226 - - - - -
7 Short Completed without error 12206 - - - - -
8 Short Completed without error 10945 - - - - -
9 Short Aborted: Controller Reset 9207 - - - - -
Few questions -->
1. Is drive ok (nvme)?
2. is fs (btrfs) ok?
3. Do i need to buy new nvme?
Offline
Jun 06 16:09:52 archlinux kernel: Command line: initrd=\intel-ucode.img initrd=\initramfs-linux.img root=UUID=ff5d4ba3-25ce-46cd-9bab-dc98091801b1 rw rootflags=subvol=@ loglevel=3 mitigations=off iommu=soft pcie_aspm=off nvme_core.default_ps_max_latency_us=0
Jun 06 16:10:12 ArchLinux systemd[1]: Starting TLP system startup/shutdown...
Jun 06 16:10:12 ArchLinux tlp[1004]: Applying power save settings...done.
Jun 06 16:09:52 archlinux kernel: nvme nvme0: D3 entry latency set to 10 seconds
Jun 07 01:07:34 ArchLinux kernel: nvme nvme0: D3 entry latency set to 10 seconds
Jun 07 02:16:36 ArchLinux kernel: nvme nvme0: D3 entry latency set to 10 seconds
Jun 07 05:03:05 ArchLinux kernel: nvme nvme0: D3 entry latency set to 10 seconds
Jun 07 18:31:30 ArchLinux kernel: nvme nvme0: D3 entry latency set to 10 seconds
Jun 08 05:17:28 ArchLinux kernel: nvme nvme0: D3 entry latency set to 10 seconds
Jun 08 06:06:24 ArchLinux kernel: nvme nvme0: D3 entry latency set to 10 seconds
I suspect it's TLP?
Online
[phoenix@ArchLinux ~]$ journalctl --boot 0 --grep D3
Jun 08 22:13:55 archlinux kernel: pci 0000:00:01.0: PME# supported from D0 D3hot D3cold
Jun 08 22:13:55 archlinux kernel: pci 0000:00:14.0: PME# supported from D3hot D3cold
Jun 08 22:13:55 archlinux kernel: pci 0000:00:14.3: PME# supported from D0 D3hot D3cold
Jun 08 22:13:55 archlinux kernel: pci 0000:00:16.0: PME# supported from D3hot
Jun 08 22:13:55 archlinux kernel: pci 0000:00:17.0: PME# supported from D3hot
Jun 08 22:13:55 archlinux kernel: pci 0000:00:1d.0: PME# supported from D0 D3hot D3cold
Jun 08 22:13:55 archlinux kernel: pci 0000:00:1d.6: PME# supported from D0 D3hot D3cold
Jun 08 22:13:55 archlinux kernel: pci 0000:00:1f.3: PME# supported from D3hot D3cold
Jun 08 22:13:55 archlinux kernel: pci 0000:01:00.0: PME# supported from D0 D3hot D3cold
Jun 08 22:13:55 archlinux kernel: pci 0000:01:00.2: PME# supported from D0 D3hot D3cold
Jun 08 22:13:55 archlinux kernel: pci 0000:01:00.3: PME# supported from D0 D3hot D3cold
Jun 08 22:13:55 archlinux kernel: pci 0000:03:00.0: PME# supported from D0 D1 D2 D3hot D3cold
Jun 08 22:13:55 archlinux kernel: pci 0000:01:00.1: extending delay after power-on from D3hot to 20 msec
Jun 08 22:13:55 archlinux kernel: nvme nvme0: D3 entry latency set to 10 seconds
[phoenix@ArchLinux ~]$ cat /sys/module/nvme_core/parameters/default_ps_max_latency_us
0
Below is my tlp config, what do i need to change ?
--- TLP 1.8.0 --------------------------------------------
+++ Configured Settings:
/etc/tlp.conf L0038: TLP_ENABLE="1"
defaults.conf L0007: TLP_WARN_LEVEL="3"
defaults.conf L0008: TLP_MSG_COLORS="91 93 1 92"
/etc/tlp.conf L0072: TLP_PERSISTENT_DEFAULT="1"
defaults.conf L0010: DISK_IDLE_SECS_ON_AC="0"
defaults.conf L0011: DISK_IDLE_SECS_ON_BAT="2"
defaults.conf L0012: MAX_LOST_WORK_SECS_ON_AC="15"
defaults.conf L0013: MAX_LOST_WORK_SECS_ON_BAT="60"
defaults.conf L0014: CPU_ENERGY_PERF_POLICY_ON_AC="balance_performance"
defaults.conf L0015: CPU_ENERGY_PERF_POLICY_ON_BAT="balance_power"
defaults.conf L0016: NMI_WATCHDOG="0"
defaults.conf L0017: DISK_DEVICES="nvme0n1 sda"
defaults.conf L0018: DISK_APM_LEVEL_ON_AC="254 254"
defaults.conf L0019: DISK_APM_LEVEL_ON_BAT="128 128"
defaults.conf L0020: DISK_APM_CLASS_DENYLIST="usb ieee1394"
defaults.conf L0021: DISK_IOSCHED="keep keep"
defaults.conf L0022: SATA_LINKPWR_ON_AC="med_power_with_dipm"
defaults.conf L0023: SATA_LINKPWR_ON_BAT="med_power_with_dipm"
defaults.conf L0024: AHCI_RUNTIME_PM_ON_AC="on"
/etc/tlp.conf L0295: AHCI_RUNTIME_PM_ON_BAT="auto"
defaults.conf L0026: AHCI_RUNTIME_PM_TIMEOUT="15"
defaults.conf L0027: PCIE_ASPM_ON_AC="default"
defaults.conf L0028: PCIE_ASPM_ON_BAT="default"
defaults.conf L0029: RADEON_DPM_PERF_LEVEL_ON_AC="auto"
defaults.conf L0030: RADEON_DPM_PERF_LEVEL_ON_BAT="auto"
defaults.conf L0031: RADEON_POWER_PROFILE_ON_AC="default" #! Parameter is deprecated: feature will be removed with the next release
defaults.conf L0032: RADEON_POWER_PROFILE_ON_BAT="default" #! Parameter is deprecated: feature will be removed with the next release
defaults.conf L0033: AMDGPU_ABM_LEVEL_ON_AC="0"
defaults.conf L0034: AMDGPU_ABM_LEVEL_ON_BAT="1"
defaults.conf L0035: WIFI_PWR_ON_AC="off"
defaults.conf L0036: WIFI_PWR_ON_BAT="on"
defaults.conf L0037: WOL_DISABLE="Y"
defaults.conf L0038: SOUND_POWER_SAVE_ON_AC="1"
defaults.conf L0039: SOUND_POWER_SAVE_ON_BAT="1"
defaults.conf L0040: SOUND_POWER_SAVE_CONTROLLER="Y"
defaults.conf L0041: BAY_POWEROFF_ON_AC="0"
defaults.conf L0042: BAY_POWEROFF_ON_BAT="0"
defaults.conf L0043: BAY_DEVICE="sr0"
defaults.conf L0044: RUNTIME_PM_ON_AC="on"
defaults.conf L0045: RUNTIME_PM_ON_BAT="auto"
defaults.conf L0046: RUNTIME_PM_DRIVER_DENYLIST="mei_me nouveau radeon xhci_hcd"
/etc/tlp.conf L0424: USB_AUTOSUSPEND="1"
defaults.conf L0048: USB_EXCLUDE_AUDIO="1"
defaults.conf L0049: USB_EXCLUDE_BTUSB="0"
defaults.conf L0050: USB_EXCLUDE_PHONE="0"
defaults.conf L0051: USB_EXCLUDE_PRINTER="1"
defaults.conf L0052: USB_EXCLUDE_WWAN="0"
/etc/tlp.conf L0476: RESTORE_DEVICE_STATE_ON_STARTUP="1"
defaults.conf L0054: RESTORE_THRESHOLDS_ON_BAT="0"
defaults.conf L0055: NATACPI_ENABLE="1"
defaults.conf L0056: TPSMAPI_ENABLE="1"
/etc/tlp.conf L0066: TLP_DEFAULT_MODE="BAT"
/etc/tlp.conf L0211: MEM_SLEEP_ON_AC="deep"
/etc/tlp.conf L0212: MEM_SLEEP_ON_BAT="deep"
/etc/tlp.conf L0556: DEVICES_TO_DISABLE_ON_LAN_CONNECT="wifi wwan"
/etc/tlp.conf L0557: DEVICES_TO_DISABLE_ON_WIFI_CONNECT="wwan"
/etc/tlp.conf L0558: DEVICES_TO_DISABLE_ON_WWAN_CONNECT="wifi"
+++ System Info
System = Micro-Star International Co., Ltd. REV:1.0 GL65 Leopard 9SDK
BIOS = E16U7IMS.508
OS Release = Arch Linux
Kernel = 6.14.9-arch1-1 #1 SMP PREEMPT_DYNAMIC Thu, 29 May 2025 21:42:15 +0000 x86_64
/proc/cmdline = initrd=\intel-ucode.img initrd=\initramfs-linux.img root=UUID=ff5d4ba3-25ce-46cd-9bab-dc98091801b1 rw rootflags=subvol=@ loglevel=3 mitigations=off iommu=soft pcie_aspm=off pcie_port_pm=off nvme_core.default_ps_max_latency_us=0
Init system = systemd
Boot mode = UEFI
Suspend mode = s2idle [deep]
+++ TLP Status
State = enabled
RDW state = enabled
Last run = 10:15:34 PM, 78 sec(s) ago
Mode = battery (persistent)
Power source = AC
+++ Processor
CPU model = Intel(R) Core(TM) i7-9750H CPU @ 2.60GHz
/sys/devices/system/cpu/cpu0/cpufreq/scaling_driver = intel_pstate
/sys/devices/system/cpu/cpu0/cpufreq/scaling_governor = powersave
/sys/devices/system/cpu/cpu0/cpufreq/scaling_available_governors = performance powersave
/sys/devices/system/cpu/cpu0/cpufreq/scaling_min_freq = 800000 [kHz]
/sys/devices/system/cpu/cpu0/cpufreq/scaling_max_freq = 4500000 [kHz]
/sys/devices/system/cpu/cpu0/cpufreq/cpuinfo_min_freq = 800000 [kHz]
/sys/devices/system/cpu/cpu0/cpufreq/cpuinfo_max_freq = 4500000 [kHz]
/sys/devices/system/cpu/cpu0/cpufreq/energy_performance_preference = balance_power [EPP]
/sys/devices/system/cpu/cpu0/cpufreq/energy_performance_available_preferences = default performance balance_performance balance_power power
/sys/devices/system/cpu/cpu1..cpu11: omitted for clarity, use -v to show all
/sys/devices/system/cpu/intel_pstate/status = active
/sys/devices/system/cpu/intel_pstate/min_perf_pct = 17 [%]
/sys/devices/system/cpu/intel_pstate/max_perf_pct = 100 [%]
/sys/devices/system/cpu/intel_pstate/no_turbo = 0
/sys/devices/system/cpu/intel_pstate/hwp_dynamic_boost = 0
/sys/devices/system/cpu/intel_pstate/turbo_pct = 50 [%]
/sys/devices/system/cpu/intel_pstate/num_pstates = 38
/sys/module/workqueue/parameters/power_efficient = Y
/proc/sys/kernel/nmi_watchdog = 0
+++ Platform Profile
/sys/firmware/acpi/platform_profile = (not available)
/sys/firmware/acpi/platform_profile_choices = (not available)
+++ Temperatures
CPU temp = 53 [°C]
Fan speed = (not available)
+++ File System
/proc/sys/vm/laptop_mode = 2
/proc/sys/vm/dirty_writeback_centisecs = 6000
/proc/sys/vm/dirty_expire_centisecs = 6000
/proc/sys/vm/dirty_ratio = 20
/proc/sys/vm/dirty_background_ratio = 10
+++ Disks
Devices = nvme0n1 sda
/dev/nvme0n1:
Type = NVMe
Disk ID = nvme-KINGSTON_OM8PCP3512F-AI1_****************
Model = KINGSTON OM8PCP3512F-AI1
Firmware = ECFK52.8
Temp = 45 °C
Scheduler = [none] mq-deadline kyber bfq (multi queue)
Runtime PM:
/sys/block/nvme0n1/device/power/control = auto, autosuspend_delay_ms = (not available)
SMART info:
Critical Warning: 0x00
Temperature: 45 Celsius
Available Spare: 100%
Available Spare Threshold: 10%
Percentage Used: 26%
Data Units Written: 70,374,623 [36.0 TB]
Power Cycles: 14,510
Power On Hours: 12,301
Unsafe Shutdowns: 219
Media and Data Integrity Errors: 0
/dev/sda:
Type = SATA
Disk ID = ata-ST1000LM035-1RK172_********
Model = ST1000LM035-1RK172
Firmware = SBM3
APM Level = 128
Status = active/idle
Host = host5
Scheduler = none [mq-deadline] kyber bfq (multi queue)
Runtime PM:
/sys/block/sda/device/power/control = auto, autosuspend_delay_ms = 15000
SMART info:
4 Start_Stop_Count = 18668
5 Reallocated_Sector_Ct = 1208
9 Power_On_Hours = 6470 [h]
12 Power_Cycle_Count = 7814
190 Airflow_Temperature_Cel = 37 [°C]
191 G-Sense_Error_Rate = 57
193 Load_Cycle_Count = 165662
194 Temperature_Celsius = 37 (0 12 0 [°C]
241 Total_LBAs_Written = 14.561 [TB]
+++ AHCI Link Power Management (ALPM) :: SATA Links
/sys/bus/pci/devices/0000:00:17.0/ata1/host0/scsi_host/host0/link_power_management_policy = med_power_with_dipm
/sys/bus/pci/devices/0000:00:17.0/ata2/host1/scsi_host/host1/link_power_management_policy = med_power_with_dipm
/sys/bus/pci/devices/0000:00:17.0/ata3/host2/scsi_host/host2/link_power_management_policy = med_power_with_dipm
/sys/bus/pci/devices/0000:00:17.0/ata4/host3/scsi_host/host3/link_power_management_policy = med_power_with_dipm
/sys/bus/pci/devices/0000:00:17.0/ata5/host4/scsi_host/host4/link_power_management_policy = med_power_with_dipm
/sys/bus/pci/devices/0000:00:17.0/ata6/host5/scsi_host/host5/link_power_management_policy = med_power_with_dipm -- sda
+++ AHCI Port Runtime Power Management :: SATA/ATA Ports
/sys/bus/pci/devices/0000:00:17.0/ata1/power/control = auto
/sys/bus/pci/devices/0000:00:17.0/ata2/power/control = auto
/sys/bus/pci/devices/0000:00:17.0/ata3/power/control = auto
/sys/bus/pci/devices/0000:00:17.0/ata4/power/control = auto
/sys/bus/pci/devices/0000:00:17.0/ata5/power/control = auto
/sys/bus/pci/devices/0000:00:17.0/ata6/power/control = auto -- sda
+++ Intel Graphics
/sys/class/drm/card1/device/driver = i915
/sys/class/drm/card1/power/rc6_enable = 1 (enabled)
/sys/kernel/debug/dri/1/i915_fbc_status = FBC enabled
/sys/kernel/debug/dri/1/i915_edp_psr_status = PSR mode: disabled
/sys/class/drm/card1/gt_min_freq_mhz = 350 [MHz]
/sys/class/drm/card1/gt_max_freq_mhz = 1150 [MHz]
/sys/class/drm/card1/gt_boost_freq_mhz = 1150 [MHz]
/sys/class/drm/card1/gt_RPn_freq_mhz = 350 [MHz] (GPU min)
/sys/class/drm/card1/gt_RP0_freq_mhz = 1150 [MHz] (GPU max)
+++ Wireless
bluetooth = on
nfc = none (no device)
wifi = on
wwan = none (no device)
hci0(btusb) : bluetooth, connected
wlo1(iwlwifi) : wifi, connected, power management = on
+++ Audio
/sys/module/snd_hda_intel/parameters/power_save = 1
/sys/module/snd_hda_intel/parameters/power_save_controller = Y
+++ PCIe Active State Power Management
/sys/module/pcie_aspm/parameters/policy = [default] performance powersave powersupersave (using BIOS preferences)
+++ PCIe Runtime Power Management
Enable devices = (disabled)
Disable devices = (disabled)
Device denylist = (disabled)
Driver denylist = mei_me nouveau radeon xhci_hcd
/sys/bus/pci/devices/0000:00:00.0/power/control = auto (0x060000, Host bridge, skl_uncore)
/sys/bus/pci/devices/0000:00:01.0/power/control = auto (0x060400, PCI bridge, pcieport)
/sys/bus/pci/devices/0000:00:02.0/power/control = auto (0x030000, VGA compatible controller, i915)
/sys/bus/pci/devices/0000:00:12.0/power/control = auto (0x118000, Signal processing controller, intel_pch_thermal)
/sys/bus/pci/devices/0000:00:14.0/power/control = on (0x0c0330, USB controller, xhci_hcd)
/sys/bus/pci/devices/0000:00:14.2/power/control = auto (0x050000, RAM memory, no driver)
/sys/bus/pci/devices/0000:00:14.3/power/control = auto (0x028000, Network controller, iwlwifi)
/sys/bus/pci/devices/0000:00:15.0/power/control = auto (0x0c8000, Serial bus controller, intel-lpss)
/sys/bus/pci/devices/0000:00:15.2/power/control = auto (0x0c8000, Serial bus controller, intel-lpss)
/sys/bus/pci/devices/0000:00:16.0/power/control = auto (0x078000, Communication controller, mei_me)
/sys/bus/pci/devices/0000:00:17.0/power/control = auto (0x010601, SATA controller, ahci)
/sys/bus/pci/devices/0000:00:1d.0/power/control = auto (0x060400, PCI bridge, pcieport)
/sys/bus/pci/devices/0000:00:1d.6/power/control = auto (0x060400, PCI bridge, pcieport)
/sys/bus/pci/devices/0000:00:1f.0/power/control = auto (0x060100, ISA bridge, no driver)
/sys/bus/pci/devices/0000:00:1f.3/power/control = auto (0x040300, Audio device, snd_hda_intel)
/sys/bus/pci/devices/0000:00:1f.4/power/control = auto (0x0c0500, SMBus, i801_smbus)
/sys/bus/pci/devices/0000:00:1f.5/power/control = auto (0x0c8000, Serial bus controller, intel-spi)
/sys/bus/pci/devices/0000:02:00.0/power/control = auto (0x010802, Non-Volatile memory controller, nvme)
/sys/bus/pci/devices/0000:03:00.0/power/control = auto (0x020000, Ethernet controller, r8169)
+++ USB
Autosuspend = enabled
Device allowlist = (not configured)
Device denylist = (not configured)
Exclude audio = enabled
Exclude bluetooth = disabled
Exclude phones = disabled
Exclude printers = enabled
Exclude WWAN = disabled
Bus 001 Device 001 ID 1d6b:0002 control = auto, autosuspend_delay_ms = 0 -- Linux Foundation 2.0 root hub (hub)
Bus 001 Device 002 ID 0c45:8603 control = on, autosuspend_delay_ms = 2000 -- Microdia SI Gaming Keyboard (usbhid)
Bus 001 Device 003 ID 0bda:0129 control = auto, autosuspend_delay_ms = 2000 -- Realtek Semiconductor Corp. RTS5129 Card Reader Controller (rtsx_usb)
Bus 001 Device 004 ID 1038:1122 control = on, autosuspend_delay_ms = 2000 -- SteelSeries ApS SteelSeries KLC (usbhid)
Bus 001 Device 005 ID 04f2:b695 control = auto, autosuspend_delay_ms = 2000 -- Chicony Electronics Co., Ltd HD Webcam (no driver)
Bus 001 Device 006 ID 8087:0aaa control = auto, autosuspend_delay_ms = 2000 -- Intel Corp. Bluetooth 9460/9560 Jefferson Peak (JfP) (btusb)
Bus 002 Device 001 ID 1d6b:0003 control = auto, autosuspend_delay_ms = 0 -- Linux Foundation 3.0 root hub (hub)
+++ Battery Care
Plugin: generic
Supported features: none available
+++ Battery Status: BAT1
/sys/class/power_supply/BAT1/manufacturer = MSI
/sys/class/power_supply/BAT1/model_name = BIF0_9
/sys/class/power_supply/BAT1/cycle_count = 0 (or not supported)
/sys/class/power_supply/BAT1/charge_full_design = 4730 [mAh]
/sys/class/power_supply/BAT1/charge_full = 3634 [mAh]
/sys/class/power_supply/BAT1/charge_now = 2905 [mAh]
/sys/class/power_supply/BAT1/current_now = 0 [mA]
/sys/class/power_supply/BAT1/status = Not charging
/sys/class/power_supply/BAT1/charge_control_start_threshold = (not available)
/sys/class/power_supply/BAT1/charge_control_end_threshold = (not available)
/sys/class/power_supply/BAT1/charge_behaviour = (not available)
Charge = 79.9 [%]
Capacity = 76.8 [%]
Also using "pcie_port_pm=off", dedicated nvidia gpu is not going into suspend mode.
Offline
dedicated nvidia gpu is not going into suspend mode
Expectable, but currently we're just searching for causes of the nvme failure.
In that light: just completely disable TLP and see whether that stabilizes the nvme, so we're not messing around w/ it in complete vein.
Online