You are not logged in.

#1 2025-06-26 08:52:09

north::guy
Member
Registered: 2025-06-26
Posts: 1

Random Reboots When CPU Temperature Too Low

Hello,

I recently acquired a laptop Asus Rog Zephyrus G14 GA402R 2022. The previous owner reported some issues with random reboots if the charging cable is not connected; and this was with Windows. I tested the laptop and it worked really fine and since the charging requirement was not a problem for me, I decided to buy it. The previous owner also changed the cooling liquid on the CPU, but they said that this did not resolve the issue.

So after using the laptop for a while, the reboots got worse, as it was not enough that the laptop was connected to a power supply. For this reason I am utilizing stress-ng script which puts just enough weight for the CPU so that the device does not reboot. When using stress-ng the device does not have to be plugged into power supply, but this drains the battery very swiftly.

It seems that as I boot when the CPU is cold, I can at best get to the Grub menu when the machine reboots. For some reason the firmware interface heats up the CPU very intensely so I have the heat the CPU every time when I'm booting to around 70-80 Celsius so that I can boot successfully.

*So the issue is that without enough CPU temperature, my device randomly reboots.*

I have tried to diagnose the issue and I believe that it might be due to faulty hardware, or a ACPI related BIOS bug. I have tried all the officially availabe BIOS versions 317, 318, 319 and every one of them seems to have this issue. I have tested this with various different kernels and currently I have three kernels which can replicate this issue 6.15.2-arch1-1, 6.14.11-hardened1-1-hardened, and 6.15.2-zen1-1-zen.

Here are two ACPI related errors from dmesg that I could find:

[    0.322814] ACPI: [Firmware Bug]: BIOS _OSI(Linux) query honored via cmdline
...
[    0.396721] ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.GPP0.SWUS.SWDS.VGA._STA.GC00], AE_NOT_FOUND (20240827/psargs-332)
[    0.396731] ACPI Error: Aborting method \_SB.PCI0.GPP0.SWUS.SWDS.VGA._STA due to previous error (AE_NOT_FOUND) (20240827/psparse-529)
/* ^This error above shows multiple times in a single boot sequence^ */
...

In every boot sequence I get the "Could not resolve symbol" error message displayed by Grub. It seems like the kernel is trying to fetch ACPI configuration information from BIOS, but is unable to resolve it.

Here is the output of acpi -V:

Battery 0: Not charging, 100%
Battery 0: design capacity 4769 mAh, last full capacity 4129 mAh = 86%
Adapter 0: on-line
Thermal 0: ok, 20.0 degrees C
Thermal 0: trip point 0 switches to mode critical at temperature 110.0 degrees C
Thermal 1: ok, 84.0 degrees C
Thermal 1: trip point 0 switches to mode critical at temperature 120.0 degrees C
Thermal 1: trip point 1 switches to mode passive at temperature 110.0 degrees C
Cooling 0: Processor 0 of 3
Cooling 1: Processor 0 of 3
Cooling 2: Processor 0 of 3
Cooling 3: Processor 0 of 3
Cooling 4: Processor 0 of 3
Cooling 5: Processor 0 of 3
Cooling 6: Processor 0 of 3
Cooling 7: Processor 0 of 3
Cooling 8: Processor 0 of 3
Cooling 9: Processor 0 of 3
Cooling 10: Processor 0 of 3
Cooling 11: Processor 0 of 3
Cooling 12: Processor 0 of 3
Cooling 13: Processor 0 of 3
Cooling 14: Processor 0 of 3
Cooling 15: Processor 0 of 3

*So my question is that is this possible to fix by for example tweaking some values from software side of things, or does this require potential hardware fixes? Could it also be that the CPU is weirdly soldered and thus, it has problems with electricity. And only after heat expansion it is able supply electricity normally.*

Some additional information:

lspci:

00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Root Complex (rev 01)
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Family 17h-19h IOMMU
00:01.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Dummy Host Bridge (rev 01)
00:01.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe GPP Bridge
00:02.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Dummy Host Bridge (rev 01)
00:02.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe GPP Bridge
00:02.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe GPP Bridge
00:02.4 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe GPP Bridge
00:03.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Dummy Host Bridge (rev 01)
00:04.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Dummy Host Bridge (rev 01)
00:08.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h PCIe Dummy Host Bridge (rev 01)
00:08.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h Internal PCIe GPP Bridge (rev 10)
00:08.3 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 17h-19h Internal PCIe GPP Bridge (rev 10)
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 71)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 5
00:18.6 Host bridge: Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 6
00:18.7 Host bridge: Advanced Micro Devices, Inc. [AMD] Rembrandt Data Fabric: Device 18h; Function 7
01:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch (rev c0)
02:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23 [Radeon RX 6650 XT / 6700S / 6800S] (rev c0)
03:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Navi 21/23 HDMI/DP Audio Controller
04:00.0 SD Host controller: O2 Micro, Inc. SD/MMC Card Reader Controller (rev 01)
05:00.0 Network controller: MEDIATEK Corp. MT7922 802.11ax PCI Express Wireless Network Adapter
06:00.0 Non-Volatile memory controller: Micron Technology Inc 2450 NVMe SSD [HendrixV] (DRAM-less) (rev 01)
07:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Rembrandt [Radeon 680M] (rev c7)
07:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Radeon High Definition Audio Controller [Rembrandt/Strix]
07:00.2 Encryption controller: Advanced Micro Devices, Inc. [AMD] Family 19h PSP/CCP
07:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Rembrandt USB4 XHCI controller #3
07:00.4 USB controller: Advanced Micro Devices, Inc. [AMD] Rembrandt USB4 XHCI controller #4
07:00.5 Multimedia controller: Advanced Micro Devices, Inc. [AMD] Audio Coprocessor (rev 60)
07:00.6 Audio device: Advanced Micro Devices, Inc. [AMD] Family 17h/19h/1ah HD Audio Controller
08:00.0 USB controller: Advanced Micro Devices, Inc. [AMD] Rembrandt USB4 XHCI controller #8
08:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Rembrandt USB4 XHCI controller #5
08:00.4 USB controller: Advanced Micro Devices, Inc. [AMD] Rembrandt USB4 XHCI controller #6

lscpu:

Architecture:                x86_64
  CPU op-mode(s):            32-bit, 64-bit
  Address sizes:             48 bits physical, 48 bits virtual
  Byte Order:                Little Endian
CPU(s):                      16
  On-line CPU(s) list:       0-15
Vendor ID:                   AuthenticAMD
  Model name:                AMD Ryzen 9 6900HS with Radeon Graphics
    CPU family:              25
    Model:                   68
    Thread(s) per core:      2
    Core(s) per socket:      8
    Socket(s):               1
    Stepping:                1
    Frequency boost:         enabled
    CPU(s) scaling MHz:      43%
    CPU max MHz:             4936.6309
    CPU min MHz:             416.3430
    BogoMIPS:                6587.65
    Flags:                   fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep
                             _good nopl xtopology nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdran
                             d lahf_lm cmp_legacy extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb c
                             at_l3 cdp_l3 hw_pstate ssbd mba ibrs ibpb stibp vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec
                              xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local user_shstk clzero irperf xsaveerptr rdpru wbnoinvd cppc arat npt lbrv svm_lock nrip_save tsc_sca
                             le vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif v_spec_ctrl umip pku ospke vaes vpclmulqdq rdpid overflow_recov succor s
                             mca fsrm debug_swap
Caches (sum of all):
  L1d:                       256 KiB (8 instances)
  L1i:                       256 KiB (8 instances)
  L2:                        4 MiB (8 instances)
  L3:                        16 MiB (1 instance)
NUMA:
  NUMA node(s):              1
  NUMA node0 CPU(s):         0-15
...

Hope this is a good and informative post and that this landed in the right place as this is my first post here!

Edit *added proper code tags

Last edited by north::guy (2025-06-26 20:21:48)

Offline

#2 2025-06-26 19:27:41

seth
Member
Registered: 2012-09-03
Posts: 65,801

Re: Random Reboots When CPU Temperature Too Low

Please use [code][/code] tags, the BBS predates markdown by approximately your age. Edit your post in this regard.

Ryzens are notorious for that, see https://wiki.archlinux.org/title/Ryzen#Random_reboots and this also perfectly fits

For this reason I am utilizing stress-ng script which puts just enough weight for the CPU so that the device does not reboot.

Offline

#3 2025-06-27 14:32:07

LuxFerre
Member
Registered: 2010-03-01
Posts: 40

Re: Random Reboots When CPU Temperature Too Low

It does sound like a hardware issue, considering the problem was present in windows too in some capacity. However it is worth trying to change cpu voltage/frequency/PBO settings in the BIOS, if you are comfortable with that. You'd need to google what are safe limits for your cpu if you increase the voltage, but it's also possible that underclocking (lowering the speed) would help.

Offline

#4 2025-07-02 06:40:17

Gricenchos
Member
Registered: 2016-09-03
Posts: 58

Re: Random Reboots When CPU Temperature Too Low

I'm having a very similar issue on Lenovo Legion S7. You could to increase the voltage with the following command:

ryzenadj --set-coall=

The exact value I'm not sure of, try both increasing and decreasing it. I've been increasing it by 15-20, but try playing with it a bit.

Offline

Board footer

Powered by FluxBB