I fixed the issue with this workaround: https://github.com/erpalma/throttled#ar … ty-package
]]>1. How can I find out what are the threshold values that the logs mention? It doesn't appear the laptop is running particularly hot, so I want to confirm the messages are actually correct.
2. Is it possible to reduce the logs to only ONE message per event? With a 6-core (12-thread) i7, I'm having TWELVE "above threshold" messages and 12 more "speed normal" messages each time it happens, which makes it hard to find anything else in the system logs.
]]>[ 11.546168] random: crng init done
[ 16.099406] mce: CPU6: Core temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099406] mce: CPU0: Core temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099408] mce: CPU0: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099412] mce: CPU3: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099413] mce: CPU1: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099413] mce: CPU7: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099414] mce: CPU8: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099415] mce: CPU9: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099416] mce: CPU2: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099417] mce: CPU5: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099418] mce: CPU4: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099418] mce: CPU11: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099419] mce: CPU10: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.099438] mce: CPU6: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 16.117387] mce: CPU4: Package temperature/speed normal
[ 16.117389] mce: CPU10: Package temperature/speed normal
[ 16.117424] mce: CPU0: Core temperature/speed normal
[ 16.117424] mce: CPU6: Core temperature/speed normal
[ 16.117425] mce: CPU7: Package temperature/speed normal
[ 16.117426] mce: CPU1: Package temperature/speed normal
[ 16.117427] mce: CPU3: Package temperature/speed normal
[ 16.117428] mce: CPU2: Package temperature/speed normal
[ 16.117428] mce: CPU8: Package temperature/speed normal
[ 16.117429] mce: CPU11: Package temperature/speed normal
[ 16.117430] mce: CPU5: Package temperature/speed normal
[ 16.117430] mce: CPU9: Package temperature/speed normal
[ 16.117431] mce: CPU6: Package temperature/speed normal
[ 16.117431] mce: CPU0: Package temperature/speed normal
[ 16.118661] NET: Registered protocol family 38
[ 16.370348] PM: Image not found (code -22)
and I think V1del has a point regarding starting up services and so on during boot time.
]]>From the timestamp it looks like this happens during early boot, which is somewhat expected as that is a time were starting up all your services and the system will probably generate quite a bit of load on all of your cores.
If this doesn't persist or repeat itself during normal usage or lead to unexpected behaviour during active use I'd say that shouldn't be much to worry about.
First one yes, early boot, but then happens again and its quite random,
I'll make a test with generating some prime numbers to see the result.
I don't really get it. - during the prime test , no trottle.
[ 90.848473] audit: type=1130 audit(1564580237.629:48): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 100.877979] audit: type=1131 audit(1564580247.789:49): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 689.049264] mce: CPU0: Core temperature above threshold, cpu clock throttled (total events = 1)
[ 689.049265] mce: CPU2: Core temperature above threshold, cpu clock throttled (total events = 1)
[ 689.049267] mce: CPU2: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 689.049268] mce: CPU0: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 689.049295] mce: CPU1: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 689.049296] mce: CPU3: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 689.051334] mce: CPU0: Core temperature/speed normal
[ 689.051335] mce: CPU2: Core temperature/speed normal
[ 689.051336] mce: CPU2: Package temperature/speed normal
[ 689.051336] mce: CPU0: Package temperature/speed normal
[ 689.051370] mce: CPU1: Package temperature/speed normal
[ 689.051370] mce: CPU3: Package temperature/speed normal
[ 901.584063] audit: type=1130 audit(1564581048.646:50): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-tmpfiles-clean comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 901.584070] audit: type=1131 audit(1564581048.646:51): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-tmpfiles-clean comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 1162.222730] audit: type=1130 audit(1564581309.274:52): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=fprintd comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 1162.483195] usb 1-9: reset full-speed USB device number 3 using xhci_hcd
[ 1165.841441] audit: type=1131 audit(1564581312.894:53): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=lm_sensors comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 1175.969972] i2c /dev entries driver
[ 1195.094151] audit: type=1130 audit(1564581342.141:54): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=lm_sensors comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 1195.738201] audit: type=1131 audit(1564581342.784:55): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=fprintd comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 1311.273239] mce: CPU2: Core temperature above threshold, cpu clock throttled (total events = 9)
[ 1311.273239] mce: CPU0: Core temperature above threshold, cpu clock throttled (total events = 9)
[ 1311.273241] mce: CPU0: Package temperature above threshold, cpu clock throttled (total events = 9)
[ 1311.273244] mce: CPU2: Package temperature above threshold, cpu clock throttled (total events = 9)
[ 1311.273273] mce: CPU1: Package temperature above threshold, cpu clock throttled (total events = 9)
[ 1311.273274] mce: CPU3: Package temperature above threshold, cpu clock throttled (total events = 9)
[ 1311.274219] mce: CPU0: Core temperature/speed normal
[ 1311.274220] mce: CPU2: Core temperature/speed normal
[ 1311.274221] mce: CPU3: Package temperature/speed normal
[ 1311.274222] mce: CPU1: Package temperature/speed normal
[ 1311.274223] mce: CPU2: Package temperature/speed normal
[ 1311.274224] mce: CPU0: Package temperature/speed normal
[ 1387.402273] audit: type=1130 audit(1564581534.431:56): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-hostnamed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 1417.455065] audit: type=1131 audit(1564581564.484:57): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-hostnamed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
If this doesn't persist or repeat itself during normal usage or lead to unexpected behaviour during active use I'd say that shouldn't be much to worry about.
]]>Phiscally it's a laptop, and yes, I can feel some heat. But fans are working sometime, i can hear them, but they have quite a low threshold for staring cooling. I think that's the error.
With currrent kernel update 5.2.4 - error persists.
Also I dont have AC here, and about 35 outside.
iwlwifi-virtual-0
Adapter: Virtual device
temp1: +37.0°C
thinkpad-isa-0000
Adapter: ISA adapter
fan1: 0 RPM
temp1: +49.0°C
temp2: N/A
temp3: +0.0°C
temp4: +0.0°C
temp5: +0.0°C
temp6: +0.0°C
temp7: +0.0°C
temp8: +0.0°C
temp9: +0.0°C
temp10: +0.0°C
temp11: +1.0°C
temp12: +0.0°C
temp13: +0.0°C
temp14: +0.0°C
temp15: +0.0°C
temp16: +0.0°C
acpitz-acpi-0
Adapter: ACPI interface
temp1: +49.0°C (crit = +128.0°C)
coretemp-isa-0000
Adapter: ISA adapter
Package id 0: +48.0°C (high = +100.0°C, crit = +100.0°C)
Core 0: +48.0°C (high = +100.0°C, crit = +100.0°C)
Core 1: +48.0°C (high = +100.0°C, crit = +100.0°C)
pch_skylake-virtual-0
Adapter: Virtual device
temp1: +43.5°C
Check your air filters. If clogged, cleaning them might help.
If it is a tower, you could add fans, a bigger heat sink, or even some kind of mass transport cooling system.
Laptops generally have limits to their thermal design.
Edit: Although that first line about the interrupt is interesting. Did the fans come on and start screaming about this time? What does htop say about CPU usage?
]]>[ 12.212158] rfkill: input handler disabled
[ 12.706256] mce: CPU0: Core temperature above threshold, cpu clock throttled (total events = 1)
[ 12.706257] mce: CPU2: Core temperature above threshold, cpu clock throttled (total events = 1)
[ 12.706259] mce: CPU1: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 12.706260] mce: CPU3: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 12.706262] mce: CPU2: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 12.706265] mce: CPU0: Package temperature above threshold, cpu clock throttled (total events = 1)
[ 12.707260] mce: CPU2: Core temperature/speed normal
[ 12.707261] mce: CPU0: Core temperature/speed normal
[ 12.707262] mce: CPU3: Package temperature/speed normal
[ 12.707263] mce: CPU1: Package temperature/speed normal
[ 12.707263] mce: CPU0: Package temperature/speed normal
[ 12.707264] mce: CPU2: Package temperature/speed normal