You are not logged in.

#1 2014-06-18 22:34:50

cfr
Member
From: Cymru
Registered: 2011-11-27
Posts: 7,140

[SOLVED] thermald fails with kernel upgrade 3.15

I have been using thermald to manage temperature over and above the thermal management provided through CPU scaling with the pstate driver. (I know the point is not thermal management per se but managing the CPUs efficiently helps manage temperature etc.)

Since upgrading this evening, thermald cannot start. The followed a reboot on upgrade to the new kernel, 3.15. The upgrade also included libva-intel-driver (1.3.1-1 -> 1.3.2-1) and linux-firmware (20140316.dec41bc-1 -> 20140603.a4f3bc0-1). I discovered that due to odditites in the versioning of thermald, I was actually using an older version, so I built and installed version 1.2.01-1 from AUR. Following suggestions in the comments on the AUR page, I also reran sensors-detect. This has not, unfortunately helped.

I've run systemctl daemon-reload and restarted thermald.service but I still can't get things to work.

This is the output from systemctl status thermald.service:

● thermald.service - Thermal Daemon Service
   Loaded: loaded (/usr/lib/systemd/system/thermald.service; enabled)
   Active: failed (Result: exit-code) since Mer 2014-06-18 22:00:26 BST; 56min ago
  Process: 6877 ExecStart=/usr/sbin/thermald --no-daemon --dbus-enable (code=exited, status=1/FAILURE)
 Main PID: 6877 (code=exited, status=1/FAILURE)

Meh 18 22:00:26 Mine thermald[6877]: RAPL domain count 1
Meh 18 22:00:26 Mine thermald[6877]: 13 CPUID levels; family:model:stepping 0x6:2a:7 (6:42:7)
Meh 18 22:00:26 Mine thermald[6877]: Polling mode is enabled: 4
Meh 18 22:00:26 Mine thermald[6877]: sensor_update: type acpitz
Meh 18 22:00:26 Mine thermald[6877]: sensor_update: type x86_pkg_temp
Meh 18 22:00:26 Mine thermald[6877]: thd_read_default_thermal_sensors loaded 2 sensors
Meh 18 22:00:26 Mine thermald[6877]: Thermal DTS: No coretemp sysfs, trying hwmon
Meh 18 22:00:26 Mine thermald[6877]: sensor id 2: No temp sysfs for reading raw temp
Meh 18 22:00:26 Mine thermald[6877]: Thermal sysfs Error in reading sensors
Meh 18 22:00:26 Mine thermald[6877]: THD engine start failed:

senosrs gives the following more-or-less normal-looking output:

thinkpad-isa-0000
Adapter: ISA adapter
fan1:         530 RPM
temp1:        +55.0°C  
temp2:         +0.0°C  
temp3:        +55.0°C  
temp4:         +0.0°C  
temp5:         +0.0°C  
temp6:         +0.0°C  
temp7:        +30.0°C  
temp8:         +0.0°C  

coretemp-isa-0000
Adapter: ISA adapter
Physical id 0:  +59.0°C  (high = +86.0°C, crit = +100.0°C)
Core 0:         +59.0°C  (high = +86.0°C, crit = +100.0°C)
Core 1:         +57.0°C  (high = +86.0°C, crit = +100.0°C)

journalctl -b -u thermald.service has this:

-- Logs begin at Llu 2014-06-16 15:59:36 BST, end at Mer 2014-06-18 23:06:11 BST. --
Meh 18 21:34:23 Mine systemd[1]: Started Thermal Daemon Service.
Meh 18 21:34:23 Mine thermald[930]: UUID is [81529351-7B50-CB11-83DC-AEADCF46AB45]
Meh 18 21:34:23 Mine thermald[930]: product name is[3045CTO]
Meh 18 21:34:23 Mine thermald[930]: 13 CPUID levels; family:model:stepping 0x6:2a:7 (6:42:7)
Meh 18 21:34:23 Mine thermald[930]: Polling mode is enabled: 4
Meh 18 21:34:23 Mine thermald[930]: powercap RAPL max power limit range 17000000
Meh 18 21:34:23 Mine thermald[930]: Found Intel pstate driver
Meh 18 21:34:23 Mine thermald[930]: DTS temperature path not found
Meh 18 21:34:23 Mine thermald[930]: Thermal DTS: No coretemp sysfs, trying hwmon
Meh 18 21:34:23 Mine thermald[930]: DTS temperature path not found
Meh 18 21:34:23 Mine thermald[930]: Thermal DTS or hwmon: No Zones present:
Meh 18 21:34:23 Mine thermald[930]: No thermal sensors foundTHD engine start failed:
Meh 18 21:34:23 Mine systemd[1]: thermald.service: main process exited, code=exited, status=1/FAILURE
Meh 18 21:34:23 Mine systemd[1]: Unit thermald.service entered failed state.
Meh 18 21:42:44 Mine systemd[1]: Starting Thermal Daemon Service...
Meh 18 21:42:44 Mine systemd[1]: Started Thermal Daemon Service.
Meh 18 21:42:44 Mine thermald[4628]: UUID is [81529351-7B50-CB11-83DC-AEADCF46AB45]
Meh 18 21:42:44 Mine thermald[4628]: product name is[3045CTO]
Meh 18 21:42:44 Mine thermald[4628]: 13 CPUID levels; family:model:stepping 0x6:2a:7 (6:42:7)
Meh 18 21:42:44 Mine thermald[4628]: Polling mode is enabled: 4
Meh 18 21:42:44 Mine thermald[4628]: powercap RAPL max power limit range 17000000
Meh 18 21:42:44 Mine thermald[4628]: Found Intel pstate driver
Meh 18 21:42:44 Mine thermald[4628]: DTS temperature path not found
Meh 18 21:42:44 Mine thermald[4628]: Thermal DTS: No coretemp sysfs, trying hwmon
Meh 18 21:42:44 Mine systemd[1]: thermald.service: main process exited, code=exited, status=1/FAILURE
Meh 18 21:42:44 Mine systemd[1]: Unit thermald.service entered failed state.
Meh 18 21:42:44 Mine thermald[4628]: DTS temperature path not found
Meh 18 21:42:44 Mine thermald[4628]: Thermal DTS or hwmon: No Zones present:
Meh 18 21:42:44 Mine thermald[4628]: No thermal sensors foundTHD engine start failed:
Meh 18 21:49:13 Mine systemd[1]: Starting Thermal Daemon Service...
Meh 18 21:49:13 Mine systemd[1]: Started Thermal Daemon Service.
Meh 18 21:49:13 Mine thermald[4848]: UUID is [81529351-7B50-CB11-83DC-AEADCF46AB45]
Meh 18 21:49:13 Mine thermald[4848]: product name is[3045CTO]
Meh 18 21:49:13 Mine thermald[4848]: 13 CPUID levels; family:model:stepping 0x6:2a:7 (6:42:7)
Meh 18 21:49:13 Mine thermald[4848]: Polling mode is enabled: 4
Meh 18 21:49:13 Mine thermald[4848]: powercap RAPL max power limit range 17000000
Meh 18 21:49:13 Mine thermald[4848]: Found Intel pstate driver
Meh 18 21:49:13 Mine thermald[4848]: DTS temperature path not found
Meh 18 21:49:13 Mine thermald[4848]: Thermal DTS: No coretemp sysfs, trying hwmon
Meh 18 21:49:13 Mine thermald[4848]: DTS temperature path not found
Meh 18 21:49:13 Mine thermald[4848]: Thermal DTS or hwmon: No Zones present:
Meh 18 21:49:13 Mine thermald[4848]: No thermal sensors foundTHD engine start failed:
Meh 18 21:49:13 Mine systemd[1]: thermald.service: main process exited, code=exited, status=1/FAILURE
Meh 18 21:49:13 Mine systemd[1]: Unit thermald.service entered failed state.
Meh 18 22:00:26 Mine systemd[1]: Starting Thermal Daemon Service...
Meh 18 22:00:26 Mine systemd[1]: Started Thermal Daemon Service.
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/power/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/enabled/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/name/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/power/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/constraint_0_time_window_us/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/constraint_1_power_limit_uw/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/enabled/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/device/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/constraint_0_power_limit_uw/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/subsystem/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/constraint_0_name/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/constraint_1_name/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/energy_uj/name
Meh 18 22:00:26 Mine systemd[1]: thermald.service: main process exited, code=exited, status=1/FAILURE
Meh 18 22:00:26 Mine systemd[1]: Unit thermald.service entered failed state.
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/constraint_1_max_power_uw/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/uevent/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/max_energy_range_uj/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/constraint_0_max_power_uw/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/intel-rapl:0/constraint_1_time_window_us/name
Meh 18 22:00:26 Mine thermald[6877]: RAPL domain count 0
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/subsystem/name
Meh 18 22:00:26 Mine thermald[6877]: sysfs read failed /sys/class/powercap/intel-rapl/uevent/name
Meh 18 22:00:26 Mine thermald[6877]: RAPL domain count 1
Meh 18 22:00:26 Mine thermald[6877]: 13 CPUID levels; family:model:stepping 0x6:2a:7 (6:42:7)
Meh 18 22:00:26 Mine thermald[6877]: Polling mode is enabled: 4
Meh 18 22:00:26 Mine thermald[6877]: sensor_update: type acpitz
Meh 18 22:00:26 Mine thermald[6877]: sensor_update: type x86_pkg_temp
Meh 18 22:00:26 Mine thermald[6877]: thd_read_default_thermal_sensors loaded 2 sensors
Meh 18 22:00:26 Mine thermald[6877]: Thermal DTS: No coretemp sysfs, trying hwmon
Meh 18 22:00:26 Mine thermald[6877]: sensor id 2: No temp sysfs for reading raw temp
Meh 18 22:00:26 Mine thermald[6877]: Thermal sysfs Error in reading sensors
Meh 18 22:00:26 Mine thermald[6877]: THD engine start failed:

As far as I remember, I did not do anything special when I first installed thermald and I seem to be using the default configuration files. I am also using the default thermald.service file under /usr/lib. I don't know if it is relevant but my kernel command line is currently (and has for some time been):

BOOT_IMAGE=/vmlinuz-linux root=/dev/vgroup-cfr/arch rw rootfstype=ext4 cryptdevice=/dev/disk/by-uuid/LONG-LIST-OF-ALPHA-NUMERICS:lvm resume=/dev/vgroup-cfr/swap rd.log i915.i915_enable_rc6=-1 i915.i915_enable_fbc=1 i915.lvds_downclock=1 i915.semaphores=1 fbcon=scrollback:200k add_efi_memmap

I am not sure the RAPL thing it is looking for is anything I have but it should certainly find other ways to control the temperature and, of course, sensors to monitor it. That is, I have /sys/class/powercap/intel-rapl/ and various variants, but I do not have the particular files it is trying to read. But I am pretty sure it should find coretemp rather than falling back to hwmon although I'm not entirely certain of this.

I'm sure I'm forgetting something obvious since it seems weird to me that searching the forums and wiki hasn't turned up the solution - or, if not a solution which works for me, at least some things to try. Possibly I'm using the wrong search terms but all I searched on was archlinux.org for site and the term thermald which does not seem unduly restrictive.

What am I missing? Or how can I find out?

Last edited by cfr (2014-06-20 20:54:51)


CLI Paste | How To Ask Questions

Arch Linux | x86_64 | GPT | EFI boot | refind | stub loader | systemd | LVM2 on LUKS
Lenovo x270 | Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz | Intel Wireless 8265/8275 | US keyboard w/ Euro | 512G NVMe INTEL SSDPEKKF512G7L

Offline

#2 2014-06-18 23:28:55

WonderWoofy
Member
From: Los Gatos, CA
Registered: 2012-05-19
Posts: 8,414

Re: [SOLVED] thermald fails with kernel upgrade 3.15

I was also experiencing this a few weeks ago while running the 3.15 kernel while it was still rc.  I didn't investigate at all, but whatever the issue is, it seems to be fixed in git.  So can you try the thermald-git package and see if that fixes the issue?

Offline

#3 2014-06-20 20:54:27

cfr
Member
From: Cymru
Registered: 2011-11-27
Posts: 7,140

Re: [SOLVED] thermald fails with kernel upgrade 3.15

Thanks, WonderWoofy. I installed thermald-git last night and it seems to work quite happily. Now I just have to try to remember to keep it up-to-date. (The main reason I try to avoid -git AUR packages!)


CLI Paste | How To Ask Questions

Arch Linux | x86_64 | GPT | EFI boot | refind | stub loader | systemd | LVM2 on LUKS
Lenovo x270 | Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz | Intel Wireless 8265/8275 | US keyboard w/ Euro | 512G NVMe INTEL SSDPEKKF512G7L

Offline

#4 2014-06-21 00:56:15

WonderWoofy
Member
From: Los Gatos, CA
Registered: 2012-05-19
Posts: 8,414

Re: [SOLVED] thermald fails with kernel upgrade 3.15

I discovered that there is a super lazy solution to the *-git packages.  Since I am already a cower user, I installed pacaur.  So I can do pacaur -yu --devel.  It will check to see if there are any new commits to the devel packages (git, svn, etc) and build it if there are.  As long as you don't have a whole shitload of *-git packages, it is a fairly quick process.

(Amazingly, 'shitload' is not flagged as misspelled smile)

Edit: I misspell a shitload of things...

Last edited by WonderWoofy (2014-06-21 00:56:58)

Offline

Board footer

Powered by FluxBB