You are not logged in.
Guys, my nvidia 1030 and 1060 are both shooting over 80c just watching a video with "mpv --hwdec". They're under 45c in windows and 50 in Manjaro with multiple 4k streams. Switched kernels and reinstalled drivers, same problem (linux, linux-lts, nvidia, nvidia-dkms) The issue seems to be the GPU eager to run at 100% when decoding.
I have a 4K primary monitor, and a 1080p secondary monitor. Cards are clean, new thermo paste applied last year, fans are good and on, 140mm @ 700rpm constant Gaming temperature in Windows AAA title maxed 75-80c. Dota/League 45c. If I dail up the fan speed, gaming temps get lower. Fan is not the issue.
What makes my head itch is that both cards stay cool with Manjaro (xfce). I'm so happy with Arch and want to settle down. I'd hate to leave Arch just because of that.
Searched around and went on Reddit, no solution yet, please help. Thanks.
Hardware:
AMD 3600, Asus X370-Pro, GTX 1060 3G, Intel SSDWhat I have done:
Reinstalled Arch from scratch, twice. (EndavourOS, Artix also have overheating problem, but not Manjaro or Devuan)
Switched kernels and reinstalled drivers (linux, linux-lts, nvidia, nvidia-dkms)
Added nvidia-drm.modeset=1 to grub
Coolbit = 8 and used offset mkinitcpio.conf
MODULES=(crc32c nvidia nvidia_modeset nvidia_uvm nvidia_drm)Last edited by texfat (2020-07-06 19:32:07)
Offline
Guys, my nvidia 1030 and 1060 are both shooting over 80c just watching a video with "mpv --hwdec".
What's the output of nvidia-smi when watching a video?
What I have done:
Coolbit = 8 and used offset
Can you post a screenshot of nvidia-settings PowerMizer and "Thermal Settings"?
Last edited by sabroad (2020-07-07 10:37:46)
--
saint_abroad
Offline
texfat wrote:Guys, my nvidia 1030 and 1060 are both shooting over 80c just watching a video with "mpv --hwdec".
What's the output of nvidia-smi when watching a video?
texfat wrote:What I have done:
Coolbit = 8 and used offsetCan you post a screenshot of nvidia-settings PowerMizer and "Thermal Settings"?
Hi, thanks for trying to help. nvidia-smi shows correct temp reading (high), everything else is just normal. PowerMizer normal, although I tried to offset -200 GPU and -2000 GRAM, max allowed, to lower temperature. The result was indeed lower temp but playing 2 x 4K streams weren't possible, both became choppy.
Anyway, I relectantly moved on to Manjaro and the overheating problem gone. Not bashing Arch at all, I still have Arch on my laptop, but the main desktop, I cannot live with GPU temps that high. Must be some kernel issue. Arch and its direct derivatives (EOS, Artix) all have this problem. This is beyond my pay grade to fix. Manjaro has their own kernel, so no problem there. I might try Zen kernel when I have time to tinker again.
I only have temp screenshots, https://imgur.com/a/mp3wZrn I just got your msg and I had already moved onto Manjaro. I don't think this is an isolated case. Devs please pay attention.
Thanks everyone. Arch is amazing, love it. Fix the GPU overheating problem, I'll be back so fast. lol
Offline
The difference you see between Arch and other distros shouldn't be caused by a different kernel. I'm thinking it can't be the kernel because the Nvidia driver code is living outside of the kernel, it doesn't change with different kernels. When you then use the same Nvidia driver version, the card should behave the same. When it doesn't the difference should be caused by a configuration file somewhere.
In that screenshot you shared, the fans of the card are stopped and at zero rpm. Maybe that's the problem? Are the fans staying at zero rpm in Manjaro as well, or only in Arch? The card should normally decide to start its fans when things get hot enough, like at 50°C for example.
Offline
Guys, I found a stop-gap solution:
Use zen kernel! Significant temp drop, playing 2 4K streams with mpv --hwdec temp floats around 60c, much better than stock kernels. Not as good as temps in Manjaro but 60c is fine. I'll take that. Man.. what a trip. XD I'm still a noob, but I'm trying.
Last edited by texfat (2020-07-10 15:22:58)
Offline
The difference you see between Arch and other distros shouldn't be caused by a different kernel. I'm thinking it can't be the kernel because the Nvidia driver code is living outside of the kernel, it doesn't change with different kernels. When you then use the same Nvidia driver version, the card should behave the same. When it doesn't the difference should be caused by a configuration file somewhere.
In that screenshot you shared, the fans of the card are stopped and at zero rpm. Maybe that's the problem? Are the fans staying at zero rpm in Manjaro as well, or only in Arch? The card should normally decide to start its fans when things get hot enough, like at 50°C for example.
Hi, thanks for the reply. Switching to zen kernel proved to be a stop-gap solution. Temp stays around 60c which is acceptable. So yes, there's definitely something about the kernel which is beyond the knowledge I currently have.
I removed the stock fans and slapped a 140mm fan, manually controlled, to the card. That's why the rpm is 0. Actual rpm is about 700rpm which is enough for every game under Windows. That way I get a quiet machine which is just 2-3 feet from me. Normally, playing multiple 4K streams won't push the cards above 45c. Arch is the only distro pushing temps above 80c. Manjaro, Devuan/Debian don't do that. But zen kernel sort of mitigated the issue.
The good news is: I'm back to ARCH!!! YEEHAW! So, devs, please look at this issue. I wish I knew more to help.
Last edited by texfat (2020-07-10 15:32:42)
Offline
You're using nvidia proprietary driver I guess ?
If yes, kernel devs won't touch this issue at all (they refuse bug reports related to proprietary kernel modules)
You can try to contact nvidia linux devs by posting on https://devtalk.nvidia.com/ .
Disliking systemd intensely, but not satisfied with alternatives so focusing on taming systemd.
clean chroot building not flexible enough ?
Try clean chroot manager by graysky
Offline