You are not logged in.
I'm having issues playing video games because my CPU keeps overheating. I'm tracking the temperature and it goes up to around 90 degrees Celsius before the game crashes. I'm also tracking the temperature of my GPU and it doesn't seem affected at all.
I'm not sure if this is due to the games being too CPU heavy or if it is due to the GPU not being used properly and therefore the CPU having to manage graphics. I can sometimes experience the same problem when multi-threading the compilation of very large programs, although then I'm more confident it's just a shitty-cooling-problem for the CPU.
I'm curious how to check if my GPU is being used correctly by the games I'm playing.
lspci | grep AMD
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480/570/570X/580/580X/590] (rev c7)
01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere HDMI Audio [Radeon RX 470/480 / 570/580/590]Thanks.
Last edited by leadry (2022-08-11 15:48:12)
Offline
Overheating == cooling problem. Whether or not the game is using the GPU is not really relevant, since your machine should not overheat - period. Are the fans running? Dust buildup?
Offline
Fair enough, I've opened up my computer and reapplied the cooling paste. It seems the CPU temperature is now fine (keeps below 80 degrees at all times)
My game keeps crashing, and I'm not sure why.
dmesg outputs the following related errors:
[ 4926.367463] dota2[21180]: segfault at ff00ff00 ip 00000000ff00ff00 sp 00007ffc09cf9fb0 error 14
[ 4926.367481] Code: Unable to access opcode bytes at RIP 0xff00fed6.
[ 4926.367566] audit: type=1701 audit(1660480387.163:91): auid=1000 uid=1000 gid=985 ses=1 pid=21180 comm="dota2" exe=2F686F6D652F6772756D70792F2E6C6F63616C2F73686172652F537465616D2F737465616D617070732F636F6D6D6F6E2F646F7461203220626574612F67616D652F62696E2F6C696E7578737465616D727436342F646F746132 sig=11 res=1
[ 4926.378396] audit: type=1334 audit(1660480387.176:92): prog-id=27 op=LOAD
[ 4926.378471] audit: type=1334 audit(1660480387.176:93): prog-id=28 op=LOAD
[ 4926.378475] audit: type=1334 audit(1660480387.176:94): prog-id=29 op=LOAD
[ 4926.379303] audit: type=1130 audit(1660480387.176:95): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-coredump@2-23878-0 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 4947.297669] audit: type=1131 audit(1660480408.096:96): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-coredump@2-23878-0 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 4947.404219] audit: type=1334 audit(1660480408.203:97): prog-id=0 op=UNLOAD
[ 4947.404224] audit: type=1334 audit(1660480408.203:98): prog-id=0 op=UNLOAD
[ 4947.404225] audit: type=1334 audit(1660480408.203:99): prog-id=0 op=UNLOAD
[ 5053.138976] mce: [Hardware Error]: Machine check events logged
[ 5053.138980] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5053.138984] mce: [Hardware Error]: TSC 1027313580c4
[ 5053.138985] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480513 SOCKET 0 APIC 0 microcode 16
[ 5055.338161] mce: [Hardware Error]: Machine check events logged
[ 5055.338165] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5055.338168] mce: [Hardware Error]: TSC 1028fc56fcbb
[ 5055.338170] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480516 SOCKET 0 APIC 0 microcode 16
[ 5083.572907] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5083.572914] mce: [Hardware Error]: TSC 104002fffbcd
[ 5083.572916] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480544 SOCKET 0 APIC 0 microcode 16
[ 5087.622028] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5087.622035] mce: [Hardware Error]: TSC 10435058dce2
[ 5087.622037] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480548 SOCKET 0 APIC 0 microcode 16
[ 5105.616317] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5105.616324] mce: [Hardware Error]: TSC 1051fd132d26
[ 5105.616325] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480566 SOCKET 0 APIC 0 microcode 16
[ 5179.421665] mce_notify_irq: 3 callbacks suppressed
[ 5179.421668] mce: [Hardware Error]: Machine check events logged
[ 5179.421670] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5179.421674] mce: [Hardware Error]: TSC 108e2da847d4
[ 5179.421675] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480640 SOCKET 0 APIC 0 microcode 16
[ 5206.723732] mce: [Hardware Error]: Machine check events logged
[ 5206.723736] mce: [Hardware Error]: CPU 2: Machine Check: 0 Bank 0: 9000004000010005
[ 5206.723741] mce: [Hardware Error]: TSC 10a471991e7f
[ 5206.723742] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480667 SOCKET 0 APIC 4 microcode 16
[ 5241.368002] audit: type=1100 audit(1660480702.169:100): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:authentication grantors=pam_faillock,pam_permit,pam_faillock acct="grumpy" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5241.368115] audit: type=1101 audit(1660480702.169:101): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:accounting grantors=pam_unix,pam_permit,pam_time acct="grumpy" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5241.368632] audit: type=1110 audit(1660480702.169:102): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:setcred grantors=pam_faillock,pam_permit,pam_faillock acct="root" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5241.370248] audit: type=1105 audit(1660480702.169:103): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:session_open grantors=pam_systemd_home,pam_limits,pam_unix,pam_permit acct="root" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5241.387980] audit: type=1106 audit(1660480702.189:104): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:session_close grantors=pam_systemd_home,pam_limits,pam_unix,pam_permit acct="root" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5241.388034] audit: type=1104 audit(1660480702.189:105): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:setcred grantors=pam_faillock,pam_permit,pam_faillock acct="root" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5378.559771] mce: [Hardware Error]: Machine check events logged
[ 5378.559775] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5378.559779] mce: [Hardware Error]: TSC 1130945e0cab
[ 5378.559780] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480839 SOCKET 0 APIC 0 microcode 16
[ 5384.493053] mce: [Hardware Error]: Machine check events logged
[ 5384.493057] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5384.493061] mce: [Hardware Error]: TSC 11356b14133f
[ 5384.493062] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480845 SOCKET 0 APIC 0 microcode 16
[ 5421.184737] dota2[24129]: segfault at 14 ip 00007f4cdd4aa3b5 sp 00007ffd7a5698f0 error 4 in libclient.so[7f4cd9c00000+576b000]
[ 5421.184759] Code: 24 0a 06 00 00 00 74 45 4c 8b 25 26 62 fd 02 49 8b 1c 24 48 85 db 0f 84 b1 00 00 00 80 7b 45 00 48 89 d8 74 50 f3 0f 10 43 34 <f3> 0f 5e 40 14 f3 0f 58 05 ca 5f a6 01 f3 0f 2c c0 48 83 c4 08 5bOffline
Your CPU needs more voltage. Do you have a Ryzen CPU running Curve Optimizer?
Excuse my poor English.
Offline
What tells you that my CPU needs more voltage? I'm not familiar with a Ryzen CPU nor Curve Optimizer. This is the ouput of
lscpuArchitecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 36 bits physical, 48 bits virtual
Byte Order: Little Endian
CPU(s): 4
On-line CPU(s) list: 0-3
Vendor ID: GenuineIntel
Model name: Intel(R) Core(TM) i5-3570K CPU @ 3.40GHz
CPU family: 6
Model: 58
Thread(s) per core: 1
Core(s) per socket: 4
Socket(s): 1
Stepping: 9
CPU(s) scaling MHz: 51%
CPU max MHz: 4100.0000
CPU min MHz: 1600.0000
BogoMIPS: 7008.21
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx rdtsc
p lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl est tm2
ssse3 cx16 xtpr pdcm pcid sse4_1 sse4_2 popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm cpuid_fault epb pti fsgsbase smep e
rms xsaveopt dtherm ida arat pln pts
Caches (sum of all):
L1d: 128 KiB (4 instances)
L1i: 128 KiB (4 instances)
L2: 1 MiB (4 instances)
L3: 6 MiB (1 instance)
NUMA:
NUMA node(s): 1
NUMA node0 CPU(s): 0-3
Vulnerabilities:
Itlb multihit: KVM: Mitigation: VMX unsupported
L1tf: Mitigation; PTE Inversion
Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT disabled
Meltdown: Mitigation; PTI
Mmio stale data: Not affected
Retbleed: Not affected
Spec store bypass: Vulnerable
Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Spectre v2: Mitigation; Retpolines, STIBP disabled, RSB filling
Srbds: Vulnerable: No microcode
Tsx async abort: Not affectedOffline
There're certainly HW errors (recorded from the previous boot) and the game segfaults (in some private library, it seems?), but your filter doesn't cover whether there's a coredump and while "[ 4926.367481] Code: Unable to access opcode bytes at RIP 0xff00fed6." looks like an IOMMU error, it also lacks context.
iow: don't filter the logs, post the entire system journal and check whether coredumpctl has a coredump for dota2
Online
What tells you that my CPU needs more voltage?
mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005Are you overclocking your CPU?
Anyways like i said previously, overclocked or not, that CPU needs more voltage or lower frequencies.
Excuse my poor English.
Offline