You are not logged in.

#1 2022-08-11 15:46:53

leadry
Member
Registered: 2022-07-02
Posts: 16

Video games crash due to high CPU temperature, unsure if GPU is used

I'm having issues playing video games because my CPU keeps overheating. I'm tracking the temperature and it goes up to around 90 degrees Celsius before the game crashes. I'm also tracking the temperature of my GPU and it doesn't seem affected at all.

I'm not sure if this is due to the games being too CPU heavy or if it is due to the GPU not being used properly and therefore the CPU having to manage graphics. I can sometimes experience the same problem when multi-threading the compilation of very large programs, although then I'm more confident it's just a shitty-cooling-problem for the CPU.

I'm curious how to check if my GPU is being used correctly by the games I'm playing.

lspci | grep AMD
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480/570/570X/580/580X/590] (rev c7)
01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere HDMI Audio [Radeon RX 470/480 / 570/580/590]

Thanks.

Last edited by leadry (2022-08-11 15:48:12)

Offline

#2 2022-08-12 15:24:23

twelveeighty
Member
Registered: 2011-09-04
Posts: 1,453

Re: Video games crash due to high CPU temperature, unsure if GPU is used

Overheating == cooling problem. Whether or not the game is using the GPU is not really relevant, since your machine should not overheat - period. Are the fans running? Dust buildup?

Offline

#3 2022-08-14 12:45:22

leadry
Member
Registered: 2022-07-02
Posts: 16

Re: Video games crash due to high CPU temperature, unsure if GPU is used

Fair enough, I've opened up my computer and reapplied the cooling paste. It seems the CPU temperature is now fine (keeps below 80 degrees at all times)

My game keeps crashing, and I'm not sure why.
dmesg outputs the following related errors:

[ 4926.367463] dota2[21180]: segfault at ff00ff00 ip 00000000ff00ff00 sp 00007ffc09cf9fb0 error 14
[ 4926.367481] Code: Unable to access opcode bytes at RIP 0xff00fed6.
[ 4926.367566] audit: type=1701 audit(1660480387.163:91): auid=1000 uid=1000 gid=985 ses=1 pid=21180 comm="dota2" exe=2F686F6D652F6772756D70792F2E6C6F63616C2F73686172652F537465616D2F737465616D617070732F636F6D6D6F6E2F646F7461203220626574612F67616D652F62696E2F6C696E7578737465616D727436342F646F746132 sig=11 res=1
[ 4926.378396] audit: type=1334 audit(1660480387.176:92): prog-id=27 op=LOAD
[ 4926.378471] audit: type=1334 audit(1660480387.176:93): prog-id=28 op=LOAD
[ 4926.378475] audit: type=1334 audit(1660480387.176:94): prog-id=29 op=LOAD
[ 4926.379303] audit: type=1130 audit(1660480387.176:95): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-coredump@2-23878-0 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 4947.297669] audit: type=1131 audit(1660480408.096:96): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-coredump@2-23878-0 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[ 4947.404219] audit: type=1334 audit(1660480408.203:97): prog-id=0 op=UNLOAD
[ 4947.404224] audit: type=1334 audit(1660480408.203:98): prog-id=0 op=UNLOAD
[ 4947.404225] audit: type=1334 audit(1660480408.203:99): prog-id=0 op=UNLOAD
[ 5053.138976] mce: [Hardware Error]: Machine check events logged
[ 5053.138980] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5053.138984] mce: [Hardware Error]: TSC 1027313580c4
[ 5053.138985] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480513 SOCKET 0 APIC 0 microcode 16
[ 5055.338161] mce: [Hardware Error]: Machine check events logged
[ 5055.338165] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5055.338168] mce: [Hardware Error]: TSC 1028fc56fcbb
[ 5055.338170] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480516 SOCKET 0 APIC 0 microcode 16
[ 5083.572907] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5083.572914] mce: [Hardware Error]: TSC 104002fffbcd
[ 5083.572916] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480544 SOCKET 0 APIC 0 microcode 16
[ 5087.622028] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5087.622035] mce: [Hardware Error]: TSC 10435058dce2
[ 5087.622037] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480548 SOCKET 0 APIC 0 microcode 16
[ 5105.616317] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5105.616324] mce: [Hardware Error]: TSC 1051fd132d26
[ 5105.616325] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480566 SOCKET 0 APIC 0 microcode 16
[ 5179.421665] mce_notify_irq: 3 callbacks suppressed
[ 5179.421668] mce: [Hardware Error]: Machine check events logged
[ 5179.421670] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5179.421674] mce: [Hardware Error]: TSC 108e2da847d4
[ 5179.421675] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480640 SOCKET 0 APIC 0 microcode 16
[ 5206.723732] mce: [Hardware Error]: Machine check events logged
[ 5206.723736] mce: [Hardware Error]: CPU 2: Machine Check: 0 Bank 0: 9000004000010005
[ 5206.723741] mce: [Hardware Error]: TSC 10a471991e7f
[ 5206.723742] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480667 SOCKET 0 APIC 4 microcode 16
[ 5241.368002] audit: type=1100 audit(1660480702.169:100): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:authentication grantors=pam_faillock,pam_permit,pam_faillock acct="grumpy" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5241.368115] audit: type=1101 audit(1660480702.169:101): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:accounting grantors=pam_unix,pam_permit,pam_time acct="grumpy" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5241.368632] audit: type=1110 audit(1660480702.169:102): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:setcred grantors=pam_faillock,pam_permit,pam_faillock acct="root" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5241.370248] audit: type=1105 audit(1660480702.169:103): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:session_open grantors=pam_systemd_home,pam_limits,pam_unix,pam_permit acct="root" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5241.387980] audit: type=1106 audit(1660480702.189:104): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:session_close grantors=pam_systemd_home,pam_limits,pam_unix,pam_permit acct="root" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5241.388034] audit: type=1104 audit(1660480702.189:105): pid=25505 uid=1000 auid=1000 ses=1 msg='op=PAM:setcred grantors=pam_faillock,pam_permit,pam_faillock acct="root" exe="/usr/bin/sudo" hostname=? addr=? terminal=/dev/pts/0 res=success'
[ 5378.559771] mce: [Hardware Error]: Machine check events logged
[ 5378.559775] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5378.559779] mce: [Hardware Error]: TSC 1130945e0cab
[ 5378.559780] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480839 SOCKET 0 APIC 0 microcode 16
[ 5384.493053] mce: [Hardware Error]: Machine check events logged
[ 5384.493057] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005
[ 5384.493061] mce: [Hardware Error]: TSC 11356b14133f
[ 5384.493062] mce: [Hardware Error]: PROCESSOR 0:306a9 TIME 1660480845 SOCKET 0 APIC 0 microcode 16
[ 5421.184737] dota2[24129]: segfault at 14 ip 00007f4cdd4aa3b5 sp 00007ffd7a5698f0 error 4 in libclient.so[7f4cd9c00000+576b000]
[ 5421.184759] Code: 24 0a 06 00 00 00 74 45 4c 8b 25 26 62 fd 02 49 8b 1c 24 48 85 db 0f 84 b1 00 00 00 80 7b 45 00 48 89 d8 74 50 f3 0f 10 43 34 <f3> 0f 5e 40 14 f3 0f 58 05 ca 5f a6 01 f3 0f 2c c0 48 83 c4 08 5b

Offline

#4 2022-08-14 14:50:58

agapito
Member
From: Who cares.
Registered: 2008-11-13
Posts: 703

Re: Video games crash due to high CPU temperature, unsure if GPU is used

Your CPU needs more voltage. Do you have a Ryzen CPU running Curve Optimizer?


Excuse my poor English.

Offline

#5 2022-08-14 16:10:08

leadry
Member
Registered: 2022-07-02
Posts: 16

Re: Video games crash due to high CPU temperature, unsure if GPU is used

What tells you that my CPU needs more voltage? I'm not familiar with a Ryzen CPU nor Curve Optimizer. This is the ouput of

lscpu
Architecture:           x86_64
  CPU op-mode(s):       32-bit, 64-bit
  Address sizes:        36 bits physical, 48 bits virtual
  Byte Order:           Little Endian
CPU(s):                 4
  On-line CPU(s) list:  0-3
Vendor ID:              GenuineIntel
  Model name:           Intel(R) Core(TM) i5-3570K CPU @ 3.40GHz
    CPU family:         6
    Model:              58
    Thread(s) per core: 1
    Core(s) per socket: 4
    Socket(s):          1
    Stepping:           9
    CPU(s) scaling MHz: 51%
    CPU max MHz:        4100.0000
    CPU min MHz:        1600.0000
    BogoMIPS:           7008.21
    Flags:              fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx rdtsc
                        p lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl est tm2
                         ssse3 cx16 xtpr pdcm pcid sse4_1 sse4_2 popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm cpuid_fault epb pti fsgsbase smep e
                        rms xsaveopt dtherm ida arat pln pts
Caches (sum of all):
  L1d:                  128 KiB (4 instances)
  L1i:                  128 KiB (4 instances)
  L2:                   1 MiB (4 instances)
  L3:                   6 MiB (1 instance)
NUMA:
  NUMA node(s):         1
  NUMA node0 CPU(s):    0-3
Vulnerabilities:
  Itlb multihit:        KVM: Mitigation: VMX unsupported
  L1tf:                 Mitigation; PTE Inversion
  Mds:                  Vulnerable: Clear CPU buffers attempted, no microcode; SMT disabled
  Meltdown:             Mitigation; PTI
  Mmio stale data:      Not affected
  Retbleed:             Not affected
  Spec store bypass:    Vulnerable
  Spectre v1:           Mitigation; usercopy/swapgs barriers and __user pointer sanitization
  Spectre v2:           Mitigation; Retpolines, STIBP disabled, RSB filling
  Srbds:                Vulnerable: No microcode
  Tsx async abort:      Not affected

Offline

#6 2022-08-14 19:31:13

seth
Member
From: Won't reply 2 private help req
Registered: 2012-09-03
Posts: 76,500

Re: Video games crash due to high CPU temperature, unsure if GPU is used

There're certainly HW errors (recorded from the previous boot) and the game segfaults (in some private library, it seems?), but your filter doesn't cover whether there's a coredump and while "[ 4926.367481] Code: Unable to access opcode bytes at RIP 0xff00fed6." looks like an IOMMU error, it also lacks context.
iow: don't filter the logs, post the entire system journal and check whether coredumpctl has a coredump for dota2

Online

#7 2022-08-14 19:39:39

agapito
Member
From: Who cares.
Registered: 2008-11-13
Posts: 703

Re: Video games crash due to high CPU temperature, unsure if GPU is used

leadry wrote:

What tells you that my CPU needs more voltage?

mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 0: 9000004000010005

Are you overclocking your CPU?

Anyways like i said previously, overclocked or not, that CPU needs more voltage or lower frequencies.


Excuse my poor English.

Offline

Board footer

Powered by FluxBB