You are not logged in.
I am trying to play some windows games but after some time the whole system crashes (freezes for a few seconds, including audio and then turning off and automatically turning on again). I tried many versions of both proton and wine:
proton stable
wine stable
proton experimental
wine staging
proton-ge
wine-ge
UMU-proton
but all do that, I tried multiple games and the time it takes to crash changes, it looks like the less the game is optimized the faster it crashes. (Techtonica for example after making some progress crashes in 2 minutes)
I checked the journal for anything but all there's nothing just at some point things start closing
Offline
What's your hardware, which exact games, do you get logs from these crashes? You say you checked the journal but maybe you're looking for a different piece of information, can you post such a journal? Assuming you had to reboot post the journal of the previous boot e.g.
sudo journalctl -b-1 | curl -F 'file=@-' 0x0.st
, fwiw generally if you're forced to reboot and you do so by pressing the power button, try not to do that but enable SysRQ, and use the REISUB combination to safely reboot (leave a pause of a few seconds between each key press of the sequence, especially after the S)
Online
the games I tried are Assassin's Creed 2 and Techtonica, Assassin's Creed 2 takes like 2 hours to crash. I don't get any logs that I know of.
GPU: Geforce RTX 3050
iGPU (not actually used): UHD Graphics 770
CPU: 12th Gen Intel i5-12500
nvidia driver: 565.57.01
memory: 32GB
journal:
https://0x0.st/Xn6M.txt
and I'm not pressing any power button, it reboots itself
Offline
Are you "just" running out of VRAM? The messages at the end sound like the nvidia driver can't allocate some resources anymore. FWIW for a test, can you reproduce the issue with the nvidia, rather than the nvidia-open module?
Online
okay so I tried it with the closed drivers and it still freezed, though this time it didn't reboot by itself, only freezed. I forgot to enable SysRQ so I had to force shutdown with the power button. the freeze was at 15:50
journal:
https://0x0.st/X5o-.txt
Offline
that's quite a hefty board for the rest of the components
what cooler do you use? is the cased closed? have you tried with side panel off?
check temps - sounds like a thermsl issue to me - like someone forgot to remove the plastic cover from the heatsink before mounting it
btw - my grammar-nazi triggers: the past tense of freeze is froze - there's no freezed
recommand take a look at https://www.deepl.com/
Offline
So I just tested it without a side panel and it didn't help. I also continuously monitored the temps of the GPU and the CPU. the CPU max was one core that reached 52C, and for the GPU the max was 63C. Both of those high points didn't happen at the crash. the crash did make the computer reboot automatically again. And I don't know what is my cooler.
Offline
And I don't know what is my cooler.
is it an air cooler or a liquid cooling?
have you checked that the protective plastic cover was removed before installing it?
if it'S not temps it's likely a bad power supply - I would try to replace it - and even if it's not the issue: there's always a good reason to have a spare psu
Offline
it's an air cooler, and there's not plastic cover on it. and how could it be a power supply issue if it happens only with wine? I'm running heavier things just fine without crashing, like a windows vm for hours.
Offline
a system randomly insant power-off or reboot is almost always either a thermal issue or a power issue - or damage already done caused by the previous two
replacing the power supply is usually both the cheapest and simplest option - and as it is one of the most common causes it also has a high impact rate as to eith be the sole culprit or at leadt help to diagnose other issues
and: running a vm is hardly any additional load on top of the host - it's just sharing resources with minimal overhead
if you really want to stress your system use furmark for gpu and something like prime95 for cpu - these synthetic loads have way more weight than calling running a vm a heavy workload
Offline
"iommu=pt intel_iommu=on"
Why is that there?
Offline
the iommu kernel parameters are from when I tried doing gpu passthrough for a vm, it didn't happen in the end. I had the wine issue before that and I also now tested without them again and it's still crashing.
a new power supply will take some time, so I'll do it if no other solution is found. also I did the stress tests and no crashing occurred.
Offline
well - to put it this way: wine/proton or anything in its stack surely is not the actual root cause - it's merely the trigger which sets up a chain which then ends up in the system crash
in order to have a hard reset happen just from software there has to happen quite a lot in a modern OS as something has to trigger which has access to the stuff required for the hard reset
and although it CAN happen from user code usually the kernel is designed to prevent user space code go that ham and prevent it
however - the kernel itself has to have access to such powerful tools
so - what can be a possible root cause are kernel or driver bugs which run in privilege mode able to trigger a reset - but even then this usually should also trigger with synthetic load as in the end for both gpu and cpu there're just so many routes from user level code to kernel/driver level code - it's not like synthetic stress test tools somehow are able to make calls different from what games or wine is
and again - as said: the psu is usually one of the cheaper parts of a system along with the case - as both the motherboard and ram usually are more expensive - and it's also a common point of failure to many problems - have a look over at reddit at r/datahoarder for threads reading like "random data corruption" - and more often it ends up faulty power supplies deliever just instable power to the drives and rest of the system - which does cause enough interference to cause data corruption or data loss - and although I can't really explain it better it's just from observation and personal experience
also: as mentioned it's nothing bad with having a spare psu on hand as even when they sit a year or two on the shelf they don't lose much value - so it just makes sense to start with the psu to replace as you're likely able to flip it otherwise
Offline
i see there is a bios update for your board, worth a try to see if that sorts the problem
also i would try removing the nvidia gpu and running off the igpu to see if the crash happens that way (make sure you remove the nvidia_drm.modeset=1 kernel parameter), maybe even try older nvidia drivers and/or the lts kernel, just try anything you can think of to try and pinpoint the cause of the issue, good old trial and error.
also remove those other kernel parameters related to passthrough as well
Offline
how could it be a power supply issue if it happens only with wine?
Does https://wiki.archlinux.org/title/Steam/ … _emulation help?
Offline