You are not logged in.

#1 2024-03-30 06:55:42

insidesources
Member
Registered: 2024-03-30
Posts: 11

Displays freezing

Hello,

my pc started to freeze about a week ago - the monitors freeze thus forcing me to reboot.

I am using the following:
kernel 6.8.2-zen2-1-zen
nvidia-dkms 550.67-1

journal has this: kernel: NVRM: Xid (PCI:0000:01:00): 56, pid='<unknown>', name=<unknown>, CMDre 00000003 0000017c 00000000 00000003 00000000

checked here: https://docs.nvidia.com/deploy/xid-errors/index.html

56: Display Engine error

This is super frustrating - does anyone know how we can resolve this?

Offline

#2 2024-03-30 07:50:28

seth
Member
Registered: 2012-09-03
Posts: 51,671

Re: Displays freezing

XID 56 is hardware or driver, so either your GPU is underpowered, overheating, mis-seated or moving to a serverfarm upstate.
Or it's a driver issue.

Please don't post errors out of context

sudo journalctl -b | curl -F 'file=@-' 0x0.st

and try the behavior w/ the LTS kernel and possibly an older version of nvidia-dkms (and utils) from the ALA (535xx or 545xx won't build w/ newer kernels unless patched to work around GPL issues)

Offline

#3 2024-03-31 01:31:34

insidesources
Member
Registered: 2024-03-30
Posts: 11

Re: Displays freezing

https://pastebin.com/0V6EmtkY

here's more of the journal if you want to see.

power/temps look good - i can reseat it to make sure but i'm leaning towards a driver issue

Last edited by insidesources (2024-03-31 01:32:07)

Offline

#4 2024-03-31 09:44:42

seth
Member
Registered: 2012-09-03
Posts: 51,671

Re: Displays freezing

That segment still starts at the XID56 and therefore tells nothing about the conditions leading up to it. Or the general setup.
As random pickup, there's

vmnetBridge[1962]: RTM_NEWLINK: name:wlan0 index:3 flags:0x00001002
Mar 30 02:13:54 q4pt99x vmnet-natd

so any kind of VM passthrough efforts might be a factor here, but that's spculation based on virtually no information.

Offline

#5 2024-04-01 06:38:32

insidesources
Member
Registered: 2024-03-30
Posts: 11

Re: Displays freezing

That was the start of the log when i checked, i didn't have my PC on for very long before that. I can capture another one when it freezes again.

Is there anyone else having this issue?

Offline

#6 2024-04-01 07:17:19

seth
Member
Registered: 2012-09-03
Posts: 51,671

Re: Displays freezing

sudo journalctl -b

is gonna show you the log for the entire boot and you can go back in time by making it "-b -1", "-b -2" …
Since this is plausibly a hardware problem, asking for echo is not a promising strategy.
Next to showing us the journal of the incident you could try to downgrade the driver (but mind you that older versions like 535xx, 545xx) will only build against the LTS kernel - you'll still likely figure whether it's a HW or SW issue.

Offline

#7 2024-04-07 22:27:19

insidesources
Member
Registered: 2024-03-30
Posts: 11

Re: Displays freezing

https://pastebin.com/ZRLnCiSF

it hasn't crashed in awhile - and now it crashed again at 5:51pm - here's the log - before you say there's no info, the lines above what i pasted are just repeating the same thing about failed to lookup and polkit etc

Last edited by insidesources (2024-04-07 22:27:43)

Offline

#8 2024-04-08 07:21:31

seth
Member
Registered: 2012-09-03
Posts: 51,671

Re: Displays freezing

The system journal is supposed to start w/ the DMI messages, kernel, initramfs…
We're still at

nothing about the conditions leading up to it. Or the general setup.
As random pickup, there's

vmnetBridge[1962]: RTM_NEWLINK: name:wlan0 index:3 flags:0x00001002
Mar 30 02:13:54 q4pt99x vmnet-natd

so any kind of VM passthrough efforts might be a factor here, but that's spculation based on virtually no information.

The XID56 errors in that segment start immediately after

Apr 07 17:51:03 q4pt99x vmnetBridge[1963]: RTM_NEWLINK: name:wlan0 index:3 flags:0x00001002
Apr 07 17:51:03 q4pt99x vmnet-natd[2037]: RTM_NEWLINK: name:wlan0 index:3 flags:0x00001002
Apr 07 17:51:03 q4pt99x vmnetBridge[1963]: RTM_NEWLINK: name:wlan0 index:3 flags:0x00001002
Apr 07 17:51:03 q4pt99x NetworkManager[1909]: <info>  [1712526663.4848] device (wlan0): set-hw-addr: set MAC address to 92:AF:A9:99:0D:42 (scanning)
Apr 07 17:51:03 q4pt99x vmnet-natd[2037]: RTM_NEWLINK: name:wlan0 index:3 flags:0x00001002

confirming that pattern, but w/ the partial logs it's impossible to say whether that's a fluke and you had frequent vmnet-natd messages in a long running VM before w/o causing any nvidia-related issues or whether you're trying to do sth. special w/ the nvidia GPU.

sudo journalctl -b -1 | curl -F 'file=@-' 0x0.st # for the entire journal of the previous boot

Offline

#9 2024-04-11 21:36:35

insidesources
Member
Registered: 2024-03-30
Posts: 11

Re: Displays freezing

I tried to get the entire boot this time, it is 800mb so sorry for the size.

I crashed at 3:26PM during a work meeting.

I have noticed one thing, most of the crashes i have are only if my browser is focused and i'm watching a video *most* of the time, and sometimes i'm not even at my desk.

Last edited by insidesources (2024-04-19 06:11:45)

Offline

#10 2024-04-11 22:44:58

seth
Member
Registered: 2012-09-03
Posts: 51,671

Re: Displays freezing

800MB??
I suppose most of that is rtkit?
https://www.reddit.com/r/archlinux/comm … trollably/
https://github.com/heftig/rtkit/issues/ … 1321085246

You can treat the journal w/

grep -vE 'rtkit-daemon.*: (Failed to look up client|Warning)'

my pc started to freeze about a week ago

What happened at that point? Did you upgrade to the 550xx nvidia drivers?

Offline

#11 2024-04-13 04:18:42

insidesources
Member
Registered: 2024-03-30
Posts: 11

Re: Displays freezing

seth wrote:

800MB??
I suppose most of that is rtkit?
https://www.reddit.com/r/archlinux/comm … trollably/
https://github.com/heftig/rtkit/issues/ … 1321085246

You can treat the journal w/

grep -vE 'rtkit-daemon.*: (Failed to look up client|Warning)'

thank you - i do have to lessen the logging on that, it's ridiculous

my pc started to freeze about a week ago

What happened at that point? Did you upgrade to the 550xx nvidia drivers?

I update my pc almost all of the time, i have to see when the last driver update was before that or around that time. And 1 update back from that.

This issue happened a long time ago(over a year) then eventually went away, and now it's back again.

Offline

#12 2024-04-13 07:28:09

seth
Member
Registered: 2012-09-03
Posts: 51,671

Re: Displays freezing

Report back w/ the insights from the pacman log and a hopefully journal-sized journal  wink

Offline

#13 2024-04-14 04:08:56

insidesources
Member
Registered: 2024-03-30
Posts: 11

Re: Displays freezing

https://pastebin.com/mJ8dVxAx

i cleaned it up a bit, hope this helps

Offline

#14 2024-04-14 07:45:27

seth
Member
Registered: 2012-09-03
Posts: 51,671

Re: Displays freezing

xdg-desktop-portal crashes a lot in glib2, https://bbs.archlinux.org/viewtopic.php … 3#p2164563

vmnet-natd seems a fluke, shows only up 6h before the XID56/32 burst and a minute afterwards.
The nvidia failure is rather isolated w/ > 1h gap before in the journal.

a) does this only happen w/ the zen kernel?
b) does this only happen when the GPU is idle (yo're not doing anything and there's also no GPGPU job (ollama) in the background?
c) do you use https://archlinux.org/packages/extra/x8 … lama-cuda/ ? Is it running while this happens? Temperature issue?
e) try to disable

pcie_aspm=off nvidia.NVreg_DynamicPowerManagement=0x00

https://wiki.archlinux.org/title/Kernel_parameters
https://download.nvidia.com/XFree86/Lin … ement.html

Offline

#15 2024-04-16 02:16:00

insidesources
Member
Registered: 2024-03-30
Posts: 11

Re: Displays freezing

seth wrote:

a) does this only happen w/ the zen kernel?

only one i've used recently - but the first time this happened i was using the regular arch kernel

b) does this only happen when the GPU is idle (yo're not doing anything and there's also no GPGPU job (ollama) in the background?

most of the time yes - only when i'm web browsing mostly or if i go away from my pc for awhile

c) do you use https://archlinux.org/packages/extra/x8 … lama-cuda/ ? Is it running while this happens? Temperature issue?

I don't use, and my temps are good - i have a full watercooling loop. 80-110 F depending on what i'm doing

e) try to disable

pcie_aspm=off nvidia.NVreg_DynamicPowerManagement=0x00

https://wiki.archlinux.org/title/Kernel_parameters
https://download.nvidia.com/XFree86/Lin … ement.html

thank you

Offline

#16 2024-04-17 01:46:38

insidesources
Member
Registered: 2024-03-30
Posts: 11

Re: Displays freezing

https://pastebin.com/XF7WvtLt

another crash tonight, happened in the last 2 hours, i left my browser selected and went AFK - came back to black monitors and pc wouldnt wake them up, i SSH'd into my pc and to check it was still awake and then rebooted

Offline

#17 2024-04-17 17:02:51

seth
Member
Registered: 2012-09-03
Posts: 51,671

Re: Displays freezing

Apr 16 20:12:24 q4pt99x zerotier-one[2140]: connect: Connection timed out
Apr 16 20:12:30 q4pt99x zerotier-one[2140]: connect: Connection timed out
Apr 16 20:12:34 q4pt99x kernel: NVRM: GPU at PCI:0000:01:00: GPU-c7ae409a-8791-b846-1e8f-b101104d3ed1
Apr 16 20:12:34 q4pt99x kernel: NVRM: Xid (PCI:0000:01:00): 56, pid='<unknown>', name=<unknown>, CMDre 00000003 00000000 00000000 00000001 00000000
Apr 16 20:12:34 q4pt99x kernel: NVRM: Xid (PCI:0000:01:00): 32, pid='<unknown>', name=<unknown>, Channel ID 00000003 intr 00004000
Apr 16 20:12:34 q4pt99x kernel: NVRM: Xid (PCI:0000:01:00): 32, pid='<unknown>', name=<unknown>, Channel ID 00000003 intr 00004000
Apr 16 20:12:34 q4pt99x kernel: NVRM: Xid (PCI:0000:01:00): 45, pid='<unknown>', name=<unknown>, Ch 00000000
…

and all downhill from there.
This is already w/ "pcie_aspm=off nvidia.NVreg_DynamicPowerManagement=0x00"?
(The journal starts late)

https://www.nvidia.com/en-us/geforce/fo … ery-in-ev/  - they first speculate it's maybe an unsupported freesync output and the tail has a stable sytem on 528.49 (this is all on windows)

If you want to try to downgrade to eg. the 535xx series from teh ALA you'll have to use the LTS kernel as the older drivers are incompatible w/ GPL restrictions in the latests kernels.
(You'd use the nvidia-dkms and 535xx-version matching nvidia-utils, https://wiki.archlinux.org/title/Arch_Linux_Archive )

Offline

#18 2024-04-18 04:28:31

insidesources
Member
Registered: 2024-03-30
Posts: 11

Re: Displays freezing

seth wrote:

This is already w/ "pcie_aspm=off nvidia.NVreg_DynamicPowerManagement=0x00"?
(The journal starts late)

I just set it right now, so let's see what happens.

https://www.nvidia.com/en-us/geforce/fo … ery-in-ev/  - they first speculate it's maybe an unsupported freesync output and the tail has a stable sytem on 528.49 (this is all on windows)

Does this apply if i have g sync monitors?

If you want to try to downgrade to eg. the 535xx series from teh ALA you'll have to use the LTS kernel as the older drivers are incompatible w/ GPL restrictions in the latests kernels.
(You'd use the nvidia-dkms and 535xx-version matching nvidia-utils, https://wiki.archlinux.org/title/Arch_Linux_Archive )

Thank you. Maybe give this kernel para some time and i will do that eventually.

Offline

#19 2024-04-18 07:33:12

seth
Member
Registered: 2012-09-03
Posts: 51,671

Re: Displays freezing

Does this apply if i have g sync monitors?

G-Sync would be certified for nvidia, but oc. could be falsely been certified.
You could therefore also try to trigger this w/ G-Sync disabled (in monitor and driver)

On aformal note, please try to not wrap your own reply into quote tags.

Offline

#20 2024-04-18 15:22:34

insidesources
Member
Registered: 2024-03-30
Posts: 11

Re: Displays freezing

insidesources wrote:
seth wrote:

This is already w/ "pcie_aspm=off nvidia.NVreg_DynamicPowerManagement=0x00"?
(The journal starts late)

I just set it right now, so let's see what happens.

As soon as i opened this tab to view your reply, bam crashed. 10:53am - waited a few minutes and manually rebooted my pc at 10:57am

got a 53 then 32 32 - https://pastebin.com/B0xZDXKQ with the kernel paras above ^

Offline

#21 2024-04-18 20:35:37

seth
Member
Registered: 2012-09-03
Posts: 51,671

Re: Displays freezing

In order to rule out a hardware issue you should test the behavior either w/ https://archlinux.org/packages/extra/x8 … idia-open/ or (possibly better) w/ the LTS kernel and possibly an older version of nvidia-dkms (and utils) from the ALA (535xx or 545xx won't build w/ newer kernels unless patched to work around GPL issues)

Offline

Board footer

Powered by FluxBB