
#1 2023-03-10 12:28:44

am4rtinez
Member
Registered: 2023-03-10
Posts: 1

System goes to black screen (may be Nvidia driver)

Hey y'all,

I'm running the 6.2.2-arch1-1 kernel, NVIDIA driver version 525.89.02 (NVIDIA GeForce 1650), and bspwm.

Suddenly the system goes to a black screen and nothing works, except that I can connect to the machine over ssh and reboot it.

This is the last part of the journal before the crash.

```
mar 10 12:03:46 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:03:46 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:07:17 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:07:17 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:07:20 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:07:20 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:08:16 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:08:16 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:08:21 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:08:21 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:09:35 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:09:35 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:09:39 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:09:39 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:09:41 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:09:41 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:09:47 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:09:47 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:10:17 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:10:17 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:10:20 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:10:20 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:10:34 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:10:34 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:10:38 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:10:38 deimos rtkit-daemon[760]: Supervising 5 threads of 4 processes of 1 users.
mar 10 12:21:09 deimos systemd-timesyncd[496]: Network configuration changed, trying to establish connection.
mar 10 12:21:09 deimos systemd-timesyncd[496]: Contacted time server 193.136.152.72:123 (2.arch.pool.ntp.org).
mar 10 12:42:26 deimos kernel: NVRM: GPU at PCI:0000:01:00: GPU-d78473d2-809c-3a03-cf8a-fcfdd8b07456
mar 10 12:42:26 deimos kernel: NVRM: Xid (PCI:0000:01:00): 61, pid='<unknown>', name=<unknown>, 0cee(2c2c) 00000000 00000000
mar 10 12:42:27 deimos kernel: NVRM: Xid (PCI:0000:01:00): 109, pid=578, name=Xorg, Ch 00000008, errorString CTX SWITCH TIMEOUT, Info 0x24002
mar 10 12:42:41 deimos kernel: NVRM: Xid (PCI:0000:01:00): 109, pid=578, name=Xorg, Ch 0000000a, errorString CTX SWITCH TIMEOUT, Info 0x24002
mar 10 12:42:44 deimos at-spi-bus-launcher[786]: X connection to :0 broken (explicit kill or server shutdown).
mar 10 12:42:44 deimos lightdm[632]: pam_unix(lightdm:session): session closed for user amartinez
mar 10 12:42:44 deimos systemd[1]: Started Process Core Dump (PID 1680469/UID 0).
mar 10 12:42:44 deimos systemd-logind[514]: Session 2 logged out. Waiting for processes to exit.
mar 10 12:42:44 deimos systemd-coredump[1680474]: Process 1318453 (picom) of user 1000 dumped core.
                                                 
                                                  Stack trace of thread 1318453:
                                                  #0  0x00007f800d7278ec n/a (libc.so.6 + 0x878ec)
                                                  #1  0x00007f800d6d8ea8 raise (libc.so.6 + 0x38ea8)
                                                  #2  0x00007f800d6c253d abort (libc.so.6 + 0x2253d)
                                                  #3  0x0000565058c19043 n/a (picom + 0xa043)
                                                  #4  0x0000565058c287cd n/a (picom + 0x197cd)
                                                  #5  0x0000565058c28fc2 n/a (picom + 0x19fc2)
                                                  #6  0x0000565058c47b8d n/a (picom + 0x38b8d)
                                                  #7  0x0000565058c49686 n/a (picom + 0x3a686)
                                                  #8  0x0000565058c1e226 n/a (picom + 0xf226)
                                                  #9  0x0000565058c1ebcc n/a (picom + 0xfbcc)
                                                  #10 0x00007f800dc6b0cb ev_invoke_pending (libev.so.4 + 0x50cb)
                                                  #11 0x00007f800dc6ed10 ev_run (libev.so.4 + 0x8d10)
                                                  #12 0x0000565058c1a6b2 n/a (picom + 0xb6b2)
                                                  #13 0x00007f800d6c3790 n/a (libc.so.6 + 0x23790)
                                                  #14 0x00007f800d6c384a __libc_start_main (libc.so.6 + 0x2384a)
                                                  #15 0x0000565058c1b895 n/a (picom + 0xc895)
                                                  ELF object binary architecture: AMD x86-64
mar 10 12:42:44 deimos systemd[1]: systemd-coredump@1-1680469-0.service: Deactivated successfully.
mar 10 12:42:44 deimos bluetoothd[511]: Endpoint unregistered: sender=:1.30 path=/MediaEndpoint/A2DPSink/sbc
mar 10 12:42:44 deimos bluetoothd[511]: Endpoint unregistered: sender=:1.30 path=/MediaEndpoint/A2DPSource/sbc
mar 10 12:42:44 deimos bluetoothd[511]: Endpoint unregistered: sender=:1.30 path=/MediaEndpoint/A2DPSink/sbc_xq_453
mar 10 12:42:44 deimos bluetoothd[511]: Endpoint unregistered: sender=:1.30 path=/MediaEndpoint/A2DPSource/sbc_xq_453
mar 10 12:42:44 deimos bluetoothd[511]: Endpoint unregistered: sender=:1.30 path=/MediaEndpoint/A2DPSink/sbc_xq_512
mar 10 12:42:44 deimos bluetoothd[511]: Endpoint unregistered: sender=:1.30 path=/MediaEndpoint/A2DPSource/sbc_xq_512
mar 10 12:42:44 deimos bluetoothd[511]: Endpoint unregistered: sender=:1.30 path=/MediaEndpoint/A2DPSink/sbc_xq_552
mar 10 12:42:44 deimos bluetoothd[511]: Endpoint unregistered: sender=:1.30 path=/MediaEndpoint/A2DPSource/sbc_xq_552
mar 10 12:42:44 deimos systemd[1]: session-2.scope: Deactivated successfully.
mar 10 12:42:44 deimos systemd[653]: pulseaudio.service: Consumed 1.808s CPU time.
mar 10 12:42:44 deimos systemd[1]: session-2.scope: Consumed 14h 54min 14.429s CPU time.
mar 10 12:42:45 deimos kernel: NVRM: Xid (PCI:0000:01:00): 109, pid=578, name=Xorg, Ch 00000001, errorString CTX SWITCH TIMEOUT, Info 0x14002
mar 10 12:43:14 deimos kernel: NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x25:0x40:1457)
mar 10 12:43:14 deimos kernel: NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0
mar 10 12:43:30 deimos kernel: NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x25:0x40:1457)
mar 10 12:43:30 deimos kernel: NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0
mar 10 12:43:30 deimos systemd[1]: lightdm.service: Main process exited, code=exited, status=1/FAILURE
mar 10 12:43:30 deimos systemd[1]: lightdm.service: Failed with result 'exit-code'.
mar 10 12:43:30 deimos systemd[1]: lightdm.service: Consumed 3h 56min 44.609s CPU time.
mar 10 12:43:31 deimos systemd[1]: lightdm.service: Scheduled restart job, restart counter is at 1.
mar 10 12:43:31 deimos systemd[1]: Stopped Light Display Manager.
mar 10 12:43:31 deimos systemd[1]: lightdm.service: Consumed 3h 56min 44.609s CPU time.
mar 10 12:43:31 deimos systemd[1]: Starting Light Display Manager...
mar 10 12:43:31 deimos systemd[1]: Started Light Display Manager.
mar 10 12:43:46 deimos systemd[1]: pcscd.service: Deactivated successfully.
mar 10 12:43:47 deimos kernel: NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x25:0x40:1457)
mar 10 12:43:47 deimos kernel: NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0
mar 10 12:43:51 deimos dbus-daemon[512]: [system] Activating via systemd: service name='org.freedesktop.home1' unit='dbus-org.freedesktop.home1.service' requested by ':1.2027757' (uid=0 pid=1680552 comm="sshd: amartinez [priv]")
mar 10 12:43:51 deimos dbus-daemon[512]: [system] Activation via systemd failed for unit 'dbus-org.freedesktop.home1.service': Unit dbus-org.freedesktop.home1.service not found.
mar 10 12:43:51 deimos sshd[1680552]: pam_systemd_home(sshd:auth): systemd-homed is not available: Unit dbus-org.freedesktop.home1.service not found.
```

Any help would be appreciated.

Thanks!

Last edited by am4rtinez (2023-03-10 12:29:35)


#2 2023-05-03 05:06:52

vovan
Member
Registered: 2023-05-03
Posts: 4

Re: System goes to black screen (may be Nvidia driver)

Same here
Driver Version: 525.85.05
Kernel: 5.15.59
I have a 1650 D6.
Seems to be some general problem with Linux and the 1650; it happens roughly once per 2 weeks for me.


#3 2023-05-03 05:56:56

seth
Member
Registered: 2012-09-03
Posts: 49,996

Re: System goes to black screen (may be Nvidia driver)

https://docs.nvidia.com/deploy/xid-errors/index.html - but 61 & 109 are kinda obscure.

mar 10 12:43:47 deimos kernel: NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x25:0x40:1457)
mar 10 12:43:47 deimos kernel: NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0

the device doesn't come up.

it happens roughly once per 2 weeks for me

Try to add "pcie_aspm=off" to the kernel commandline and see whether it still happens.
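After rebooting, it is worth confirming that the parameter really reached the running kernel before waiting two weeks for a result. A minimal sketch in Python, assuming a Linux system where `/proc/cmdline` is readable; the helper names `has_kernel_param` and `running_kernel_has` are made up for illustration:

```python
from pathlib import Path


def has_kernel_param(cmdline: str, param: str) -> bool:
    """Return True if `param` appears as a whole token in a kernel command line."""
    # Kernel parameters are whitespace-separated, so compare whole tokens
    # rather than substrings.
    return param in cmdline.split()


def running_kernel_has(param: str, path: str = "/proc/cmdline") -> bool:
    """Check the command line the currently running kernel was booted with."""
    return has_kernel_param(Path(path).read_text(), param)
```

Calling `running_kernel_has("pcie_aspm=off")` should return `True` once the bootloader entry has been updated and the machine rebooted. How the parameter gets added depends on the bootloader in use.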

@am4rtinez, please use [code][/code] tags, the bbs predates markdown.


#4 2023-05-03 16:43:34

vovan
Member
Registered: 2023-05-03
Posts: 4

Re: System goes to black screen (may be Nvidia driver)

[1022625.245012] NVRM: GPU at PCI:0000:0a:00: GPU-c56d04a3-baa7-3e3b-0d39-8a5b7824bf5b
[1022625.245016] NVRM: Xid (PCI:0000:0a:00): 61, pid='<unknown>', name=<unknown>, 0cee(2c2c) 00000000 00000000
[1022629.250853] NVRM: Xid (PCI:0000:0a:00): 109, pid=1000, name=(udev-worker), Ch 00000002, errorString CTX SWITCH TIMEOUT, Info 0x34003

[1022634.552988] NVRM: Xid (PCI:0000:0a:00): 109, pid=5308, name=X, Ch 00000008, errorString CTX SWITCH TIMEOUT, Info 0x24003

[1022639.859482] NVRM: Xid (PCI:0000:0a:00): 109, pid=5308, name=X, Ch 00000001, errorString CTX SWITCH TIMEOUT, Info 0x14003

[1022640.809961] sched: RT throttling activated
[1022644.357017] nvidia-modeset: ERROR: GPU:0: Failed to idle DMA.
[1022645.162358] NVRM: Xid (PCI:0000:0a:00): 109, pid=5424, name=plasmashell, Ch 00000010, errorString CTX SWITCH TIMEOUT, Info 0x54003

[1022652.697798] nvidia-modeset: ERROR: GPU:0: Failed to idle DMA.
[1022653.962228] NVRM: Xid (PCI:0000:0a:00): 109, pid=6084, name=chrome, Ch 00000040, errorString CTX SWITCH TIMEOUT, Info 0x1d4003

[1022666.841219] nvidia 0000:0a:00.0: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x0011 address=0xbfff8000 flags=0x0000]

My logs

Ok, will try "pcie_aspm=off"


#5 2023-05-03 16:56:36

Scimmia
Fellow
Registered: 2012-09-01
Posts: 11,466

Re: System goes to black screen (may be Nvidia driver)

Why is your system so old? And why is the driver newer than the kernel, but still old? Something is very wrong here.


#6 2023-05-04 05:16:48

vovan
Member
Registered: 2023-05-03
Posts: 4

Re: System goes to black screen (may be Nvidia driver)

It's not "wrong", it's "stable". OP has a fresh kernel and a newer driver but still has the issue, which means it's not related.
But anyway, I'll update the kernel to 6.1.24 and nvidia-drivers to 525.105.17 and test the result along with the new kernel parameter.


#7 2023-05-19 06:52:30

vovan
Member
Registered: 2023-05-03
Posts: 4

Re: System goes to black screen (may be Nvidia driver)

Today I got Xid 61 again with the updated drivers/kernel, so it didn't fix it. Uptime was about 2 weeks (booted May 5 09:52).
But this time the several-minute freeze resulted in an Xorg crash, so at least something changed :)


#8 2023-05-19 07:03:42

seth
Member
Registered: 2012-09-03
Posts: 49,996

Re: System goes to black screen (may be Nvidia driver)

XID 61 is "Internal micro-controller breakpoint/warning", 109 is "Reserved".
Were there any other error messages? Do you have the Xorg log and system journal from that session?
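When digging through a long journal for such messages, it can help to pull out just the Xid events and count them. A rough sketch in Python, assuming the `NVRM: Xid (PCI:...): <code>,` line format seen in the logs posted in this thread; the names `XID_NOTES`, `XID_RE` and `count_xids` are hypothetical, and the notes cover only what this thread itself shows (61 from the description above, 109 from the `errorString` the driver logs):

```python
import re
from collections import Counter

# Notes limited to what appears in this thread; anything else should be
# looked up in NVIDIA's Xid documentation.
XID_NOTES = {
    61: "Internal micro-controller breakpoint/warning",
    109: "logged with errorString CTX SWITCH TIMEOUT",
}

# Matches e.g. "NVRM: Xid (PCI:0000:01:00): 109, pid=578, name=Xorg, ..."
XID_RE = re.compile(r"NVRM: Xid \((?P<pci>PCI:[0-9a-fA-F:.]+)\): (?P<xid>\d+),")


def count_xids(lines):
    """Count how often each Xid code occurs in an iterable of log lines."""
    counts = Counter()
    for line in lines:
        match = XID_RE.search(line)
        if match:
            counts[int(match.group("xid"))] += 1
    return counts


# Three lines copied from the first post, used here as sample input.
sample = [
    "mar 10 12:42:26 deimos kernel: NVRM: Xid (PCI:0000:01:00): 61, pid='<unknown>', name=<unknown>, 0cee(2c2c) 00000000 00000000",
    "mar 10 12:42:27 deimos kernel: NVRM: Xid (PCI:0000:01:00): 109, pid=578, name=Xorg, Ch 00000008, errorString CTX SWITCH TIMEOUT, Info 0x24002",
    "mar 10 12:42:41 deimos kernel: NVRM: Xid (PCI:0000:01:00): 109, pid=578, name=Xorg, Ch 0000000a, errorString CTX SWITCH TIMEOUT, Info 0x24002",
]

for xid, n in sorted(count_xids(sample).items()):
    note = XID_NOTES.get(xid, "see the NVIDIA Xid documentation")
    print(f"Xid {xid}: {n}x ({note})")
```

For a real session, the same function can be fed the kernel messages of the previous boot, for example the output of `journalctl -k -b -1` read line by line.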
