
#1 2012-09-06 14:46:38

cyang
Member
From: Illinois, USA
Registered: 2009-05-28
Posts: 15

NVRM Xid 31 Crash with artifacts on screen

I recently built a new desktop system, but every few days it crashes and leaves artifacts on the screen. I'm not sure what the cause is, but I suspect either my nVidia graphics card or the nvidia kernel module.

Here are some pictures of what the screen looks like after the crash (sorry, a little blurry):
[Images: small_IMG_20120903_213233.jpg, small_IMG_20120903_213247.jpg, small_IMG_20120906_084354.jpg]

The crashes have all occurred while I was away from the computer and it was more or less idle.

I've seen threads on the nVidia forums and nV News describing similar problems, but none with a resolution.

During the crash, I can still SSH into the box. `top` shows that X is still running and using 100% CPU. `journalctl` gives one line of output that seems related:

Sep 06 07:57:38 guava kernel: NVRM: Xid (0000:01:00): 31, Ch 00000000, engmask 00000101, intr 10000000
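For anyone comparing this against their own logs, here is a minimal sketch that splits an Xid line like the one above into its fields. The regular expression and the `parse_xid` helper are my own illustration and assume the exact layout shown in this post; other driver versions may print different fields.

```python
import re

# Pattern for the NVRM Xid line layout shown in this post (assumption:
# other driver versions may format the line differently).
XID_RE = re.compile(
    r"NVRM: Xid \((?P<pci>[0-9a-fA-F:.]+)\): (?P<xid>\d+), "
    r"Ch (?P<channel>[0-9a-fA-F]+), "
    r"engmask (?P<engmask>[0-9a-fA-F]+), "
    r"intr (?P<intr>[0-9a-fA-F]+)"
)


def parse_xid(line):
    """Return a dict of Xid fields, or None if the line is not an Xid report."""
    match = XID_RE.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["xid"] = int(fields["xid"])  # the Xid code is printed in decimal
    return fields


sample = (
    "Sep 06 07:57:38 guava kernel: NVRM: Xid (0000:01:00): 31, "
    "Ch 00000000, engmask 00000101, intr 10000000"
)
print(parse_xid(sample))
```

The hex-looking fields (`Ch`, `engmask`, `intr`) are kept as strings here because the post does not say how they should be interpreted.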

The card is an nVidia GTX 560 Ti and the driver version is 304.43.

[cyang@guava ~]% lspci -nn         
00:00.0 Host bridge [0600]: Intel Corporation Xeon E3-1200 v2/3rd Gen Core processor DRAM Controller [8086:0150] (rev 09)
00:01.0 PCI bridge [0604]: Intel Corporation Xeon E3-1200 v2/3rd Gen Core processor PCI Express Root Port [8086:0151] (rev 09)
00:02.0 Display controller [0380]: Intel Corporation Xeon E3-1200 v2/3rd Gen Core processor Graphics Controller [8086:0162] (rev 09)
00:14.0 USB controller [0c03]: Intel Corporation 7 Series/C210 Series Chipset Family USB xHCI Host Controller [8086:1e31] (rev 04)
00:16.0 Communication controller [0780]: Intel Corporation 7 Series/C210 Series Chipset Family MEI Controller #1 [8086:1e3a] (rev 04)
00:1a.0 USB controller [0c03]: Intel Corporation 7 Series/C210 Series Chipset Family USB Enhanced Host Controller #2 [8086:1e2d] (rev 04)
00:1b.0 Audio device [0403]: Intel Corporation 7 Series/C210 Series Chipset Family High Definition Audio Controller [8086:1e20] (rev 04)
00:1c.0 PCI bridge [0604]: Intel Corporation 7 Series/C210 Series Chipset Family PCI Express Root Port 1 [8086:1e10] (rev c4)
00:1c.3 PCI bridge [0604]: Intel Corporation 7 Series/C210 Series Chipset Family PCI Express Root Port 4 [8086:1e16] (rev c4)
00:1c.4 PCI bridge [0604]: Intel Corporation 7 Series/C210 Series Chipset Family PCI Express Root Port 5 [8086:1e18] (rev c4)
00:1c.5 PCI bridge [0604]: Intel Corporation 7 Series/C210 Series Chipset Family PCI Express Root Port 6 [8086:1e1a] (rev c4)
00:1c.7 PCI bridge [0604]: Intel Corporation 7 Series/C210 Series Chipset Family PCI Express Root Port 8 [8086:1e1e] (rev c4)
00:1d.0 USB controller [0c03]: Intel Corporation 7 Series/C210 Series Chipset Family USB Enhanced Host Controller #1 [8086:1e26] (rev 04)
00:1f.0 ISA bridge [0601]: Intel Corporation Z77 Express Chipset LPC Controller [8086:1e44] (rev 04)
00:1f.2 SATA controller [0106]: Intel Corporation 7 Series/C210 Series Chipset Family 6-port SATA Controller [AHCI mode] [8086:1e02] (rev 04)
00:1f.3 SMBus [0c05]: Intel Corporation 7 Series/C210 Series Chipset Family SMBus Controller [8086:1e22] (rev 04)
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GF114 [GeForce GTX 560 Ti] [10de:1200] (rev a1)
01:00.1 Audio device [0403]: NVIDIA Corporation GF114 HDMI Audio Controller [10de:0e0c] (rev a1)
02:00.0 Network controller [0280]: Realtek Semiconductor Co., Ltd. RTL8188CE 802.11b/g/n WiFi Adapter [10ec:8178] (rev 01)
03:00.0 SATA controller [0106]: ASMedia Technology Inc. ASM1062 Serial ATA Controller [1b21:0612] (rev 01)
04:00.0 Ethernet controller [0200]: Broadcom Corporation NetLink BCM57781 Gigabit Ethernet PCIe [14e4:16b1] (rev 10)
05:00.0 PCI bridge [0604]: ASMedia Technology Inc. ASM1083/1085 PCIe to PCI Bridge [1b21:1080] (rev 03)
07:00.0 USB controller [0c03]: ASMedia Technology Inc. ASM1042 SuperSpeed USB Host Controller [1b21:1042]
[cyang@guava ~]% modinfo nvidia
filename:       /usr/lib/modules/3.5.3-2-cy/extramodules/nvidia.ko.gz
alias:          char-major-195-*
version:        304.43
supported:      external
license:        NVIDIA
alias:          pci:v000010DEd00000E00sv*sd*bc04sc80i00*
alias:          pci:v000010DEd00000AA3sv*sd*bc0Bsc40i00*
alias:          pci:v000010DEd*sv*sd*bc03sc02i00*
alias:          pci:v000010DEd*sv*sd*bc03sc00i00*
depends:        
vermagic:       3.5.3-2-cy SMP preempt mod_unload modversions
[...]

Does anyone know whether this is a problem with the hardware or with the kernel module? Any ideas on how to fix it? Thanks!

The rest of the lines from `journalctl` seem irrelevant, but here's a sample:

Sep 06 05:07:34 guava ddclient[14894]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 05:17:34 guava ddclient[14896]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 05:27:34 guava ddclient[14900]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 05:37:34 guava ddclient[14904]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 05:47:34 guava ddclient[14908]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 05:57:34 guava ddclient[14912]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 06:01:01 guava /usr/sbin/crond[14913]: pam_unix(crond:session): session opened for user root by (uid=0)
Sep 06 06:01:01 guava /USR/SBIN/CROND[14914]: (root) CMD (run-parts /etc/cron.hourly)
Sep 06 06:01:01 guava /USR/SBIN/CROND[14913]: pam_unix(crond:session): session closed for user root
Sep 06 06:07:34 guava ddclient[14919]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 06:17:34 guava ddclient[14921]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 06:27:34 guava ddclient[14925]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 06:37:34 guava ddclient[14927]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 06:47:34 guava ddclient[14933]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 06:57:34 guava ddclient[14936]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 07:01:01 guava /usr/sbin/crond[14937]: pam_unix(crond:session): session opened for user root by (uid=0)
Sep 06 07:01:01 guava /USR/SBIN/CROND[14938]: (root) CMD (run-parts /etc/cron.hourly)
Sep 06 07:01:01 guava /USR/SBIN/CROND[14937]: pam_unix(crond:session): session closed for user root
Sep 06 07:07:34 guava ddclient[14944]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 07:17:35 guava ddclient[14947]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 07:27:35 guava ddclient[14951]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 07:37:35 guava ddclient[14955]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 07:47:35 guava ddclient[14960]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 07:57:35 guava ddclient[14963]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 07:57:38 guava kernel: NVRM: Xid (0000:01:00): 31, Ch 00000000, engmask 00000101, intr 10000000
Sep 06 08:01:01 guava /usr/sbin/crond[14966]: pam_unix(crond:session): session opened for user root by (uid=0)
Sep 06 08:01:01 guava /USR/SBIN/CROND[14967]: (root) CMD (run-parts /etc/cron.hourly)
Sep 06 08:01:01 guava /USR/SBIN/CROND[14966]: pam_unix(crond:session): session closed for user root
Sep 06 08:07:35 guava ddclient[14971]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 08:17:35 guava ddclient[14974]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 08:27:35 guava ddclient[14977]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 08:37:35 guava ddclient[14980]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 08:40:47 guava synergys[2811]: Synergy 1.4.10: 2012-09-06T08:40:47 NOTE: client "blueberry" is dead
Sep 06 08:42:50 guava sshd[14982]: Accepted password for cyang from 192.168.1.133 port 36740 ssh2
Sep 06 08:42:50 guava sshd[14982]: pam_unix_session(sshd:session): session opened for user cyang by (uid=0)
Sep 06 08:42:50 guava console-kit-daemon[297]: missing action
Sep 06 08:42:50 guava systemd-logind[221]: New session c11 of user cyang.
Sep 06 08:44:25 guava sudo[15037]: cyang : TTY=pts/5 ; PWD=/home/cyang ; USER=root ; COMMAND=/usr/bin/vim /etc/lxdm/PostLogin
Sep 06 08:44:25 guava sudo[15037]: pam_unix(sudo:session): session opened for user root by cyang(uid=0)
Sep 06 08:44:27 guava sudo[15037]: pam_unix(sudo:session): session closed for user root
Sep 06 08:45:09 guava sudo[15041]: cyang : TTY=pts/5 ; PWD=/home/cyang ; USER=root ; COMMAND=/usr/bin/journalctl
Sep 06 08:45:09 guava sudo[15041]: pam_unix(sudo:session): session opened for user root by cyang(uid=0)
Sep 06 08:47:35 guava ddclient[15044]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 08:57:35 guava ddclient[15048]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 09:01:01 guava /usr/sbin/crond[15050]: pam_unix(crond:session): session opened for user root by (uid=0)
Sep 06 09:01:01 guava /USR/SBIN/CROND[15051]: (root) CMD (run-parts /etc/cron.hourly)
Sep 06 09:01:01 guava /USR/SBIN/CROND[15050]: pam_unix(crond:session): session closed for user root
Sep 06 09:07:35 guava ddclient[15056]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'
Sep 06 09:09:51 guava sshd[15058]: Accepted password for cyang from 192.168.1.133 port 37472 ssh2
Sep 06 09:09:51 guava sshd[15058]: pam_unix_session(sshd:session): session opened for user cyang by (uid=0)
Sep 06 09:09:51 guava console-kit-daemon[297]: missing action
Sep 06 09:09:51 guava systemd-logind[221]: New session c12 of user cyang.
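Since most of that sample is ddclient and cron noise, a small filter can help when scanning a longer journal dump. The following is only a sketch: `LINE_RE` and `kernel_messages` are hypothetical names, and the pattern assumes the `Mon DD HH:MM:SS host ident[pid]: message` layout seen in the lines above.

```python
import re

# Assumed layout, taken from the sample above:
#   "Sep 06 07:57:38 guava kernel: message"
#   "Sep 06 08:44:25 guava sudo[15037]: message"
LINE_RE = re.compile(
    r"^(\w{3} \d{2} \d{2}:\d{2}:\d{2}) (\S+) ([^:\[]+)(?:\[\d+\])?: (.*)$"
)


def kernel_messages(lines):
    """Yield (timestamp, message) for lines logged by the kernel."""
    for line in lines:
        match = LINE_RE.match(line.rstrip("\n"))
        if match and match.group(3) == "kernel":
            yield match.group(1), match.group(4)


# Three lines copied from the journal sample in this post.
sample = [
    "Sep 06 07:57:35 guava ddclient[14963]: WARNING:  cannot connect to checkip.dyndns.com:80 socket: IO::Socket::INET: Bad hostname 'checkip.dyndns.com'",
    "Sep 06 07:57:38 guava kernel: NVRM: Xid (0000:01:00): 31, Ch 00000000, engmask 00000101, intr 10000000",
    "Sep 06 08:01:01 guava /USR/SBIN/CROND[14967]: (root) CMD (run-parts /etc/cron.hourly)",
]

for timestamp, message in kernel_messages(sample):
    print(timestamp, message)
```

Saving the journal output to a file and passing the open file object to `kernel_messages` should work the same way, since it only iterates over lines.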

Edit: I killed X, and `journalctl` gave the following output:

Sep 06 09:50:06 guava synergys[2811]: Synergy 1.4.10: 2012-09-06T09:50:06 FATAL: X display has unexpectedly disconnected
Sep 06 09:50:06 guava synergys[2811]: Synergy 1.4.10: 2012-09-06T09:50:06 DEBUG: adopting new buffer
Sep 06 09:50:06 guava synergys[2811]: Synergy 1.4.10: 2012-09-06T09:50:06 DEBUG: discarding 6 event(s)
Sep 06 09:50:06 guava systemd[1]: synergys.service: main process exited, code=killed, status=6
Sep 06 09:50:06 guava systemd[1]: Unit synergys.service entered failed state.
Sep 06 09:50:06 guava lxdm-binary[229]: pam_unix(lxdm:session): session closed for user cyang
Sep 06 09:50:06 guava kernel: NVRM: GPU at 0000:01:00: GPU-e82cc4e3-4b64-d276-28a9-e1d75a726965
Sep 06 09:50:28 guava sudo[15765]: cyang : TTY=pts/5 ; PWD=/home/cyang ; USER=root ; COMMAND=/usr/bin/journalctl -f
Sep 06 09:50:28 guava sudo[15765]: pam_unix(sudo:session): session opened for user root by cyang(uid=0)
Sep 06 09:50:55 guava kernel: NVRM: Xid (0000:01:00): 31, Ch 00000000, engmask 00000101, intr 10000000
Sep 06 09:50:59 guava systemd[1]: lxdm.service: main process exited, code=exited, status=1
Sep 06 09:50:59 guava systemd[1]: Unit lxdm.service entered failed state.

Reloading the nvidia module and trying to restart lxdm gives me this:

Sep 06 09:52:28 guava kernel: vgaarb: device changed decodes: PCI:0000:01:00.0,olddecodes=none,decodes=none:owns=io+mem
Sep 06 09:52:28 guava kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel Module  304.43  Sun Aug 19 20:14:03 PDT 2012
Sep 06 09:52:28 guava sudo[15780]: pam_unix(sudo:session): session closed for user root
Sep 06 09:52:39 guava sudo[15973]: cyang : TTY=pts/6 ; PWD=/home/cyang ; USER=root ; COMMAND=/usr/bin/systemctl restart lxdm.service
Sep 06 09:52:39 guava sudo[15973]: pam_unix(sudo:session): session opened for user root by cyang(uid=0)
Sep 06 09:52:39 guava sudo[15973]: pam_unix(sudo:session): session closed for user root
Sep 06 09:52:40 guava kernel: NVRM: GPU at 0000:01:00: GPU-e82cc4e3-4b64-d276-28a9-e1d75a726965
Sep 06 09:52:40 guava kernel: NVRM: Your system is not currently configured to drive a VGA console
Sep 06 09:52:40 guava kernel: NVRM: on the primary VGA device. The NVIDIA Linux graphics driver
Sep 06 09:52:40 guava kernel: NVRM: requires the use of a text-mode VGA console. Use of other console
Sep 06 09:52:40 guava kernel: NVRM: drivers including, but not limited to, vesafb, may result in
Sep 06 09:52:40 guava kernel: NVRM: corruption and stability problems, and is not supported.
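The NVRM warning above is about which console driver is active. One way to check, sketched below, is to list the framebuffer devices the kernel has registered in `/proc/fb`, where each line is a device number followed by a driver name. The helper names are hypothetical, and what the result means for this particular crash is an open question, not something the post establishes.

```python
def parse_proc_fb(text):
    """Parse /proc/fb content into a list of (number, driver_name) tuples."""
    devices = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is "<number> <name>"; the name itself may contain spaces.
        number, _, name = line.partition(" ")
        devices.append((int(number), name))
    return devices


def read_framebuffers(path="/proc/fb"):
    """Return registered framebuffers, or an empty list if the file is unreadable."""
    try:
        with open(path) as handle:
            return parse_proc_fb(handle.read())
    except OSError:
        return []


print(read_framebuffers())
```

An empty list would suggest no framebuffer console driver is registered (or that the file could not be read); a named entry shows which driver currently owns the console.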

Edit 2:
My problem may be related to Problem 2 in the thread "Xorg either crashes, or faces an mi EQ buffer overflow with Nividia".

Last edited by cyang (2012-09-16 21:12:27)
