#1 2014-08-30 10:23:14

vanquish
Member
Registered: 2013-12-28
Posts: 49

[SOLVED][ATI] Driver and/or Kernel Issue?

Hi all,

I've got a serious issue on my workstation.

The installation is only a few days old, and it is the first Linux installation on this system, so I don't know whether other distributions or kernel versions would run without issues. I have not tried to install the proprietary ATI drivers because Xorg 1.16 is installed.

System specs:
GPU ATI R9 270x
CPU Intel Haswell i5-4440
Board MSI H87-G43
RAM 8 GB Mushkin DDR-3/1333
Samsung SSD 830

Problem description:

The system crashes randomly (after 5 minutes or after two hours). The screen blanks and nothing responds, so I had to do a hard reset every time. This happens in all circumstances, e.g. while browsing the Internet or even when the computer is idle. In all those cases I got no log entries.
But the last crash was different: the display came back after a few minutes (just for the record: I waited 10-15 minutes after every crash. -.-).
I've got the following log:

Log:

Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0: ring 0 stalled for more than 10013msec
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000001d5b16 last fence id 0x00000000001d5b12 on ring 0)
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0: scheduling IB failed (-35).
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0: Saved 27536 dwords of commands on ring 0.
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0: GPU softreset: 0x0000006C
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS               = 0xA0003028
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS_SE0           = 0x00000006
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS_SE1           = 0x00000006
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   SRBM_STATUS               = 0x20000AC0
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   SRBM_STATUS2              = 0x00000000
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00010000
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00000002
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x80010243
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44483146
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   R_00D834_DMA_STATUS_REG   = 0x44C84246
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00000000
Aug 29 10:25:46 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0: GRBM_SOFT_RESET=0x0000DDFF
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00100140
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS               = 0x00003028
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS_SE0           = 0x00000006
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS_SE1           = 0x00000006
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   SRBM_STATUS               = 0x200000C0
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   SRBM_STATUS2              = 0x00000000
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00000000
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00000000
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x00000000
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0:   R_00D834_DMA_STATUS_REG   = 0x44C83D57
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0: GPU reset succeeded, trying to resume
Aug 29 10:25:47 localhost kernel: [drm] probing gen 2 caps for device 8086:c01 = 261ad03/e
Aug 29 10:25:47 localhost kernel: [drm] PCIE gen 3 link speeds already enabled
Aug 29 10:25:47 localhost kernel: [drm] PCIE GART of 1024M enabled (table at 0x0000000000276000).
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0: WB enabled
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000080000c00 and cpu addr 0xffff8800da15ec00
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0: fence driver on ring 1 use gpu addr 0x0000000080000c04 and cpu addr 0xffff8800da15ec04
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0: fence driver on ring 2 use gpu addr 0x0000000080000c08 and cpu addr 0xffff8800da15ec08
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000080000c0c and cpu addr 0xffff8800da15ec0c
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0: fence driver on ring 4 use gpu addr 0x0000000080000c10 and cpu addr 0xffff8800da15ec10
Aug 29 10:25:47 localhost kernel: radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 0x0000000000075a18 and cpu addr 0xffffc90004fb5a18
Aug 29 10:25:47 localhost kernel: [drm] ring test on 0 succeeded in 4 usecs
Aug 29 10:25:47 localhost kernel: [drm] ring test on 1 succeeded in 1 usecs
Aug 29 10:25:47 localhost kernel: [drm] ring test on 2 succeeded in 1 usecs
Aug 29 10:25:47 localhost kernel: [drm] ring test on 3 succeeded in 2 usecs
Aug 29 10:25:47 localhost kernel: [drm] ring test on 4 succeeded in 1 usecs
Aug 29 10:25:47 localhost kernel: [drm] ring test on 5 succeeded in 2 usecs
Aug 29 10:25:47 localhost kernel: [drm] UVD initialized successfully.
Aug 29 10:25:57 localhost kernel: radeon 0000:01:00.0: ring 0 stalled for more than 10003msec
Aug 29 10:25:57 localhost kernel: radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000001d5c59 last fence id 0x00000000001d5b12 on ring 0)
Aug 29 10:25:57 localhost kernel: [drm:r600_ib_test] *ERROR* radeon: fence wait failed (-35).
Aug 29 10:25:57 localhost kernel: [drm:radeon_ib_ring_tests] *ERROR* radeon: failed testing IB on GFX ring (-35).
Aug 29 10:25:57 localhost kernel: radeon 0000:01:00.0: ib ring test failed (-35).
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: GPU softreset: 0x00000048
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS               = 0xA0003028
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS_SE0           = 0x00000006
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS_SE1           = 0x00000006
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   SRBM_STATUS               = 0x200000C0
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   SRBM_STATUS2              = 0x00000000
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00010000
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00000002
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x80010243
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_00D834_DMA_STATUS_REG   = 0x44C83D57
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00000000
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: GRBM_SOFT_RESET=0x0000DDFF
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00000100
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS               = 0x00003028
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS_SE0           = 0x00000006
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   GRBM_STATUS_SE1           = 0x00000006
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   SRBM_STATUS               = 0x200000C0
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   SRBM_STATUS2              = 0x00000000
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00000000
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00000000
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x00000000
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0:   R_00D834_DMA_STATUS_REG   = 0x44C83D57
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: GPU reset succeeded, trying to resume
Aug 29 10:25:58 localhost kernel: [drm] probing gen 2 caps for device 8086:c01 = 261ad03/e
Aug 29 10:25:58 localhost kernel: [drm] PCIE gen 3 link speeds already enabled
Aug 29 10:25:58 localhost kernel: [drm] PCIE GART of 1024M enabled (table at 0x0000000000276000).
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: WB enabled
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000080000c00 and cpu addr 0xffff8800da15ec00
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: fence driver on ring 1 use gpu addr 0x0000000080000c04 and cpu addr 0xffff8800da15ec04
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: fence driver on ring 2 use gpu addr 0x0000000080000c08 and cpu addr 0xffff8800da15ec08
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000080000c0c and cpu addr 0xffff8800da15ec0c
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: fence driver on ring 4 use gpu addr 0x0000000080000c10 and cpu addr 0xffff8800da15ec10
Aug 29 10:25:58 localhost kernel: radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 0x0000000000075a18 and cpu addr 0xffffc90004fb5a18
Aug 29 10:25:58 localhost kernel: [drm] ring test on 0 succeeded in 4 usecs
Aug 29 10:25:58 localhost kernel: [drm] ring test on 1 succeeded in 1 usecs
Aug 29 10:25:58 localhost kernel: [drm] ring test on 2 succeeded in 1 usecs
Aug 29 10:25:58 localhost kernel: [drm] ring test on 3 succeeded in 2 usecs
Aug 29 10:25:58 localhost kernel: [drm] ring test on 4 succeeded in 1 usecs
Aug 29 10:25:59 localhost kernel: [drm] ring test on 5 succeeded in 2 usecs
Aug 29 10:25:59 localhost kernel: [drm] UVD initialized successfully.
Aug 29 10:25:59 localhost kernel: [drm] ib test on ring 0 succeeded in 0 usecs
Aug 29 10:25:59 localhost kernel: [drm] ib test on ring 1 succeeded in 0 usecs
Aug 29 10:25:59 localhost kernel: [drm] ib test on ring 2 succeeded in 0 usecs
Aug 29 10:25:59 localhost kernel: [drm] ib test on ring 3 succeeded in 0 usecs
Aug 29 10:25:59 localhost kernel: [drm] ib test on ring 4 succeeded in 0 usecs
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0: ring 5 stalled for more than 10000msec
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0: GPU lockup (waiting for 0x0000000000000004 last fence id 0x0000000000000002 on ring 5)
Aug 29 10:26:09 localhost kernel: [drm:uvd_v1_0_ib_test] *ERROR* radeon: fence wait failed (-35).
Aug 29 10:26:09 localhost kernel: [drm:radeon_ib_ring_tests] *ERROR* radeon: failed testing IB on ring 5 (-35).
Aug 29 10:26:09 localhost kernel: [drm:radeon_pm_resume_dpm] *ERROR* radeon: dpm resume failed
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0: GPU fault detected: 146 0x04a44804
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000111A5
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x04048004
Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70053, read from TC (72)
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0: GPU fault detected: 146 0x01443d04
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0001118A
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0403D004
Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70026, read from DMA1 (61)
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0: GPU fault detected: 146 0x01443d04
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00011199
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0403D004
Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70041, read from DMA1 (61)
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0: GPU fault detected: 146 0x01643d04
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000111A3
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0403D004
Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70051, read from DMA1 (61)
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0: GPU fault detected: 146 0x01643d04
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000111A7
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0403D004
Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70055, read from DMA1 (61)
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0: GPU fault detected: 146 0x03c44404
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0001119E
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x04044004
Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70046, read from TC (68)
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0: GPU fault detected: 146 0x01a4c404
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0001118D
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x040C4004
Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70029, read from TC (196)
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0: GPU fault detected: 146 0x0424c804
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000111A1
Aug 29 10:26:09 localhost kernel: radeon 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x040C8004
Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70049, read from TC (200)

The last three lines repeat for a few minutes with different ADDR and STATUS values. I've cut the log there.
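For anyone reading the repeating fault lines above, here is a minimal Python sketch that pulls the "VM fault" lines out of such a log and counts them per client block (TC, DMA1, and so on). The names `parse_vm_faults` and `summarize` are made up for this example, and the regular expression is only fitted to the line format shown in this log, so treat it as an assumption rather than a description of the driver's full output.

```python
import re
from collections import Counter

# Matches lines such as:
#   VM fault (0x04, vmid 2) at page 70053, read from TC (72)
# The pattern is fitted to the lines in this thread (assumption).
FAULT_RE = re.compile(
    r"VM fault \((?P<code>0x[0-9a-fA-F]+), vmid (?P<vmid>\d+)\) "
    r"at page (?P<page>\d+), (?P<access>read|write) from "
    r"(?P<client>\w+) \((?P<client_id>\d+)\)"
)

def parse_vm_faults(lines):
    """Return one dict per 'VM fault' line found in the given log lines."""
    faults = []
    for line in lines:
        m = FAULT_RE.search(line)
        if m:
            faults.append({
                "vmid": int(m.group("vmid")),
                "page": int(m.group("page")),
                "access": m.group("access"),
                "client": m.group("client"),
            })
    return faults

def summarize(faults):
    """Count faults per client block name."""
    return Counter(f["client"] for f in faults)

sample = [
    "Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70053, read from TC (72)",
    "Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70026, read from DMA1 (61)",
    "Aug 29 10:26:09 localhost kernel: VM fault (0x04, vmid 2) at page 70041, read from DMA1 (61)",
]
faults = parse_vm_faults(sample)
print(summarize(faults))

# The page number is the decimal form of VM_CONTEXT1_PROTECTION_FAULT_ADDR:
# page 70053 corresponds to 0x000111A5 in the log.
print(hex(faults[0]["page"]))  # 0x111a5
```

In this log all faults come from vmid 2 and fall in a narrow page range (roughly 70026 to 70055), which a summary like this makes easier to see than the raw lines.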

Please help me to fix this. Thanks in advance.

Last edited by vanquish (2014-09-05 17:57:05)

#2 2014-08-30 12:14:53

clfarron4
Member
From: London, UK
Registered: 2013-06-28
Posts: 2,163

Re: [SOLVED][ATI] Driver and/or Kernel Issue?

vanquish wrote:

System specs:
GPU ATI R9 270x
CPU Intel Haswell i5-4440
Board MSI H87-G43
RAM 8 GB Mushkin DDR-3/1333
Samsung SSD 830

What's the output of lspci? IIRC, that processor also has an Integrated Intel® HD Graphics 4600.


Claire is fine.
Problems? I have dysgraphia, so clear and concise please.
My public GPG key for package signing
My x86_64 package repository

#3 2014-08-31 12:38:32

Devcon
Member
Registered: 2010-03-07
Posts: 9

Re: [SOLVED][ATI] Driver and/or Kernel Issue?

Ironically I've begun to see a very similar problem on my desktop.  My hardware is a bit different:

GPU: ATI R9 280x
CPU: AMD Phenom 1100T
Gigabyte Mobo
OSS Drivers: Mesa 10.2.6/libdrm 2.4.54
Kernel: 3.16.1

I received the following error last time:

Aug 30 21:52:23 DCRIG kernel: \x09\x09power level 2    sclk: 100000 mclk: 150000 vddc: 1144 vddci: 875 pcie gen: 2
Aug 30 21:52:23 DCRIG kernel: \x09\x09power level 3    sclk: 110000 mclk: 150000 vddc: 1200 vddci: 875 pcie gen: 2
Aug 30 21:52:23 DCRIG kernel: \x09status: r 
Aug 30 21:52:23 DCRIG kernel: [drm:si_dpm_set_power_state] *ERROR* si_set_sw_state failed
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f02a004
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F8
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:23 DCRIG kernel: VM fault (0x04, vmid 1) at page 38904, read from CB (160)
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f029004
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F9
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:23 DCRIG kernel: VM fault (0x04, vmid 1) at page 38905, read from CB (160)
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f02a004
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F8
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:23 DCRIG kernel: VM fault (0x04, vmid 1) at page 38904, read from CB (160)
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f029004
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F9
Aug 30 21:52:23 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:23 DCRIG kernel: VM fault (0x04, vmid 1) at page 38905, read from CB (160)
Aug 30 21:52:24 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f02a004
Aug 30 21:52:24 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F8
Aug 30 21:52:24 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:24 DCRIG kernel: VM fault (0x04, vmid 1) at page 38904, read from CB (160)
Aug 30 21:52:24 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f029004
Aug 30 21:52:24 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F9
Aug 30 21:52:24 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:24 DCRIG kernel: VM fault (0x04, vmid 1) at page 38905, read from CB (160)
Aug 30 21:52:24 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f22a004
Aug 30 21:52:24 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00000000
Aug 30 21:52:24 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x02084004
Aug 30 21:52:24 DCRIG kernel: VM fault (0x04, vmid 1) at page 0, read from TC (132)
Aug 30 21:52:25 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f02a004
Aug 30 21:52:25 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F8
Aug 30 21:52:25 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:25 DCRIG kernel: VM fault (0x04, vmid 1) at page 38904, read from CB (160)
Aug 30 21:52:25 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f029004
Aug 30 21:52:25 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F9
Aug 30 21:52:25 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:25 DCRIG kernel: VM fault (0x04, vmid 1) at page 38905, read from CB (160)
Aug 30 21:52:26 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f02a004
Aug 30 21:52:26 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F8
Aug 30 21:52:26 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:26 DCRIG kernel: VM fault (0x04, vmid 1) at page 38904, read from CB (160)
Aug 30 21:52:26 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f029004
Aug 30 21:52:26 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F9
Aug 30 21:52:26 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:26 DCRIG kernel: VM fault (0x04, vmid 1) at page 38905, read from CB (160)
Aug 30 21:52:27 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f02a004
Aug 30 21:52:27 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F8
Aug 30 21:52:27 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:27 DCRIG kernel: VM fault (0x04, vmid 1) at page 38904, read from CB (160)
Aug 30 21:52:27 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f029004
Aug 30 21:52:27 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F9
Aug 30 21:52:27 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:27 DCRIG kernel: VM fault (0x04, vmid 1) at page 38905, read from CB (160)
Aug 30 21:52:28 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f02a004
Aug 30 21:52:28 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F8
Aug 30 21:52:28 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020A0004
Aug 30 21:52:28 DCRIG kernel: VM fault (0x04, vmid 1) at page 38904, read from CB (160)
Aug 30 21:52:28 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f029004
Aug 30 21:52:28 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F8
Aug 30 21:52:28 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x02050004
Aug 30 21:52:28 DCRIG kernel: VM fault (0x04, vmid 1) at page 38904, read from CB (80)
Aug 30 21:52:28 DCRIG kernel: radeon 0000:07:00.0: GPU fault detected: 146 0x0f225004
Aug 30 21:52:28 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x000097F8
Aug 30 21:52:28 DCRIG kernel: radeon 0000:07:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x02088004
Aug 30 21:52:28 DCRIG kernel: VM fault (0x04, vmid 1) at page 38904, read from TC (136)

As the message implies, it seems to occur when the kernel changes power states. That matches what I see in use, as the hard locks tend to occur most frequently in Steam or when I encounter rich web content. I'm beginning to suspect a hardware fault, but I'm going to try disabling DPM and test.

Last edited by Devcon (2014-08-31 12:49:00)

#4 2014-08-31 15:35:22

vanquish
Member
Registered: 2013-12-28
Posts: 49

Re: [SOLVED][ATI] Driver and/or Kernel Issue?

Thank you guys for helping me.

@ clfarron4:

This is my gaming machine/workstation. As my other workstation is broken, I decided to install Linux on this machine. Under Windows everything works just fine. The only game running on this machine is Diablo III, which should run under Linux with this ATI card. I doubt the game will run on the integrated GPU. I want to try to solve this issue. Maybe I'll be able to purge Windows from this machine. oO

@ Devcon:

Power management was my first thought too.

I hope it's not a hardware issue. -.-

If more input is necessary, please let me know.

cat /sys/kernel/debug/dri/64/radeon_pm_info
uvd    vclk: 0 dclk: 0
power level 0    sclk: 30000 mclk: 15000 vddc: 900 vddci: 850 pcie gen: 3

I'll test dpm off too. But this is not a good solution. -.-
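For reading the radeon_pm_info output above, here is a minimal Python sketch that parses one "power level" line. It assumes the clock values are reported in units of 10 kHz (so `sclk: 30000` would be 300 MHz) and the voltages in millivolts. Neither unit is stated in the output itself, so both are assumptions. The name `parse_power_level` is made up for this example.

```python
import re

# Matches lines such as:
#   power level 0    sclk: 30000 mclk: 15000 vddc: 900 vddci: 850 pcie gen: 3
POWER_RE = re.compile(
    r"power level (?P<level>\d+)\s+"
    r"sclk: (?P<sclk>\d+) mclk: (?P<mclk>\d+) "
    r"vddc: (?P<vddc>\d+) vddci: (?P<vddci>\d+) "
    r"pcie gen: (?P<pcie_gen>\d+)"
)

def parse_power_level(line):
    """Parse one 'power level' line; return None if the line does not match."""
    m = POWER_RE.search(line)
    if m is None:
        return None
    return {
        "level": int(m.group("level")),
        # ASSUMPTION: clocks are in 10 kHz units, so divide by 100 for MHz.
        "sclk_mhz": int(m.group("sclk")) / 100,
        "mclk_mhz": int(m.group("mclk")) / 100,
        # ASSUMPTION: voltages are in millivolts.
        "vddc_mv": int(m.group("vddc")),
        "vddci_mv": int(m.group("vddci")),
        "pcie_gen": int(m.group("pcie_gen")),
    }

line = "power level 0    sclk: 30000 mclk: 15000 vddc: 900 vddci: 850 pcie gen: 3"
state = parse_power_level(line)
print(state)
```

Under those assumptions the line above reads as a low-clock state (300 MHz core, 150 MHz memory). The same function also matches the `power level` lines in Devcon's log, since it searches anywhere in the line and ignores the `\x09` prefix.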

#5 2014-08-31 18:13:29

clfarron4
Member
From: London, UK
Registered: 2013-06-28
Posts: 2,163

Re: [SOLVED][ATI] Driver and/or Kernel Issue?

I'll ask again.

clfarron4 wrote:
vanquish wrote:

System specs:
GPU ATI R9 270x
CPU Intel Haswell i5-4440
Board MSI H87-G43
RAM 8 GB Mushkin DDR-3/1333
Samsung SSD 830

What's the output of lspci? IIRC, that processor also has an Integrated Intel® HD Graphics 4600.

Also, if there is an integrated Intel card, did you disable it?

Last edited by clfarron4 (2014-08-31 18:18:02)


#6 2014-08-31 21:54:24

vanquish
Member
Registered: 2013-12-28
Posts: 49

Re: [SOLVED][ATI] Driver and/or Kernel Issue?

Sorry, I've forgotten it.

00:00.0 Host bridge: Intel Corporation 4th Gen Core Processor DRAM Controller (rev 06)
00:01.0 PCI bridge: Intel Corporation Xeon E3-1200 v3/4th Gen Core Processor PCI Express x16 Controller (rev 06)
00:14.0 USB controller: Intel Corporation 8 Series/C220 Series Chipset Family USB xHCI (rev 05)
00:16.0 Communication controller: Intel Corporation 8 Series/C220 Series Chipset Family MEI Controller #1 (rev 04)
00:1a.0 USB controller: Intel Corporation 8 Series/C220 Series Chipset Family USB EHCI #2 (rev 05)
00:1b.0 Audio device: Intel Corporation 8 Series/C220 Series Chipset High Definition Audio Controller (rev 05)
00:1c.0 PCI bridge: Intel Corporation 8 Series/C220 Series Chipset Family PCI Express Root Port #1 (rev d5)
00:1c.1 PCI bridge: Intel Corporation 8 Series/C220 Series Chipset Family PCI Express Root Port #2 (rev d5)
00:1c.3 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d5)
00:1d.0 USB controller: Intel Corporation 8 Series/C220 Series Chipset Family USB EHCI #1 (rev 05)
00:1f.0 ISA bridge: Intel Corporation H87 Express LPC Controller (rev 05)
00:1f.2 SATA controller: Intel Corporation 8 Series/C220 Series Chipset Family 6-port SATA Controller 1 [AHCI mode] (rev 05)
00:1f.3 SMBus: Intel Corporation 8 Series/C220 Series Chipset Family SMBus Controller (rev 05)
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Curacao XT [Radeon R9 270X]
01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Cape Verde/Pitcairn HDMI Audio [Radeon HD 7700/7800 Series]
03:00.0 Ethernet controller: Qualcomm Atheros Killer E220x Gigabit Ethernet Controller (rev 13)
04:00.0 PCI bridge: ASMedia Technology Inc. ASM1083/1085 PCIe to PCI Bridge (rev 03)

If the -vv option is needed, let me know for which device.

Integrated GPU is disabled.

#7 2014-08-31 22:32:59

clfarron4
Member
From: London, UK
Registered: 2013-06-28
Posts: 2,163
Website

Re: [SOLVED][ATI] Driver and/or Kernel Issue?

I think the best option for this card would be to try AMD Catalyst as I'm not entirely sure whether the Open Source driver has support for Volcanic Islands cards yet.


#8 2014-08-31 22:33:44

Morn
Member
Registered: 2012-09-02
Posts: 886

Re: [SOLVED][ATI] Driver and/or Kernel Issue?

I think this is the same freezing issue with the Radeon open source driver that has been reported elsewhere (https://bbs.archlinux.org/viewtopic.php?id=167782). Downgrading to Xorg 1.15 and installing Catalyst fixed it for me. Downgrading to Mesa 10.1.4 might also work.

#9 2014-09-01 20:20:25

vanquish
Member
Registered: 2013-12-28
Posts: 49

Re: [SOLVED][ATI] Driver and/or Kernel Issue?

I've downgraded to Xorg 1.15 and installed Catalyst. I'll report back whether it solves my problems.
Thank you all for your help. :)

EDIT:

After a few days of testing I can say the crashes are gone.

Last edited by vanquish (2014-09-05 17:56:45)
