You are not logged in.
Pages: 1
I have a Sony Vaio VPCEC1S1E laptop, around 9 years old.
Recently lxdm stopped working. I get a corrupt screen with a mouse pointer, so I can't log in. lxdm.log says:
radeon: The kernel rejected CS, see dmesg for more information (-16).
radeon: The kernel rejected CS, see dmesg for more information (-16).
multiple times.
dmesg says:
...
[ 88.613523] radeon 0000:01:00.0: ring 0 stalled for more than 34460msec
[ 88.613531] radeon 0000:01:00.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x0000000000000015 on ring 0)
[ 89.120193] radeon 0000:01:00.0: ring 0 stalled for more than 34966msec
[ 89.120200] radeon 0000:01:00.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x0000000000000015 on ring 0)
[ 89.626832] radeon 0000:01:00.0: ring 0 stalled for more than 35473msec
[ 89.626840] radeon 0000:01:00.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x0000000000000015 on ring 0)
[ 90.120011] radeon 0000:01:00.0: Saved 562 dwords of commands on ring 0.
[ 90.120038] radeon 0000:01:00.0: GPU softreset: 0x00000019
[ 90.120042] radeon 0000:01:00.0: GRBM_STATUS = 0xE5703CA0
[ 90.120046] radeon 0000:01:00.0: GRBM_STATUS_SE0 = 0xFC000007
[ 90.120050] radeon 0000:01:00.0: GRBM_STATUS_SE1 = 0x00000007
[ 90.120054] radeon 0000:01:00.0: SRBM_STATUS = 0x200000C0
[ 90.120058] radeon 0000:01:00.0: SRBM_STATUS2 = 0x00000000
[ 90.120062] radeon 0000:01:00.0: R_008674_CP_STALLED_STAT1 = 0x01000000
[ 90.120066] radeon 0000:01:00.0: R_008678_CP_STALLED_STAT2 = 0x00011000
[ 90.120070] radeon 0000:01:00.0: R_00867C_CP_BUSY_STAT = 0x00068406
[ 90.120074] radeon 0000:01:00.0: R_008680_CP_STAT = 0x80878647
[ 90.120078] radeon 0000:01:00.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
[ 90.129565] radeon 0000:01:00.0: GRBM_SOFT_RESET=0x00007F6B
[ 90.129622] radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00000100
[ 90.130793] radeon 0000:01:00.0: GRBM_STATUS = 0x00003828
[ 90.130797] radeon 0000:01:00.0: GRBM_STATUS_SE0 = 0x00000007
[ 90.130801] radeon 0000:01:00.0: GRBM_STATUS_SE1 = 0x00000007
[ 90.130805] radeon 0000:01:00.0: SRBM_STATUS = 0x200000C0
[ 90.130809] radeon 0000:01:00.0: SRBM_STATUS2 = 0x00000000
[ 90.130813] radeon 0000:01:00.0: R_008674_CP_STALLED_STAT1 = 0x00000000
[ 90.130817] radeon 0000:01:00.0: R_008678_CP_STALLED_STAT2 = 0x00000000
[ 90.130821] radeon 0000:01:00.0: R_00867C_CP_BUSY_STAT = 0x00000000
[ 90.130825] radeon 0000:01:00.0: R_008680_CP_STAT = 0x00000000
[ 90.130829] radeon 0000:01:00.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
[ 90.130861] radeon 0000:01:00.0: GPU reset succeeded, trying to resume
[ 90.165969] [drm] PCIE GART of 1024M enabled (table at 0x000000000014C000).
[ 90.166114] radeon 0000:01:00.0: WB enabled
[ 90.166119] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000040000c00 and cpu addr 0x000000002fd020e3
[ 90.166122] radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000040000c0c and cpu addr 0x000000006cfa5204
[ 90.166500] radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 0x000000000005c418 and cpu addr 0x00000000574d587a
[ 90.183286] [drm] ring test on 0 succeeded in 1 usecs
[ 90.183296] [drm] ring test on 3 succeeded in 3 usecs
[ 90.381088] [drm] ring test on 5 succeeded in 1 usecs
[ 90.381094] [drm] UVD initialized successfully.
[ 92.613454] [drm:r600_ib_test [radeon]] *ERROR* radeon: fence wait timed out.
[ 92.613503] [drm:radeon_ib_ring_tests [radeon]] *ERROR* radeon: failed testing IB on GFX ring (-110).
lspci -v for the graphics controller says:
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Madison [Mobility Radeon HD 5650/5750 / 6530M/6550M] (prog-if 00 [VGA controller])
Subsystem: Sony Corporation Mobility Radeon HD 5650
Flags: bus master, fast devsel, latency 0, IRQ 29
Memory at e0000000 (64-bit, prefetchable) [size=256M]
Memory at f0020000 (64-bit, non-prefetchable) [size=128K]
I/O ports at d000 [size=256]
Expansion ROM at 000c0000 [disabled] [size=128K]
Capabilities: [50] Power Management version 3
Capabilities: [58] Express Legacy Endpoint, MSI 00
Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
Kernel driver in use: radeon
Kernel modules: radeon
Any ideas how I could get the desktop working again?
Offline
I have done some more investigation, and got my desktop working again. I noticed it is possible to downgrade to a particular date:
https://wiki.archlinux.org/index.php/Ar … cific_date
After some trial and error I discovered the error appeared on the 9th June. On that date the following packages on my system were updated:
Packages (14) eric-18.06-1 eric-common-18.06-1 lib32-glu-9.0.0-4 lib32-libxdamage-1.1.4-3 lib32-sdl_ttf-2.0.11-4 mesa-18.1.1-1 ogre-1.11.0-2 ogre-docs-1.11.0-2 python-pytoml-0.1.16-1 python2-psutil-5.4.6-1 python2-pytoml-0.1.16-1
xorg-server-1.20.0-6 xorg-server-common-1.20.0-6 xorg-xkbcomp-1.4.2-1
I assume it was either xorg-server-1.20.0-6 or xorg-server-common-1.20.0-6 that caused the problem, or possibly mesa. This doesn't make sense as the errors seem to come from the radeon driver.
Anyway, what do I do next?
Offline
I am experiencing the same problem and appreciate you posting your issue.
A temporary solution that worked for me was to downgrade mesa. mesa-18.1.1-1-x86_64.pkg.tar.xz is the update that broke my system so I downgraded to mesa-18.0.4.1-x86_64.pkg.tar.xz.
pacman -U /var/cache/pacman/pkg/mesa-18.0.4-1-x86_64.pkg.tar.xz
Then, I added the following to /etc/pacman.conf to prevent the package from updating until a permanent solution is discovered.
IgnorePkg = mesa
Offline
I've got the same kind of problem (desktop, not a laptop): Since mesa-18.1.x, I just get a corrupted screen (green/pink checkerboard with lots of weird artifacts, but a normal looking, moveable mouse pointer) after booting, and I never get to see lxdm (I'm not using lightdm). My systemlog also has a lot of ‘ring 0 stalled’ and ‘GPU lockups’, and the only thing that has helped so far was downgrading back to mesa-18.0.4 every time I tried out a new mesa update, hoping the issue would fix itself.
My card is a Radeon HD6870.
Last edited by Lustique (2018-08-15 11:32:26)
Offline
After running upgrades last night, we get the same types of irregularities. The system is semi-usable and sometimes recovers, but X segfaults occasionally, and the screen is corrupted on boot.
lspci for the video card:
00:01.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] BeaverCreek [Radeon HD 6530D] (prog-if 00 [VGA controller])
Subsystem: Acer Incorporated [ALI] BeaverCreek [Radeon HD 6530D]
Flags: bus master, fast devsel, latency 0, IRQ 38
Memory at c0000000 (32-bit, prefetchable) [size=256M]
I/O ports at f000 [size=256]
Memory at fef00000 (32-bit, non-prefetchable) [size=256K]
[virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
Capabilities: [50] Power Management version 3
Capabilities: [58] Express Root Complex Integrated Endpoint, MSI 00
Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
Kernel driver in use: radeon
Kernel modules: radeon
Snippets from dmesg:
[ 17.444407] [drm] Initialized radeon 2.50.0 20080528 for 0000:00:01.0 on minor 0
[ 29.854597] radeon 0000:00:01.0: evergreen_cs_track_validate_texture:855 texture bo too small (layer size 8294400, offset 0, max layer 1, depth 1, bo size 4096) (1920 1080)
[ 29.854672] [drm:radeon_cs_ioctl [radeon]] *ERROR* Invalid command stream !
[ 33.301071] [drm:radeon_cs_parser_relocs [radeon]] *ERROR* gem object lookup failed 0xf
[ 33.301101] [drm:radeon_cs_ioctl [radeon]] *ERROR* Failed to parse relocation -2!
[ 44.107525] radeon 0000:00:01.0: ring 0 stalled for more than 10154msec
[ 44.107542] radeon 0000:00:01.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x0000000000000019 on ring 0)
[ 44.614205] radeon 0000:00:01.0: ring 0 stalled for more than 10660msec
[ 44.614221] radeon 0000:00:01.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x000000000000001a on ring 0)
[ 45.120883] radeon 0000:00:01.0: ring 0 stalled for more than 11167msec
[ 45.120900] radeon 0000:00:01.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x000000000000001a on ring 0)
[ 45.627569] radeon 0000:00:01.0: ring 0 stalled for more than 11674msec
[ 45.627585] radeon 0000:00:01.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x000000000000001b on ring 0)
and
[ 141.898999] radeon 0000:00:01.0: ring 0 stalled for more than 107940msec
[ 141.899015] radeon 0000:00:01.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x000000000000007c on ring 0)
[ 142.397930] radeon 0000:00:01.0: failed to get a new IB (-35)
[ 142.398005] [drm:radeon_cs_ioctl [radeon]] *ERROR* Failed to get ib !
[ 142.407297] radeon 0000:00:01.0: Saved 3858 dwords of commands on ring 0.
[ 142.407312] radeon 0000:00:01.0: GPU softreset: 0x00000019
[ 142.407316] radeon 0000:00:01.0: GRBM_STATUS = 0xE5703CA0
[ 142.407320] radeon 0000:00:01.0: GRBM_STATUS_SE0 = 0xFC000007
[ 142.407324] radeon 0000:00:01.0: GRBM_STATUS_SE1 = 0x00000007
[ 142.407328] radeon 0000:00:01.0: SRBM_STATUS = 0x20000840
[ 142.407331] radeon 0000:00:01.0: SRBM_STATUS2 = 0x00000000
[ 142.407336] radeon 0000:00:01.0: R_008674_CP_STALLED_STAT1 = 0x01000000
[ 142.407340] radeon 0000:00:01.0: R_008678_CP_STALLED_STAT2 = 0x00011000
[ 142.407344] radeon 0000:00:01.0: R_00867C_CP_BUSY_STAT = 0x00068406
[ 142.407348] radeon 0000:00:01.0: R_008680_CP_STAT = 0x80878647
[ 142.407352] radeon 0000:00:01.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
[ 142.416277] radeon 0000:00:01.0: GRBM_SOFT_RESET=0x00007F6B
[ 142.416333] radeon 0000:00:01.0: SRBM_SOFT_RESET=0x00000100
[ 142.417549] radeon 0000:00:01.0: GRBM_STATUS = 0x00003828
[ 142.417553] radeon 0000:00:01.0: GRBM_STATUS_SE0 = 0x00000007
[ 142.417556] radeon 0000:00:01.0: GRBM_STATUS_SE1 = 0x00000007
[ 142.417560] radeon 0000:00:01.0: SRBM_STATUS = 0x20000040
[ 142.417564] radeon 0000:00:01.0: SRBM_STATUS2 = 0x00000000
[ 142.417567] radeon 0000:00:01.0: R_008674_CP_STALLED_STAT1 = 0x00000000
[ 142.417571] radeon 0000:00:01.0: R_008678_CP_STALLED_STAT2 = 0x00000000
[ 142.417574] radeon 0000:00:01.0: R_00867C_CP_BUSY_STAT = 0x00000000
[ 142.417578] radeon 0000:00:01.0: R_008680_CP_STAT = 0x00000000
[ 142.417582] radeon 0000:00:01.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
[ 142.417592] radeon 0000:00:01.0: GPU reset succeeded, trying to resume
[ 142.432919] [drm] Found smc ucode version: 0x00011100
[ 142.441624] [drm] PCIE GART of 1024M enabled (table at 0x0000000000162000).
[ 142.441787] radeon 0000:00:01.0: WB enabled
[ 142.441795] radeon 0000:00:01.0: fence driver on ring 0 use gpu addr 0x0000000020000c00 and cpu addr 0x00000000f9ab3738
[ 142.441800] radeon 0000:00:01.0: fence driver on ring 3 use gpu addr 0x0000000020000c0c and cpu addr 0x000000008ded1b39
[ 142.442239] radeon 0000:00:01.0: fence driver on ring 5 use gpu addr 0x0000000000072118 and cpu addr 0x0000000063f6dfda
[ 142.458395] [drm] ring test on 0 succeeded in 1 usecs
[ 142.458403] [drm] ring test on 3 succeeded in 2 usecs
[ 142.504461] [drm] ring test on 5 succeeded in 1 usecs
[ 142.524478] [drm] UVD initialized successfully.
[ 143.552438] [drm:r600_ib_test [radeon]] *ERROR* radeon: fence wait timed out.
[ 143.552501] [drm:radeon_ib_ring_tests [radeon]] *ERROR* radeon: failed testing IB on GFX ring (-110).
It is after the soft reset that things start to be usable.
There was a minor upgrade of Xorg/Mesa last night, but the significant change was linux 4.17.11 -> 4.18.5. This system was last updated on Aug 4.
Offline
I thought I would try to upgrade and see if it works. It does now, so looks like the bug was fixed. Mesa currently is 18.2.4-1
Offline
Pages: 1