You are not logged in.

#1 2018-06-13 20:30:40

paul_huffton
Member
Registered: 2017-12-28
Posts: 3

lxdm stopped working

I have a Sony Vaio VPCEC1S1E laptop, around 9 years old.

Recently lxdm stopped working. I get a corrupt screen with a mouse pointer, so I can't log in. lxdm.log says:

radeon: The kernel rejected CS, see dmesg for more information (-16).
radeon: The kernel rejected CS, see dmesg for more information (-16).

multiple times.

dmesg says:

...
[   88.613523] radeon 0000:01:00.0: ring 0 stalled for more than 34460msec
[   88.613531] radeon 0000:01:00.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x0000000000000015 on ring 0)
[   89.120193] radeon 0000:01:00.0: ring 0 stalled for more than 34966msec
[   89.120200] radeon 0000:01:00.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x0000000000000015 on ring 0)
[   89.626832] radeon 0000:01:00.0: ring 0 stalled for more than 35473msec
[   89.626840] radeon 0000:01:00.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x0000000000000015 on ring 0)
[   90.120011] radeon 0000:01:00.0: Saved 562 dwords of commands on ring 0.
[   90.120038] radeon 0000:01:00.0: GPU softreset: 0x00000019
[   90.120042] radeon 0000:01:00.0:   GRBM_STATUS               = 0xE5703CA0
[   90.120046] radeon 0000:01:00.0:   GRBM_STATUS_SE0           = 0xFC000007
[   90.120050] radeon 0000:01:00.0:   GRBM_STATUS_SE1           = 0x00000007
[   90.120054] radeon 0000:01:00.0:   SRBM_STATUS               = 0x200000C0
[   90.120058] radeon 0000:01:00.0:   SRBM_STATUS2              = 0x00000000
[   90.120062] radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x01000000
[   90.120066] radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00011000
[   90.120070] radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00068406
[   90.120074] radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x80878647
[   90.120078] radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
[   90.129565] radeon 0000:01:00.0: GRBM_SOFT_RESET=0x00007F6B
[   90.129622] radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00000100
[   90.130793] radeon 0000:01:00.0:   GRBM_STATUS               = 0x00003828
[   90.130797] radeon 0000:01:00.0:   GRBM_STATUS_SE0           = 0x00000007
[   90.130801] radeon 0000:01:00.0:   GRBM_STATUS_SE1           = 0x00000007
[   90.130805] radeon 0000:01:00.0:   SRBM_STATUS               = 0x200000C0
[   90.130809] radeon 0000:01:00.0:   SRBM_STATUS2              = 0x00000000
[   90.130813] radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
[   90.130817] radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00000000
[   90.130821] radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00000000
[   90.130825] radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x00000000
[   90.130829] radeon 0000:01:00.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
[   90.130861] radeon 0000:01:00.0: GPU reset succeeded, trying to resume
[   90.165969] [drm] PCIE GART of 1024M enabled (table at 0x000000000014C000).
[   90.166114] radeon 0000:01:00.0: WB enabled
[   90.166119] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000040000c00 and cpu addr 0x000000002fd020e3
[   90.166122] radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000040000c0c and cpu addr 0x000000006cfa5204
[   90.166500] radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 0x000000000005c418 and cpu addr 0x00000000574d587a
[   90.183286] [drm] ring test on 0 succeeded in 1 usecs
[   90.183296] [drm] ring test on 3 succeeded in 3 usecs
[   90.381088] [drm] ring test on 5 succeeded in 1 usecs
[   90.381094] [drm] UVD initialized successfully.
[   92.613454] [drm:r600_ib_test [radeon]] *ERROR* radeon: fence wait timed out.
[   92.613503] [drm:radeon_ib_ring_tests [radeon]] *ERROR* radeon: failed testing IB on GFX ring (-110).

lspci -v for the graphics controller says:

01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Madison [Mobility Radeon HD 5650/5750 / 6530M/6550M] (prog-if 00 [VGA controller])
	Subsystem: Sony Corporation Mobility Radeon HD 5650
	Flags: bus master, fast devsel, latency 0, IRQ 29
	Memory at e0000000 (64-bit, prefetchable) [size=256M]
	Memory at f0020000 (64-bit, non-prefetchable) [size=128K]
	I/O ports at d000 [size=256]
	Expansion ROM at 000c0000 [disabled] [size=128K]
	Capabilities: [50] Power Management version 3
	Capabilities: [58] Express Legacy Endpoint, MSI 00
	Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
	Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
	Kernel driver in use: radeon
	Kernel modules: radeon

Any ideas how I could get the desktop working again?

Offline

#2 2018-06-22 20:03:39

paul_huffton
Member
Registered: 2017-12-28
Posts: 3

Re: lxdm stopped working

I have done some more investigation, and got my desktop working again. I noticed it is possible to downgrade to a particular date:

https://wiki.archlinux.org/index.php/Ar … cific_date

After some trial and error I discovered the error appeared on the 9th June. On that date the following packages on my system were updated:

Packages (14) eric-18.06-1  eric-common-18.06-1  lib32-glu-9.0.0-4  lib32-libxdamage-1.1.4-3  lib32-sdl_ttf-2.0.11-4  mesa-18.1.1-1  ogre-1.11.0-2  ogre-docs-1.11.0-2  python-pytoml-0.1.16-1  python2-psutil-5.4.6-1  python2-pytoml-0.1.16-1
              xorg-server-1.20.0-6  xorg-server-common-1.20.0-6  xorg-xkbcomp-1.4.2-1

I assume it was either xorg-server-1.20.0-6 or xorg-server-common-1.20.0-6 that caused the problem, or possibly mesa. This doesn't make sense as the errors seem to come from the radeon driver.

Anyway, what do I do next?

Offline

#3 2018-06-24 20:53:41

biz
Member
Registered: 2018-06-24
Posts: 1

Re: lxdm stopped working

I am experiencing the same problem and appreciate you posting your issue.

A temporary solution that worked for me was to downgrade mesa.  mesa-18.1.1-1-x86_64.pkg.tar.xz is the update that broke my system so I downgraded to mesa-18.0.4.1-x86_64.pkg.tar.xz.

pacman -U /var/cache/pacman/pkg/mesa-18.0.4-1-x86_64.pkg.tar.xz

Then, I added the following to /etc/pacman.conf to prevent the package from updating until a permanent solution is discovered.

IgnorePkg  = mesa

Offline

#4 2018-08-15 11:31:57

Lustique
Member
From: DE
Registered: 2013-11-15
Posts: 17

Re: lxdm stopped working

I've got the same kind of problem (desktop, not a laptop): Since mesa-18.1.x, I just get a corrupted screen (green/pink checkerboard with lots of weird artifacts, but a normal looking, moveable mouse pointer) after booting, and I never get to see lxdm (I'm not using lightdm). My systemlog also has a lot of ‘ring 0 stalled’ and ‘GPU lockups’, and the only thing that has helped so far was downgrading back to mesa-18.0.4 every time I tried out a new mesa update, hoping the issue would fix itself.

My card is a Radeon HD6870.

Last edited by Lustique (2018-08-15 11:32:26)

Offline

#5 2018-09-02 19:11:37

cookiecaper
Member
Registered: 2007-09-22
Posts: 198

Re: lxdm stopped working

After running upgrades last night, we get the same types of irregularities. The system is semi-usable and sometimes recovers, but X segfaults occasionally, and the screen is corrupted on boot.

lspci for the video card:

00:01.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] BeaverCreek [Radeon HD 6530D] (prog-if 00 [VGA controller])
        Subsystem: Acer Incorporated [ALI] BeaverCreek [Radeon HD 6530D]
        Flags: bus master, fast devsel, latency 0, IRQ 38
        Memory at c0000000 (32-bit, prefetchable) [size=256M]
        I/O ports at f000 [size=256]
        Memory at fef00000 (32-bit, non-prefetchable) [size=256K]
        [virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: [50] Power Management version 3
        Capabilities: [58] Express Root Complex Integrated Endpoint, MSI 00
        Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
        Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
        Kernel driver in use: radeon
        Kernel modules: radeon

Snippets from dmesg:

[   17.444407] [drm] Initialized radeon 2.50.0 20080528 for 0000:00:01.0 on minor 0
[   29.854597] radeon 0000:00:01.0: evergreen_cs_track_validate_texture:855 texture bo too small (layer size 8294400, offset 0, max layer 1, depth 1, bo size 4096) (1920 1080)
[   29.854672] [drm:radeon_cs_ioctl [radeon]] *ERROR* Invalid command stream !
[   33.301071] [drm:radeon_cs_parser_relocs [radeon]] *ERROR* gem object lookup failed 0xf
[   33.301101] [drm:radeon_cs_ioctl [radeon]] *ERROR* Failed to parse relocation -2!
[   44.107525] radeon 0000:00:01.0: ring 0 stalled for more than 10154msec
[   44.107542] radeon 0000:00:01.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x0000000000000019 on ring 0)
[   44.614205] radeon 0000:00:01.0: ring 0 stalled for more than 10660msec
[   44.614221] radeon 0000:00:01.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x000000000000001a on ring 0)
[   45.120883] radeon 0000:00:01.0: ring 0 stalled for more than 11167msec
[   45.120900] radeon 0000:00:01.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x000000000000001a on ring 0)
[   45.627569] radeon 0000:00:01.0: ring 0 stalled for more than 11674msec
[   45.627585] radeon 0000:00:01.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x000000000000001b on ring 0)

and

[  141.898999] radeon 0000:00:01.0: ring 0 stalled for more than 107940msec
[  141.899015] radeon 0000:00:01.0: GPU lockup (current fence id 0x0000000000000003 last fence id 0x000000000000007c on ring 0)
[  142.397930] radeon 0000:00:01.0: failed to get a new IB (-35)
[  142.398005] [drm:radeon_cs_ioctl [radeon]] *ERROR* Failed to get ib !
[  142.407297] radeon 0000:00:01.0: Saved 3858 dwords of commands on ring 0.
[  142.407312] radeon 0000:00:01.0: GPU softreset: 0x00000019
[  142.407316] radeon 0000:00:01.0:   GRBM_STATUS               = 0xE5703CA0
[  142.407320] radeon 0000:00:01.0:   GRBM_STATUS_SE0           = 0xFC000007
[  142.407324] radeon 0000:00:01.0:   GRBM_STATUS_SE1           = 0x00000007
[  142.407328] radeon 0000:00:01.0:   SRBM_STATUS               = 0x20000840
[  142.407331] radeon 0000:00:01.0:   SRBM_STATUS2              = 0x00000000
[  142.407336] radeon 0000:00:01.0:   R_008674_CP_STALLED_STAT1 = 0x01000000
[  142.407340] radeon 0000:00:01.0:   R_008678_CP_STALLED_STAT2 = 0x00011000
[  142.407344] radeon 0000:00:01.0:   R_00867C_CP_BUSY_STAT     = 0x00068406
[  142.407348] radeon 0000:00:01.0:   R_008680_CP_STAT          = 0x80878647
[  142.407352] radeon 0000:00:01.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
[  142.416277] radeon 0000:00:01.0: GRBM_SOFT_RESET=0x00007F6B
[  142.416333] radeon 0000:00:01.0: SRBM_SOFT_RESET=0x00000100
[  142.417549] radeon 0000:00:01.0:   GRBM_STATUS               = 0x00003828
[  142.417553] radeon 0000:00:01.0:   GRBM_STATUS_SE0           = 0x00000007
[  142.417556] radeon 0000:00:01.0:   GRBM_STATUS_SE1           = 0x00000007
[  142.417560] radeon 0000:00:01.0:   SRBM_STATUS               = 0x20000040
[  142.417564] radeon 0000:00:01.0:   SRBM_STATUS2              = 0x00000000
[  142.417567] radeon 0000:00:01.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
[  142.417571] radeon 0000:00:01.0:   R_008678_CP_STALLED_STAT2 = 0x00000000
[  142.417574] radeon 0000:00:01.0:   R_00867C_CP_BUSY_STAT     = 0x00000000
[  142.417578] radeon 0000:00:01.0:   R_008680_CP_STAT          = 0x00000000
[  142.417582] radeon 0000:00:01.0:   R_00D034_DMA_STATUS_REG   = 0x44C83D57
[  142.417592] radeon 0000:00:01.0: GPU reset succeeded, trying to resume
[  142.432919] [drm] Found smc ucode version: 0x00011100
[  142.441624] [drm] PCIE GART of 1024M enabled (table at 0x0000000000162000).
[  142.441787] radeon 0000:00:01.0: WB enabled
[  142.441795] radeon 0000:00:01.0: fence driver on ring 0 use gpu addr 0x0000000020000c00 and cpu addr 0x00000000f9ab3738
[  142.441800] radeon 0000:00:01.0: fence driver on ring 3 use gpu addr 0x0000000020000c0c and cpu addr 0x000000008ded1b39
[  142.442239] radeon 0000:00:01.0: fence driver on ring 5 use gpu addr 0x0000000000072118 and cpu addr 0x0000000063f6dfda
[  142.458395] [drm] ring test on 0 succeeded in 1 usecs
[  142.458403] [drm] ring test on 3 succeeded in 2 usecs
[  142.504461] [drm] ring test on 5 succeeded in 1 usecs
[  142.524478] [drm] UVD initialized successfully.
[  143.552438] [drm:r600_ib_test [radeon]] *ERROR* radeon: fence wait timed out.
[  143.552501] [drm:radeon_ib_ring_tests [radeon]] *ERROR* radeon: failed testing IB on GFX ring (-110).

It is after the soft reset that things start to be usable.

There was a minor upgrade of Xorg/Mesa last night, but the significant change was linux 4.17.11 -> 4.18.5. This system was last updated on Aug 4.

Offline

#6 2018-11-05 21:17:03

paul_huffton
Member
Registered: 2017-12-28
Posts: 3

Re: lxdm stopped working

I thought I would try to upgrade and see if it works. It does now, so looks like the bug was fixed. Mesa currently is 18.2.4-1

Offline

Board footer

Powered by FluxBB