
#1 2018-04-01 04:05:48

Bl4ckb0ne
Member
Registered: 2014-12-26
Posts: 70

amdgpu shuts down Xorg

Hi,

I recently got an MSI RX 580, and after using it for a few hours, I'm getting kicked out of Xorg back to the display manager. I have 2 GPUs in my computer:

$ uname -a
Linux lovecraft 4.15.12-1-ARCH #1 SMP PREEMPT Wed Mar 21 15:14:56 UTC 2018 x86_64 GNU/Linux

$ lspci -k | grep -A 2 -E "(VGA|3D)"
00:02.0 VGA compatible controller: Intel Corporation Xeon E3-1200 v3/4th Gen Core Processor Integrated Graphics Controller (rev 06)
        Subsystem: Gigabyte Technology Co., Ltd Xeon E3-1200 v3/4th Gen Core Processor Integrated Graphics Controller
        Kernel driver in use: i915
--
01:00.0 VGA compatible controller: NVIDIA Corporation GM204 [GeForce GTX 970] (rev a1)
        Subsystem: Micro-Star International Co., Ltd. [MSI] GM204 [GeForce GTX 970]
        Kernel driver in use: vfio-pci
--
05:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon RX 470/480/570/580] (rev e7)
        Subsystem: Micro-Star International Co., Ltd. [MSI] Ellesmere [Radeon RX 470/480/570/580]
        Kernel driver in use: amdgpu

The RX 580 is using driver 18.0.1-1 from the extra repo. The GTX 970 is for GPU passthrough, but it's currently unused.

The dmesg logs are not giving much information:

$ dmesg | tail
[340169.844648] [drm] {1920x1080, 2200x1125@148500Khz}
[344710.432910] [drm] {1920x1080, 2200x1125@148500Khz}
[344710.434405] [drm] {1920x1080, 2200x1125@148500Khz}
[344710.450062] [drm] {1920x1080, 2200x1125@148500Khz}
[345310.738670] [drm] {1920x1080, 2200x1125@148500Khz}
[360953.968651] [drm] {1920x1080, 2200x1125@148500Khz}
[360953.970688] [drm] {1920x1080, 2200x1125@148500Khz}
[360953.986288] [drm] {1920x1080, 2200x1125@148500Khz}
[361389.199467] [drm] {1920x1080, 2200x1125@148500Khz}
[361389.217197] [drm] {1920x1080, 2200x1125@148500Khz}

Nothing is logged in .local/share/xorg/Xorg.0.log, but a few lines are written to /var/log/Xorg.0.log at every kickout:

[361544.917] (II) AMDGPU(0): EDID vendor "AOC", prod id 8807
[361544.917] (II) AMDGPU(0): Using hsync ranges from config file
[361544.917] (II) AMDGPU(0): Using vrefresh ranges from config file
[361544.917] (II) AMDGPU(0): Printing DDC gathered Modelines:
[361544.917] (II) AMDGPU(0): Modeline "1920x1080"x0.0  148.50  1920 2008 2052 2200  1080 1084 1089 1125 +hsync +vsync (67.5 kHz eP)
[361544.917] (II) AMDGPU(0): Modeline "800x600"x0.0   40.00  800 840 968 1056  600 601 605 628 +hsync +vsync (37.9 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "800x600"x0.0   36.00  800 824 896 1024  600 601 603 625 +hsync +vsync (35.2 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "640x480"x0.0   31.50  640 656 720 840  480 481 484 500 -hsync -vsync (37.5 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "640x480"x0.0   31.50  640 664 704 832  480 489 492 520 -hsync -vsync (37.9 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "640x480"x0.0   30.24  640 704 768 864  480 483 486 525 -hsync -vsync (35.0 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "640x480"x0.0   25.18  640 656 752 800  480 490 492 525 -hsync -vsync (31.5 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "720x400"x0.0   28.32  720 738 846 900  400 412 414 449 -hsync +vsync (31.5 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "1280x1024"x0.0  135.00  1280 1296 1440 1688  1024 1025 1028 1066 +hsync +vsync (80.0 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "1024x768"x0.0   78.75  1024 1040 1136 1312  768 769 772 800 +hsync +vsync (60.0 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "1024x768"x0.0   75.00  1024 1048 1184 1328  768 771 777 806 -hsync -vsync (56.5 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "1024x768"x0.0   65.00  1024 1048 1184 1344  768 771 777 806 -hsync -vsync (48.4 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "832x624"x0.0   57.28  832 864 928 1152  624 625 628 667 -hsync -vsync (49.7 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "800x600"x0.0   49.50  800 816 896 1056  600 601 604 625 +hsync +vsync (46.9 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "800x600"x0.0   50.00  800 856 976 1040  600 637 643 666 +hsync +vsync (48.1 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "1920x1080"x60.0  172.80  1920 2040 2248 2576  1080 1081 1084 1118 -hsync +vsync (67.1 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "1680x1050"x0.0  119.00  1680 1728 1760 1840  1050 1053 1059 1080 +hsync -vsync (64.7 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "1440x900"x0.0   88.75  1440 1488 1520 1600  900 903 909 926 +hsync -vsync (55.5 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "1280x1024"x0.0  108.00  1280 1328 1440 1688  1024 1025 1028 1066 +hsync +vsync (64.0 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "1280x960"x0.0  108.00  1280 1376 1488 1800  960 961 964 1000 +hsync +vsync (60.0 kHz e)
[361544.917] (II) AMDGPU(0): Modeline "1280x720"x60.0   74.48  1280 1336 1472 1664  720 721 724 746 -hsync +vsync (44.8 kHz e)

I have no other ideas on where else to look for logs. Is this a known problem? I was not able to find any info on the same issue online.

Thanks.


#2 2018-04-01 07:55:16

seth
Member
Registered: 2012-09-03
Posts: 49,981

Re: amdgpu shuts down Xorg

If the X11 server does not restart (thus you get no new log, only additions to the existing one), it's rather the session that crashes, not X11. Check your journal and coredumps in that regard.


#3 2018-04-01 16:53:22

Bl4ckb0ne
Member
Registered: 2014-12-26
Posts: 70

Re: amdgpu shuts down Xorg

journalctl is giving the exact same info as dmesg. Where can I find those coredumps?


#4 2018-04-01 18:18:41

seth
Member
Registered: 2012-09-03
Posts: 49,981

Re: amdgpu shuts down Xorg

You're asking the wrong journal command then: the journal contains *far* more than the kernel ringbuffer.
"man coredumpctl" for coredumps.

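For readers following along, here is a rough sketch of the commands the reply above points at. `journalctl -k` limits output to kernel messages (roughly what dmesg shows), `journalctl -b` shows everything logged during the current boot, and `coredumpctl` lists and describes recorded core dumps. The helper name `show_crash_info` and the `tail` line counts are assumptions made for illustration only.

```shell
#!/bin/sh
# Sketch: look beyond the kernel ring buffer for crash information.
# show_crash_info is a hypothetical helper; pass the name of the crashing executable.
show_crash_info() {
    name="$1"
    if ! command -v journalctl >/dev/null 2>&1; then
        echo "journalctl not found (not a systemd system?)" >&2
        return 1
    fi
    # Kernel messages of the current boot only: roughly the same content as dmesg.
    journalctl -k -b --no-pager | tail -n 20
    # Everything logged during the current boot, not just kernel messages.
    journalctl -b --no-pager | tail -n 50
    # Core dumps recorded by systemd-coredump that match the given name.
    coredumpctl --no-pager list "$name"
    coredumpctl --no-pager info "$name"
}

# Usage (root or membership in the systemd-journal group is needed to see system logs):
#   show_crash_info Xorg
```

The function is only defined here, not called, so nothing runs until you invoke it yourself.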

#5 2018-04-01 19:07:56

Bl4ckb0ne
Member
Registered: 2014-12-26
Posts: 70

Re: amdgpu shuts down Xorg

I was able to get more info from coredumpctl:

           PID: 1077 (Xorg)
           UID: 0 (root)
           GID: 0 (root)
        Signal: 11 (SEGV)
     Timestamp: Sun 2018-04-01 14:27:19 ADT (34min ago)
  Command Line: /usr/lib/xorg-server/Xorg -nolisten tcp -auth /var/run/sddm/{99549815-d109-4eb7-acf1-3d0b70fd5d7c} -bac>
    Executable: /usr/lib/xorg-server/Xorg
 Control Group: /system.slice/sddm.service
          Unit: sddm.service
         Slice: system.slice
       Boot ID: e246d10d3aa645c0836d7a4faae79962
    Machine ID: 7172b7a36fef4853b9e8f3d9c4f373ad
      Hostname: lovecraft
       Storage: /var/lib/systemd/coredump/core.Xorg.0.e246d10d3aa645c0836d7a4faae79962.1077.1522603639000000.lz4 (inacc>
       Message: Process 1077 (Xorg) of user 0 dumped core.
                Stack trace of thread 1106:
                #0  0x00007f28a41bd6b2 _IO_default_xsputn (libc.so.6)
                #1  0x00007f28a41b0a4d _IO_padn (libc.so.6)
                #2  0x00007f28a41908d8 vfprintf (libc.so.6)
                #3  0x00007f28a424699b __vsnprintf_chk (libc.so.6)
                #4  0x00007f28a42468c5 __snprintf_chk (libc.so.6)
                #5  0x00007f28a59ed6cb pci_device_vgaarb_set_target (libpciaccess.so.0)
                #6  0x000055d216078188 n/a (Xorg)
                #7  0x000055d21607819f n/a (Xorg)
                #8  0x000055d21607819f n/a (Xorg)
                #9  0x000055d21607819f n/a (Xorg)
                #10 0x000055d21607819f n/a (Xorg)
                #11 0x000055d21607819f n/a (Xorg)
                #12 0x000055d21607819f n/a (Xorg)
                #13 0x000055d21607819f n/a (Xorg)
                #14 0x000055d21607819f n/a (Xorg)
                #15 0x000055d21607819f n/a (Xorg)
                #16 0x000055d21607819f n/a (Xorg)
                #17 0x000055d21607819f n/a (Xorg)
                #18 0x000055d21607819f n/a (Xorg)
                #19 0x000055d21607819f n/a (Xorg)
                #20 0x000055d21607819f n/a (Xorg)
                #21 0x000055d21607819f n/a (Xorg)
                #22 0x000055d21607819f n/a (Xorg)
                #23 0x000055d21607819f n/a (Xorg)
                #24 0x000055d21607819f n/a (Xorg)
                #25 0x000055d21607819f n/a (Xorg)
                #26 0x000055d21607819f n/a (Xorg)
                #27 0x000055d21607819f n/a (Xorg)
                #28 0x000055d21607819f n/a (Xorg)
                #29 0x000055d21607819f n/a (Xorg)
                #30 0x000055d21607819f n/a (Xorg)
                #31 0x000055d21607819f n/a (Xorg)
                #32 0x000055d21607819f n/a (Xorg)
                #33 0x000055d21607819f n/a (Xorg)
                #34 0x000055d21607819f n/a (Xorg)
                #35 0x000055d21607819f n/a (Xorg)
                #36 0x000055d21607819f n/a (Xorg)
                #37 0x000055d21607819f n/a (Xorg)
                #38 0x000055d21607819f n/a (Xorg)
                #39 0x000055d21607819f n/a (Xorg)
                #40 0x000055d21607819f n/a (Xorg)
                #41 0x000055d21607819f n/a (Xorg)
                #42 0x000055d21607819f n/a (Xorg)
                #43 0x000055d21607819f n/a (Xorg)
                #44 0x000055d21607819f n/a (Xorg)
                #45 0x000055d21607819f n/a (Xorg)
                #46 0x000055d21607819f n/a (Xorg)
                #47 0x000055d21607819f n/a (Xorg)
                #48 0x000055d21607819f n/a (Xorg)
                #49 0x000055d21607819f n/a (Xorg)
                #50 0x000055d21607819f n/a (Xorg)
                #51 0x000055d21607819f n/a (Xorg)
                #52 0x000055d21607819f n/a (Xorg)
                #53 0x000055d21607819f n/a (Xorg)
                #54 0x000055d21607819f n/a (Xorg)
                #55 0x000055d21607819f n/a (Xorg)
                #56 0x000055d21607819f n/a (Xorg)
                #57 0x000055d21607819f n/a (Xorg)
                #58 0x000055d21607819f n/a (Xorg)
                #59 0x000055d21607819f n/a (Xorg)
                #60 0x000055d21607819f n/a (Xorg)
                #61 0x000055d21607819f n/a (Xorg)
                #62 0x000055d21607819f n/a (Xorg)
                #63 0x000055d21607819f n/a (Xorg)

                Stack trace of thread 1077:
                #0  0x00007f28a450973c __lll_lock_wait (libpthread.so.0)
                #1  0x00007f28a4502ac6 __pthread_mutex_lock (libpthread.so.0)
                #2  0x000055d21617fec0 input_lock (Xorg)
                #3  0x000055d21617b4c4 n/a (Xorg)
                #4  0x000055d21617b7b7 WaitForSomething (Xorg)
                #5  0x000055d21601bfc3 n/a (Xorg)
                #6  0x000055d216020260 n/a (Xorg)
                #7  0x00007f28a4162f4a __libc_start_main (libc.so.6)
                #8  0x000055d216009f0a _start (Xorg)

                Stack trace of thread 1080:
                #0  0x00007f28a45063bd pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0)
                #1  0x00007f289f3278ec n/a (radeonsi_dri.so)
                #2  0x00007f289f3277f8 n/a (radeonsi_dri.so)
                #3  0x00007f28a450008c start_thread (libpthread.so.0)
                #4  0x00007f28a4237e7f __clone (libc.so.6)

                Stack trace of thread 1085:
                #0  0x00007f28a45063bd pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0)
                #1  0x00007f289f3278ec n/a (radeonsi_dri.so)
                #2  0x00007f289f3277f8 n/a (radeonsi_dri.so)
                #3  0x00007f28a450008c start_thread (libpthread.so.0)
                #4  0x00007f28a4237e7f __clone (libc.so.6)

                Stack trace of thread 1082:
                #0  0x00007f28a45063bd pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0)
                #1  0x00007f289f3278ec n/a (radeonsi_dri.so)
                #2  0x00007f289f3277f8 n/a (radeonsi_dri.so)
                #3  0x00007f28a450008c start_thread (libpthread.so.0)
                #4  0x00007f28a4237e7f __clone (libc.so.6)

                Stack trace of thread 1083:
                #0  0x00007f28a45063bd pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0)
                #1  0x00007f289f3278ec n/a (radeonsi_dri.so)
                #2  0x00007f289f3277f8 n/a (radeonsi_dri.so)
                #3  0x00007f28a450008c start_thread (libpthread.so.0)
                #4  0x00007f28a4237e7f __clone (libc.so.6)

                Stack trace of thread 1084:
                #0  0x00007f28a45063bd pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0)
                #1  0x00007f289f3278ec n/a (radeonsi_dri.so)
                #2  0x00007f289f3277f8 n/a (radeonsi_dri.so)
                #3  0x00007f28a450008c start_thread (libpthread.so.0)
                #4  0x00007f28a4237e7f __clone (libc.so.6)

                Stack trace of thread 1099:
                #0  0x00007f28a45063bd pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0)
                #1  0x00007f289f3278ec n/a (radeonsi_dri.so)
                #2  0x00007f289f3277f8 n/a (radeonsi_dri.so)
                #3  0x00007f28a450008c start_thread (libpthread.so.0)
                #4  0x00007f28a4237e7f __clone (libc.so.6)

                Stack trace of thread 1086:
                #0  0x00007f28a45063bd pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0)
                #1  0x00007f289f3278ec n/a (radeonsi_dri.so)
                #2  0x00007f289f3277f8 n/a (radeonsi_dri.so)
                #3  0x00007f28a450008c start_thread (libpthread.so.0)
                #4  0x00007f28a4237e7f __clone (libc.so.6)

                Stack trace of thread 1081:
                #0  0x00007f28a45063bd pthread_cond_wait@@GLIBC_2.3.2 (libpthread.so.0)
                #1  0x00007f289f3278ec n/a (radeonsi_dri.so)
                #2  0x00007f289f3277f8 n/a (radeonsi_dri.so)
                #3  0x00007f28a450008c start_thread (libpthread.so.0)
                #4  0x00007f28a4237e7f __clone (libc.so.6)

And the journalctl entries:

$ journalctl /usr/bin/Xorg
-- Logs begin at Sun 2017-05-14 13:04:43 ADT, end at Sun 2018-04-01 15:01:09 ADT. --
-- No entries --

$ journalctl /usr/lib/xorg-server/Xorg
-- Logs begin at Sun 2017-05-14 13:04:43 ADT, end at Sun 2018-04-01 15:01:09 ADT. --
-- No entries --
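
In a dump this long, repetition is easy to miss by eye. As a rough sketch, the following reports any thread in which one frame address occurs more than once. The helper name `count_repeated_frames` is made up for illustration, and the script assumes the `coredumpctl info` layout shown in the trace above, with frames printed as `#N ADDRESS symbol (module)`.

```shell
#!/bin/sh
# count_repeated_frames: hypothetical helper that reads `coredumpctl info` text
# on stdin and prints, for each thread, the most frequent frame address
# if that address appears more than once in the thread's stack trace.
count_repeated_frames() {
    awk '
        # A new thread section starts: remember its id without the trailing colon.
        /Stack trace of thread/ { tid = $NF; sub(/:$/, "", tid); next }
        # Frame lines look like "#7  0x000055d21607819f n/a (Xorg)".
        tid != "" && $1 ~ /^#[0-9]+$/ {
            n[tid, $2]++
            if (n[tid, $2] > max[tid]) { max[tid] = n[tid, $2]; addr[tid] = $2 }
        }
        END {
            for (t in max)
                if (max[t] > 1)
                    print "thread " t ": " addr[t] " repeated " max[t] " times"
        }
    '
}

# Usage:
#   coredumpctl info Xorg | count_repeated_frames
```

Run against the dump above, this would flag thread 1106, where address 0x000055d21607819f fills frames #7 through #63.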


#6 2018-04-01 19:39:24

seth
Member
Registered: 2012-09-03
Posts: 49,981

Re: amdgpu shuts down Xorg

Looks like https://lists.freedesktop.org/archives/ … 58943.html. The relevant journal section would, by the way, be "journalctl -b", and the crash is likely registered in /var/log/Xorg.0.log.old (/var/log/Xorg.0.log is simply the log for the new X11 server that sddm kicked off automatically).
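
A minimal sketch of checking the previous server's log, assuming it lives at the path named above. Xorg marks error lines with (EE) and warnings with (WW), so filtering on those markers narrows the log down quickly. The helper name `xorg_log_errors` is hypothetical.

```shell
#!/bin/sh
# xorg_log_errors: hypothetical helper that prints the error (EE) and
# warning (WW) lines of an Xorg log. Defaults to the previous server's log.
xorg_log_errors() {
    log="${1:-/var/log/Xorg.0.log.old}"
    if [ ! -r "$log" ]; then
        echo "cannot read $log" >&2
        return 1
    fi
    # The marker legend at the top of a real log also mentions (WW) and (EE),
    # so expect that legend line in the output as well.
    grep -E '\((EE|WW)\)' "$log"
}

# Usage:
#   xorg_log_errors                        # previous server's log
#   xorg_log_errors /var/log/Xorg.0.log    # log of the currently running server
#   journalctl -b --no-pager | less        # the whole journal for this boot
```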


#7 2018-04-02 17:15:34

Bl4ckb0ne
Member
Registered: 2014-12-26
Posts: 70

Re: amdgpu shuts down Xorg

Is this patch available in testing, or do I have to build amdgpu by hand with that patch?


#8 2018-04-03 18:00:11

seth
Member
Registered: 2012-09-03
Posts: 49,981

Re: amdgpu shuts down Xorg

I do not even see an xorg-server page in the testing repos. And the patch is not in 1.19.6, so ...


#9 2018-04-03 19:56:02

Lone_Wolf
Member
From: Netherlands, Europe
Registered: 2005-10-04
Posts: 11,868

Re: amdgpu shuts down Xorg

The xorg-server 1.20 release date is unknown. A few weeks ago a possible RC was mentioned on the Xorg mailing list, but the consensus was that some things needed to be taken care of before 1.20 could be released.

You could try building xorg-server-git in AUR, maybe trunk already has that patch.


Disliking systemd intensely, but not satisfied with alternatives so focusing on taming systemd.


(A works at time B)  && (time C > time B ) ≠  (A works at time C)


#10 2018-04-04 13:10:48

Bl4ckb0ne
Member
Registered: 2014-12-26
Posts: 70

Re: amdgpu shuts down Xorg

Switching to xorg-server-git kicks out the amdgpu driver, and amdgpu-git requires xorg-server. Is there a guide on how to properly switch to xorg-server-git, or some tips somewhere?


#11 2018-04-05 10:04:36

Lone_Wolf
Member
From: Netherlands, Europe
Registered: 2005-10-04
Posts: 11,868

Re: amdgpu shuts down Xorg

Bl4ckb0ne wrote: "Switching to xorg-server-git kicks out the amdgpu driver, and amdgpu-git requires xorg-server."

xorg-server-git and its sub-packages do provide xorg-server, but I overlooked that a new xorg-server usually requires a rebuild of xf86-* due to ABI differences.

Use asp to retrieve the PKGBUILD and other files needed to build xf86-video-amdgpu, and change the package name to avoid conflicts/confusion with the stock packages.
Check which ABI version xorg-server-git provides, change the PKGBUILD to use that, and build an xorg-server-git compatible version of xf86-video-amdgpu.
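
The rebuild described above might look roughly like the following. This is a sketch under assumptions, not a tested recipe. It assumes xorg-server-git advertises its video driver ABI as an `X-ABI-VIDEODRV_VERSION=...` entry in the Provides field of `pacman -Qi`, as the stock xorg-server package does. `video_abi_from_provides` is a made-up helper name, and the package-building steps are left as comments because they depend on your system.

```shell
#!/bin/sh
# video_abi_from_provides: hypothetical helper that reads `pacman -Qi` output
# on stdin and prints the advertised video driver ABI version, if any.
video_abi_from_provides() {
    grep -o 'X-ABI-VIDEODRV_VERSION=[0-9.]*' | head -n 1 | cut -d= -f2
}

# Possible workflow (commented out so that nothing is built by accident):
#   pacman -Qi xorg-server-git | video_abi_from_provides
#   asp export xf86-video-amdgpu    # copies the packaging files into ./xf86-video-amdgpu
#   cd xf86-video-amdgpu
#   # Edit PKGBUILD: rename pkgname, and adjust any X-ABI-VIDEODRV_VERSION
#   # constraints so that they accept the version printed above.
#   makepkg -si
```

Renaming the package keeps pacman from treating the rebuilt driver as the stock xf86-video-amdgpu on the next upgrade.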


