You are not logged in.

#1 2019-12-12 19:25:38

daanjderuiter
Member
Registered: 2016-12-28
Posts: 40

AMDGPU Navi sensor I/O errors and related bugs

I am using an AMD RX 5700 GPU, which so far has been working fine for me. However, over the last few days I started experiencing issues. All of these would be negated after a reboot, though. I am using the open source driver, and experience these on Sway (haven't tried X).

I experience the following after suspending my PC and resuming again. On keypresses, the screen flickers with bar-like artefacts. The built-in audio doesn't output anything (analog out works fine). Other than that, after starting Sway (both after a reboot or otherwise starting a new session), it takes ~10 sec for GUI applications to be able to start, and data from lm-sensors has issues. Specifically, it reports issues with regards to /sys/class/hwmon/hwmon1/temp1_input (associated with my GPU, I checked the PCI address). Also, my syslog is spammed with entries from AMDGPU, e.g.:

Dec 12 20:23:20 <hostname> kernel: amdgpu: [powerplay] failed send message: SetDriverDramAddrHigh (14)         param: 0x00000080 response 0xfffffffb
Dec 12 20:23:20 <hostname> kernel: amdgpu: [powerplay] Failed to export SMU metrics table!
Dec 12 20:23:20 <hostname> sway[36977]: /usr/lib/python3.8/site-packages/psutil/_pslinux.py:1226: RuntimeWarning: ignoring OSError(5, 'Input/output error') for file '/sys/class/hwmon/hwmon1/temp1_input'

(The psutil errors are due to some bar blocklet using it to display sensor values)

Does anyone have similar experiences? Are these just some issues with the immaturity of Navi?

I am not sure what package upgrade might have triggered this, but I don't think it's the 5.4 kernel itself as I've been running that for longer than I have experienced these issues

Offline

#2 2019-12-12 23:13:35

Pse
Member
Registered: 2008-03-15
Posts: 407

Re: AMDGPU Navi sensor I/O errors and related bugs

Have you recently connected two DisplayPort displays? I started seeing similar errors in my logs a couple of days ago just when I added a second DisplayPort display. I only see the errors once after boot, they don't spam the logs continually. I do experience, however, a slow login since I started seeing this. After that initial delay during login, everything works normally. Radeon 5700 XT using AMDGPU-PRO.

Last edited by Pse (2019-12-12 23:15:29)

Offline

#3 2019-12-13 10:27:39

daanjderuiter
Member
Registered: 2016-12-28
Posts: 40

Re: AMDGPU Navi sensor I/O errors and related bugs

No, I am only using my one main monitor with this machine. And my syslog definitely constituted a spam situation, the above three lines appeared repeatedly.

But most curiously, after waking up my machine this morning to type this reply, I haven't seen any of the issues yet. I hope that this means that this issue has solved itself. I have no idea what triggered that, though - I didn't install a newer kernel or graphics-related drivers. I'll leave this post unsolved for a day longer or so, and mark it solved if it remains solid. Still a very curious issue.

Offline

#4 2019-12-14 00:56:26

daanjderuiter
Member
Registered: 2016-12-28
Posts: 40

Re: AMDGPU Navi sensor I/O errors and related bugs

Nope, it's still there, false alarm

Offline

#5 2019-12-26 13:18:34

maerowinger
Member
From: Austria
Registered: 2011-08-01
Posts: 4

Re: AMDGPU Navi sensor I/O errors and related bugs

It appears I have a similar issue. I would say since mid December startup of SDDM and login into KDE is terrible slow. Once everything is started the system behaves as snappy as usual. Don't know what happend/has changed. I have a RX 5700, using open-source amdgpu driver and have connected two displays using DP.

Did anyone of you resolve the problem by now?

My dmesg looks like this:

   11.178651] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   13.293662] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   15.408631] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   15.995207] audit: type=1131 audit(1577365410.385:25): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[   17.559163] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   17.697788] audit: type=1130 audit(1577365153.291:26): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=ntpdate comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[   17.697800] audit: type=1131 audit(1577365153.291:27): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=ntpdate comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[   19.667710] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   21.775698] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   23.883249] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   26.010493] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[   28.114119] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[   30.217901] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[   32.321439] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[   34.554980] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[   35.997997] audit: type=1131 audit(1577365171.591:28): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=systemd-hostnamed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[   36.661523] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[   38.765750] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[   40.869985] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[   43.041595] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   45.145778] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   47.263143] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   49.367420] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   51.504521] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   53.608781] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   55.722315] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   57.826286] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[   58.518986] audit: type=1100 audit(1577365194.111:29): pid=718 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:authentication grantors=pam_permit acct="sddm" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=? res=success'
[   58.518991] audit: type=1101 audit(1577365194.111:30): pid=718 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:accounting grantors=pam_permit acct="sddm" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=? res=success'
[   58.519354] audit: type=1103 audit(1577365194.111:31): pid=718 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:setcred grantors=pam_permit acct="sddm" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=? res=success'
[   58.535278] audit: type=1130 audit(1577365194.127:32): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=user-runtime-dir@975 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[   58.541919] audit: type=1101 audit(1577365194.134:33): pid=721 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:accounting grantors=pam_tally2,pam_access,pam_unix,pam_permit,pam_time acct="sddm" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[   58.541943] audit: type=1006 audit(1577365194.134:34): pid=721 uid=0 old-auid=4294967295 auid=975 tty=(none) old-ses=4294967295 ses=1 res=1
[   58.542477] audit: type=1105 audit(1577365194.134:35): pid=721 uid=0 auid=975 ses=1 msg='op=PAM:session_open grantors=pam_loginuid,pam_loginuid,pam_keyinit,pam_limits,pam_unix,pam_permit,pam_mail,pam_systemd,pam_env acct="sddm" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[   58.613841] audit: type=1130 audit(1577365194.207:36): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=user@975 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[   58.618503] audit: type=1105 audit(1577365194.211:37): pid=718 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:session_open grantors=pam_unix,pam_systemd acct="sddm" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=:0 res=success'
[   59.330649] audit: type=1130 audit(1577365194.924:38): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=polkit comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[  110.316806] kauditd_printk_skb: 2 callbacks suppressed
[  110.316808] audit: type=1100 audit(1577365245.911:41): pid=812 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:authentication grantors=pam_tally2,pam_shells,pam_unix,pam_permit acct="maerowinger" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=? res=success'
[  110.320212] audit: type=1101 audit(1577365245.914:42): pid=812 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:accounting grantors=pam_tally2,pam_access,pam_unix,pam_permit,pam_time acct="maerowinger" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=? res=success'
[  110.320686] audit: type=1103 audit(1577365245.914:43): pid=812 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:setcred grantors=pam_tally2,pam_shells,pam_unix,pam_permit acct="maerowinger" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=? res=success'
[  110.320861] audit: type=1006 audit(1577365245.914:44): pid=812 uid=0 old-auid=4294967295 auid=1000 tty=(none) old-ses=4294967295 ses=2 res=1
[  110.334947] audit: type=1130 audit(1577365245.927:45): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=user-runtime-dir@1000 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[  110.339905] audit: type=1101 audit(1577365245.931:46): pid=814 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:accounting grantors=pam_tally2,pam_access,pam_unix,pam_permit,pam_time acct="maerowinger" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[  110.339923] audit: type=1006 audit(1577365245.931:47): pid=814 uid=0 old-auid=4294967295 auid=1000 tty=(none) old-ses=4294967295 ses=3 res=1
[  110.340218] audit: type=1105 audit(1577365245.934:48): pid=814 uid=0 auid=1000 ses=3 msg='op=PAM:session_open grantors=pam_loginuid,pam_loginuid,pam_keyinit,pam_limits,pam_unix,pam_permit,pam_mail,pam_systemd,pam_env acct="maerowinger" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[  110.408263] audit: type=1106 audit(1577365246.001:49): pid=718 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:session_close grantors=pam_unix,pam_systemd acct="sddm" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=:0 res=success'
[  110.408295] audit: type=1104 audit(1577365246.001:50): pid=718 uid=0 auid=4294967295 ses=4294967295 msg='op=PAM:setcred grantors=pam_permit acct="sddm" exe="/usr/lib/sddm/sddm-helper" hostname=? addr=? terminal=:0 res=success'
[  113.595499] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[  115.700903] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  117.804992] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  119.910082] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  120.557127] kauditd_printk_skb: 3 callbacks suppressed
[  120.557129] audit: type=1131 audit(1577365256.151:54): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=user@975 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[  120.566258] audit: type=1131 audit(1577365256.157:55): pid=1 uid=0 auid=4294967295 ses=4294967295 msg='unit=user-runtime-dir@975 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
[  122.014069] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  124.138466] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[  126.261706] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[  128.376625] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  130.489235] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  132.605992] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  134.719061] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  136.843460] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[  138.967264] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xffffffc2
[  142.790396] snd_hda_intel 0000:09:00.1: Refused to change power state, currently in D0
[  162.384844] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  164.503941] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  166.623070] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  168.742184] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  170.859911] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  172.977599] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  175.095536] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  177.214839] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  197.004578] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  199.128701] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  201.255163] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  203.378606] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  205.479549] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  207.579646] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xffffffc2
[  209.682310] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xffffffc2
[  209.682315] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  209.682352] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  209.682359] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  209.682367] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  209.682372] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  209.682506] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  209.682513] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  209.682516] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  209.682522] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  209.682648] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  209.682653] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  209.682655] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  209.682660] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  209.682781] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  209.682787] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  209.682790] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  209.682795] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  209.795598] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  209.795604] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  209.809682] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  209.809689] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  274.119542] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  274.119550] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  274.127323] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  274.127329] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  274.133108] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  274.133115] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  274.144212] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  274.144219] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  451.194890] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  451.194896] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  451.194898] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  451.194902] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  451.195100] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  451.195105] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00000000 response 0xfffffffb
[  451.195107] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  451.195112] amdgpu: [powerplay] failed send message: GetMaxDpmFreq (31)      param: 0x00020000 response 0xfffffffb
[  451.288545] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  451.288552] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  451.302409] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  451.302416] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  451.308158] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  451.308165] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  451.318791] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  451.318798] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  452.790541] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  452.790548] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  452.803733] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  452.803740] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  452.811028] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  452.811035] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  452.820003] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  452.820010] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  453.857116] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  453.857122] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  453.870194] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  453.870199] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  453.890587] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  453.890595] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  453.904323] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb
[  453.904330] amdgpu: [powerplay] failed send message: NumOfDisplays (64)      param: 0x00000002 response 0xfffffffb

Update:

"Fixed" my issue by disabling AMDGPU Power Management via kernel cmd line parameter:

amdgpu.aspm=0 amdgpu.dpm=0

Last edited by maerowinger (2020-01-03 13:13:23)

Offline

#6 2019-12-30 15:42:25

daanjderuiter
Member
Registered: 2016-12-28
Posts: 40

Re: AMDGPU Navi sensor I/O errors and related bugs

This seems to be fixed in the 5.4.6 kernel. If I don't run into anything for the next day I'll mark this as solved.

Offline

#7 2019-12-31 11:21:38

daanjderuiter
Member
Registered: 2016-12-28
Posts: 40

Re: AMDGPU Navi sensor I/O errors and related bugs

Once again, false alarm. Issue persists, will leave this up

Offline

#8 2020-01-04 16:43:11

carbolymer
Member
Registered: 2012-04-25
Posts: 34

Re: AMDGPU Navi sensor I/O errors and related bugs

I have the same issue. In my cases the issue correlates with using of

sensors

which looks similar to: https://www.gamingonlinux.com/forum/topic/4128/page=5 . I am able to reproduce it, when running multiple sensors in parallel. I've tried newer kernels, but the problem still exists:

linux 5.4.7.arch1-1
linux-mainline 5.5rc4-1
linux-git 5.5rc4.r124.g3a562aee727a-1

Adidtionally, using

amdgpu.aspm=0 amdgpu.dpm=0

makes system unable to boot.

Related bug: https://gitlab.freedesktop.org/drm/amd/issues/929

any ideas where to look further?

Last edited by carbolymer (2020-01-04 18:15:52)

Offline

#9 2020-01-13 12:44:25

daanjderuiter
Member
Registered: 2016-12-28
Posts: 40

Re: AMDGPU Navi sensor I/O errors and related bugs

Having a look at some of the commit messages for the 5.5 kernel rc's in the LKML (rc5 in particular) it looks like there are some promising fixes coming along that might affect this issue as well. Eagerly awaiting the 5.5 stable release

Offline

Board footer

Powered by FluxBB