You are not logged in.

#1 2024-10-17 02:37:28

freeandeasy
Member
Registered: 2024-10-17
Posts: 7

Kernel Oops on AMD Framework shortly after boot

I'm running KDE 6.2.1 (wayland edition) and recently I have been getting this kernel oops on login:

WARNING: CPU: 2 PID: 1874 at drivers/gpu/drm/amd/amdgpu/../display/dc/dpp/dcn30/dcn30_dpp.c:534 dpp3_deferred_update+0x101/0x330 [amdgpu]
Modules linked in: rfcomm snd_seq_dummy snd_hrtimer uhid nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib ip_set qrtr bnep sunrpc binfmt_misc vfat fat snd_sof_amd_acp63 snd_sof_amd_vangogh snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_sof_pci snd_sof_xtensa_dsp cdc_mbim cdc_wdm snd_sof intel_rapl_msr snd_hda_codec_realtek snd_sof_utils amd_atl snd_hda_codec_generic snd_pci_ps intel_rapl_common snd_amd_sdw_acpi snd_hda_scodec_component btusb mt7921e soundwire_amd edac_mce_amd btrtl soundwire_generic_allocation btintel mt7921_common soundwire_bus snd_hda_codec_hdmi btbcm mt792x_lib kvm_amd mt76_connac_lib btmtk cros_usbpd_charger leds_cros_ec cros_ec_hwmon cros_charge_control cros_ec_chardev cros_ec_sysfs led_class_multicolor gpio_cros_ec cros_usbpd_logger cros_usbpd_notify snd_soc_core mt76 snd_hda_intel bluetooth snd_compress cros_ec_dev spd5118 kvm snd_intel_dspcfg mac80211 snd_usb_audio cros_ec_lpcs ac97_bus snd_intel_sdw_acpi cros_ec
 snd_pcm_dmaengine snd_rpl_pci_acp6x snd_usbmidi_lib snd_hda_codec snd_acp_pci snd_ump snd_acp_legacy_common snd_rawmidi hid_sensor_als snd_hda_core libarc4 hid_sensor_trigger hid_sensor_iio_common snd_pci_acp6x mc snd_hwdep industrialio_triggered_buffer kfifo_buf snd_seq industrialio snd_seq_device cdc_ncm rapl wmi_bmof cfg80211 cdc_ether snd_pci_acp5x snd_pcm usbnet thunderbolt i2c_piix4 snd_rn_pci_acp3x snd_timer pcspkr snd_acp_config mii k10temp i2c_smbus snd_soc_acpi snd snd_pci_acp3x amd_pmf rfkill soundcore amdtee amd_sfh tee joydev platform_profile amd_pmc nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_masq nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables loop nfnetlink zram dm_crypt uas usb_storage typec_displayport amdgpu amdxcp i2c_algo_bit drm_ttm_helper ttm crct10dif_pclmul drm_exec crc32_pclmul gpu_sched crc32c_intel nvme polyval_clmulni drm_suballoc_helper polyval_generic drm_buddy nvme_core drm_display_helper ghash_clmulni_intel video
 hid_multitouch sha512_ssse3 ucsi_acpi hid_sensor_hub sha256_ssse3 ccp typec_ucsi sha1_ssse3 cec sp5100_tco nvme_auth typec i2c_hid_acpi wmi i2c_hid ip6_tables ip_tables fuse i2c_dev
CPU: 2 UID: 0 PID: 1874 Comm: kworker/u64:15 Not tainted 6.11.3-arch1-1 #1
Hardware name: Framework Laptop 16 (AMD Ryzen 7040 Series)/FRANMZCP09, BIOS 03.04 07/09/2024
Workqueue: events_unbound commit_work
RIP: 0010:dpp3_deferred_update+0x101/0x330 [amdgpu]
Code: 83 78 e1 00 00 0f b6 90 a8 02 00 00 48 8b 83 70 e1 00 00 8b b0 78 04 00 00 e8 3b 21 1b 00 8b 74 24 04 85 f6 0f 84 5d 01 00 00 <0f> 0b 0f b6 83 48 96 00 00 83 e0 f7 88 83 48 96 00 00 a8 01 0f 84
RSP: 0018:ffffb59685387b80 EFLAGS: 00010202
RAX: 0000000000000066 RBX: ffff96c9939e0000 RCX: 0000000000000004
RDX: 0000000000000000 RSI: 0000000000000002 RDI: ffff96c99d580000
RBP: 0000000000000000 R08: ffffb59685387b84 R09: ffffb59685387bb8
R10: 0000000000000000 R11: 0000000000000000 R12: ffff96ca44540000
R13: ffff96c99e000000 R14: 0000000000000000 R15: ffff96c9939e0000
FS:  0000000000000000(0000) GS:ffff96d001b00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007f685c3db378 CR3: 000000043642a000 CR4: 0000000000f50ef0
PKRU: 55555554
Call Trace:
 <TASK>
 ? dpp3_deferred_update+0x101/0x330 [amdgpu]
 ? __warn.cold+0x8e/0xe8
 ? dpp3_deferred_update+0x101/0x330 [amdgpu]
 ? report_bug+0xff/0x140
 ? handle_bug+0x3c/0x80
 ? exc_invalid_op+0x17/0x70
 ? asm_exc_invalid_op+0x1a/0x20
 ? dpp3_deferred_update+0x101/0x330 [amdgpu]
 ? __pfx_dpp3_deferred_update+0x10/0x10 [amdgpu]
 dc_post_update_surfaces_to_stream+0x223/0x3d0 [amdgpu]
 amdgpu_dm_atomic_commit_tail+0x2fc5/0x4430 [amdgpu]
 commit_tail+0xac/0x160
 ? srso_alias_return_thunk+0x5/0xfbef5
 process_one_work+0x176/0x330
 worker_thread+0x252/0x390
 ? __pfx_worker_thread+0x10/0x10
 kthread+0xcf/0x100
 ? __pfx_kthread+0x10/0x10
 ret_from_fork+0x31/0x50
 ? __pfx_kthread+0x10/0x10
 ret_from_fork_asm+0x1a/0x30

This happens regardless of whether I use the linux or linux-lts packages.

Anybody else experiencing this issue? Could it be a firmware issue?

Offline

#2 2024-10-17 08:16:51

yataro
Member
Registered: 2024-03-09
Posts: 76

Re: Kernel Oops on AMD Framework shortly after boot

Can you check the GPU temperature during system startup? Could be overheating

Offline

#3 2024-10-17 15:28:11

freeandeasy
Member
Registered: 2024-10-17
Posts: 7

Re: Kernel Oops on AMD Framework shortly after boot

yataro wrote:

Can you check the GPU temperature during system startup? Could be overheating

GPU temperatures are all normal.

Booted into a distro with kernel 6.1 and GNOME and could not reproduce the kernel oops. So the issue looks like it's between the Arch kernels and KDE...

Offline

#4 2024-10-17 15:32:15

freeandeasy
Member
Registered: 2024-10-17
Posts: 7

Re: Kernel Oops on AMD Framework shortly after boot

Update: I tried turning off Variable Refresh Rate in KDE settings and the oops no longer happens. This may be the issue, going to dig further into why this is happening...

Offline

#5 2024-10-18 00:46:58

freeandeasy
Member
Registered: 2024-10-17
Posts: 7

Re: Kernel Oops on AMD Framework shortly after boot

freeandeasy wrote:

Update: I tried turning off Variable Refresh Rate in KDE settings and the oops no longer happens. This may be the issue, going to dig further into why this is happening...

Okay, this was a red herring. The oops is back again. At this point, I'm kind of stumped.

Offline

#6 2024-10-20 13:49:22

per_joe
Member
Registered: 2024-10-20
Posts: 2

Re: Kernel Oops on AMD Framework shortly after boot

i have this error to, did you manage to found a solution on the issue ?

[   27.910398] ------------[ cut here ]------------
WARNING: CPU: 12 PID: 12 at drivers/gpu/drm/amd/amdgpu/../display/dc/dpp/dcn30/dcn30_dpp.c:534 dpp3_deferred_update+0x101/0x330 [amdgpu]
[   27.911005] Modules linked in: dummy nfsv3 nfs_acl bridge stp llc nf_tables libcrc32c rpcsec_gss_krb5 vhost_vsock vmw_vsock_virtio_transport_common auth_rpcgss vhost vhost_iotlb vsock nfsv4 dns_resolver nfs lockd grace sunrpc netfs cmac algif_hash bnep uvcvideo videobuf2_vmalloc uvc videobuf2_memops videobuf2_v4l2 videodev videobuf2_common qrtr vmnet(OE) dm_crypt encrypted_keys trusted asn1_encoder tee blowfish_generic blowfish_x86_64 blowfish_common des_generic des3_ede_x86_64 libdes cast5_avx_x86_64 cast5_generic cast_common cbc lrw camellia_generic camellia_aesni_avx2 camellia_aesni_avx_x86_64 camellia_x86_64 twofish_generic twofish_avx_x86_64 twofish_x86_64_3way twofish_x86_64 twofish_common serpent_avx2 serpent_avx_x86_64 serpent_sse2_x86_64 serpent_generic xts algif_skcipher af_alg r8153_ecm cdc_ether usbnet vfat fat r8152 mii libphy amd_atl intel_rapl_msr intel_rapl_common snd_soc_dmic snd_soc_acp6x_mach snd_acp6x_pdm_dma snd_sof_amd_acp63 snd_sof_amd_vangogh snd_sof_amd_rembrandt snd_sof_amd_renoir
[   27.911074]  snd_sof_amd_acp snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils kvm_amd uas snd_pci_ps usb_storage snd_amd_sdw_acpi iwlmvm snd_hda_codec_realtek kvm soundwire_amd snd_hda_codec_generic soundwire_generic_allocation crct10dif_pclmul snd_hda_scodec_component crc32_pclmul soundwire_bus mac80211 polyval_clmulni snd_hda_codec_hdmi spd5118 polyval_generic snd_soc_core libarc4 ghash_clmulni_intel snd_hda_intel sha512_ssse3 btusb snd_compress snd_usb_audio sha256_ssse3 snd_intel_dspcfg btrtl snd_intel_sdw_acpi ac97_bus sha1_ssse3 snd_usbmidi_lib aesni_intel snd_pcm_dmaengine snd_hda_codec btintel snd_ump snd_rpl_pci_acp6x iwlwifi gf128mul btbcm snd_acp_pci snd_rawmidi crypto_simd snd_hda_core snd_acp_legacy_common btmtk sp5100_tco snd_seq_device cryptd mc snd_pci_acp6x snd_hwdep bluetooth wmi_bmof rapl snd_pcm pcspkr thunderbolt i2c_piix4 k10temp snd_pci_acp5x cfg80211 snd_timer joydev i2c_smbus igc mousedev snd_rn_pci_acp3x snd snd_acp_config ptp snd_soc_acpi soundcore pps_core rfkill ccp snd_pci_acp3x
[   27.911139]  i2c_hid_acpi amd_pmc i2c_hid acpi_tad mac_hid vmmon(OE) vmw_vmci uinput i2c_dev crypto_user loop dm_mod nfnetlink ip_tables x_tables ext4 crc32c_generic mbcache jbd2 hid_generic usbhid amdgpu amdxcp i2c_algo_bit drm_ttm_helper ttm serio_raw drm_exec atkbd gpu_sched drm_suballoc_helper libps2 drm_buddy vivaldi_fmap nvme drm_display_helper nvme_core cec xhci_pci crc32c_intel i8042 video xhci_pci_renesas crc16 nvme_auth serio wmi
[   27.911179] CPU: 12 UID: 0 PID: 12 Comm: kworker/u64:1 Tainted: G           OE      6.11.4-1-MANJARO #1 1400000003000000474e5500b3c5e86de0a24fdd
[   27.911184] Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
[   27.911185] Hardware name: AZW SER/SER, BIOS SER6proMax_P5C8V20 08/15/2023
[   27.911187] Workqueue: events_unbound commit_work
[   27.911193] RIP: 0010:dpp3_deferred_update+0x101/0x330 [amdgpu]
[   27.911403] Code: 83 78 e1 00 00 0f b6 90 a8 02 00 00 48 8b 83 70 e1 00 00 8b b0 78 04 00 00 e8 bb bf 11 00 8b 74 24 04 85 f6 0f 84 5d 01 00 00 <0f> 0b 0f b6 83 48 96 00 00 83 e0 f7 88 83 48 96 00 00 a8 01 0f 84
[   27.911405] RSP: 0018:ffffc0b1c00efba0 EFLAGS: 00010202
[   27.911407] RAX: 0000000000000066 RBX: ffff9ede5a220000 RCX: 0000000000000004
[   27.911408] RDX: 0000000000000000 RSI: 0000000000000002 RDI: ffff9ede59800000
[   27.911410] RBP: ffff9edf4cbc0000 R08: ffffc0b1c00efba4 R09: ffffc0b1c00efbd0
[   27.911411] R10: ffffc0b1c00efa40 R11: 0000000000000000 R12: 0000000000000000
[   27.911412] R13: ffff9edf4cbc40a8 R14: ffff9edf4cbc5f78 R15: ffff9ede9642c600
[   27.911413] FS:  0000000000000000(0000) GS:ffff9ee482000000(0000) knlGS:0000000000000000
[   27.911415] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   27.911416] CR2: 000076930714c000 CR3: 0000000198a5a000 CR4: 0000000000f50ef0
[   27.911417] PKRU: 55555554
[   27.911419] Call Trace:
[   27.911421]  <TASK>
[   27.911423]  ? dpp3_deferred_update+0x101/0x330 [amdgpu 1400000003000000474e5500411a439e8112ea48]
[   27.911626]  ? __warn.cold+0x8e/0xe8
[   27.911628]  ? dpp3_deferred_update+0x101/0x330 [amdgpu 1400000003000000474e5500411a439e8112ea48]
[   27.911833]  ? report_bug+0xff/0x140
[   27.911836]  ? handle_bug+0x3c/0x80
[   27.911839]  ? exc_invalid_op+0x17/0x70
[   27.911841]  ? asm_exc_invalid_op+0x1a/0x20
[   27.911845]  ? dpp3_deferred_update+0x101/0x330 [amdgpu 1400000003000000474e5500411a439e8112ea48]
[   27.912046]  dc_post_update_surfaces_to_stream+0x1b4/0x2b0 [amdgpu 1400000003000000474e5500411a439e8112ea48]
[   27.912234]  amdgpu_dm_atomic_commit_tail+0x2c3b/0x3990 [amdgpu 1400000003000000474e5500411a439e8112ea48]
[   27.912448]  commit_tail+0x94/0x130
[   27.912451]  process_one_work+0x17e/0x330
[   27.912455]  worker_thread+0x2ce/0x3f0
[   27.912458]  ? __pfx_worker_thread+0x10/0x10
[   27.912460]  kthread+0xd2/0x100
[   27.912463]  ? __pfx_kthread+0x10/0x10
[   27.912466]  ret_from_fork+0x34/0x50
[   27.912468]  ? __pfx_kthread+0x10/0x10
[   27.912470]  ret_from_fork_asm+0x1a/0x30
[   27.912476]  </TASK>
[   27.912477] ---[ end trace 0000000000000000 ]---

Last edited by per_joe (2024-10-21 09:51:54)

Offline

#7 2024-10-20 23:17:07

freeandeasy
Member
Registered: 2024-10-17
Posts: 7

Re: Kernel Oops on AMD Framework shortly after boot

Nope, but what desktop environment are you using?

Offline

#8 2024-10-21 07:10:26

seth
Member
Registered: 2012-09-03
Posts: 59,588

Re: Kernel Oops on AMD Framework shortly after boot

@freeandeasy, do you autologin?
Please post your complete system journal for the boot to show the oops in full context for estimations on what might have caused it

sudo journalctl -b | curl -F 'file=@-' 0x0.st

@per_joe, please use [code][/code] tags. Edit your post in this regard.

Offline

#9 2024-10-21 11:15:28

per_joe
Member
Registered: 2024-10-20
Posts: 2

Re: Kernel Oops on AMD Framework shortly after boot

freeandeasy wrote:

Nope, but what desktop environment are you using?

Was running KDE version 6.2.1 on kernel 6.11.4-1, but i have just downgraded KDE to version 6.1.5 on kernel 6.11.2-4,  and that have seem to fixed the issue. ( running Manjaro not Arch ), but we saw the exact same issue.

Last edited by per_joe (2024-10-21 11:23:03)

Offline

#10 2024-10-21 23:26:53

freeandeasy
Member
Registered: 2024-10-17
Posts: 7

Re: Kernel Oops on AMD Framework shortly after boot

seth wrote:

@freeandeasy, do you autologin?
Please post your complete system journal for the boot to show the oops in full context for estimations on what might have caused it

sudo journalctl -b | curl -F 'file=@-' 0x0.st

@per_joe, please use [code][/code] tags. Edit your post in this regard.

I do not autologin. Here's the output from journalctl from my most recent boot (0x0.st is currently down):

https://termbin.com/dis0

Offline

#11 2024-10-22 14:30:07

seth
Member
Registered: 2012-09-03
Posts: 59,588

Re: Kernel Oops on AMD Framework shortly after boot

I assume https://community.frame.work/t/arch-lin … /44854/194 and https://bugs.kde.org/show_bug.cgi?id=490619 are you?

There's also https://lore.kernel.org/lkml/b110ad59-f … @gmx.de/T/

https://github.com/torvalds/linux/blob/ … dpp.c#L519
The function seems to deal w/ color correction.

Do you get this w/ any other compositor (weston, sway) or X11?

Offline

#12 2024-10-29 15:13:30

freeandeasy
Member
Registered: 2024-10-17
Posts: 7

Re: Kernel Oops on AMD Framework shortly after boot

Quick update:

I don't know what exactly happened, but with the latest KWin (6.2.2) and latest Linux kernel (6.11.5) this kernel oops has gone away.
Additionally, I am loading a custom ICC profile in Display settings in KDE system settings now, since the above post suggested that the problematic function was color correction related and I figured this was adjacent enough to give it a shot. No idea if doing this helped or solved the issue either.

Offline

#13 2024-12-01 17:56:33

Karcsesz
Member
Registered: 2024-12-01
Posts: 1

Re: Kernel Oops on AMD Framework shortly after boot

Ran into the same issue with kernel 6.12.1-arch1-1 running Plasma 6.2.4 Wayland on a Framework 13.

I can confirm that loading the ICC profile freeandeasy linked through Display Settings is what fixed the issue for me. Going back to no profile causes the message to come up again.

Offline

Board footer

Powered by FluxBB