You are not logged in.

#1 2024-04-30 05:24:08

cfr
Member
From: Cymru
Registered: 2011-11-27
Posts: 7,143

6.8.7.arch1-1 BUG: scheduling while atomic; watchdog: BUG: soft lockup

I'm been having freezes and lockups which I thought were due to KDE 6 (and I'm pretty sure *some* of it *is* due to KDE 6). However, since updating the kernel to 6.8.7-arch.1-1,  I clearly have more fundamental problems.

I expected to find a thread concerning similar problems with 6.8.7.arch1-1, but haven't found anything. What's the recommended way to diagnose and/or address this?

Looking back at the log, I have a lot of instances similar to the following:

Ebr 29 15:57:34 MyComputer kernel: e1000e 0000:00:1f.6 eth0: NIC Link is Up 1000 Mbps Half Duplex, Flow Control: None
Ebr 29 15:57:34 MyComputer kernel: BUG: scheduling while atomic: kworker/2:1/5797/0x00000002
Ebr 29 15:57:34 MyComputer kernel: Modules linked in: 8021q garp mrp stp llc ccm uvcvideo videobuf2_vmalloc uvc videobuf2_memops ip6t_REJECT btusb videobuf2_
v4l2 nf_reject_ipv6 ip6table_filter btrtl ip6_tables videodev btintel btbcm videobuf2_common btmtk ipt_REJECT mc nf_reject_ipv4 xt_LOG nf_log_syslog bluetoot
h xt_limit xt_recent xt_tcpudp xt_conntrack ecdh_generic nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c iptable_filter joydev mousedev intel_rapl_msr s
nd_soc_avs snd_soc_hda_codec intel_rapl_common snd_soc_skl intel_uncore_frequency intel_uncore_frequency_common snd_soc_hdac_hda snd_hda_ext_core intel_pmc_c
ore_pltdrv snd_soc_sst_ipc intel_pmc_core snd_soc_sst_dsp intel_vsec pmt_telemetry snd_soc_acpi_intel_match snd_hda_codec_hdmi snd_soc_acpi pmt_class intel_t
cc_cooling x86_pkg_temp_thermal intel_powerclamp snd_soc_core coretemp snd_ctl_led snd_compress ac97_bus snd_hda_codec_realtek iwlmvm tps6598x kvm_intel snd_
pcm_dmaengine snd_hda_codec_generic iTCO_wdt intel_pmc_bxt typec ee1004 mei_pxp mei_hdcp iTCO_vendor_support
Ebr 29 15:57:34 MyComputer kernel:  snd_hda_intel mac80211 kvm libarc4 irqbypass rapl snd_intel_dspcfg snd_intel_sdw_acpi intel_cstate think_lmi nls_iso8859_
1 intel_uncore snd_hda_codec iwlwifi intel_wmi_thunderbolt vfat wmi_bmof firmware_attributes_class e1000e pcspkr fat psmouse i2c_i801 thinkpad_acpi snd_hda_c
ore ptp cfg80211 ledtrig_audio snd_hwdep i2c_smbus pps_core platform_profile snd_pcm snd_timer rfkill intel_lpss_pci mei_me intel_lpss intel_pch_thermal idma
64 intel_xhci_usb_role_switch mei roles serial_multi_instantiate snd soundcore acpi_pad mac_hid i2c_dev crypto_user acpi_call(OE) loop fuse nfnetlink ip_tabl
es x_tables ext4 crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted asn1_encoder tee dm_mod crct10dif_pclmul crc32_pclmul crc32c_intel pol
yval_clmulni polyval_generic gf128mul ghash_clmulni_intel sha512_ssse3 serio_raw atkbd sha256_ssse3 rtsx_pci_sdmmc sha1_ssse3 libps2 mmc_core vivaldi_fmap ae
sni_intel nvme crypto_simd cryptd nvme_core xhci_pci rtsx_pci nvme_auth xhci_pci_renesas i8042 serio i915
Ebr 29 15:57:34 MyComputer kernel:  i2c_algo_bit drm_buddy video wmi ttm intel_gtt drm_display_helper cec
Ebr 29 15:57:34 MyComputer kernel: CPU: 2 PID: 5797 Comm: kworker/2:1 Tainted: G     U     OE      6.8.7-arch1-1 #1 cb8440eaa48704794690ea311c777c18c4e95af9
Ebr 29 15:57:34 MyComputer kernel: Hardware name: LENOVO 20HNCTO1WW/20HNCTO1WW, BIOS R0IET68W (1.46 ) 07/04/2022
Ebr 29 15:57:34 MyComputer kernel: Workqueue: events linkwatch_event
Ebr 29 15:57:34 MyComputer kernel: Call Trace:
Ebr 29 15:57:34 MyComputer kernel:  <TASK>
Ebr 29 15:57:34 MyComputer kernel:  dump_stack_lvl+0x64/0x80
Ebr 29 15:57:34 MyComputer kernel:  __schedule_bug+0x56/0x70
Ebr 29 15:57:34 MyComputer kernel:  __schedule+0x10f0/0x1520
Ebr 29 15:57:34 MyComputer kernel:  ? ttwu_do_activate+0x64/0x200
Ebr 29 15:57:34 MyComputer kernel:  schedule+0x32/0xd0
Ebr 29 15:57:34 MyComputer kernel:  schedule_hrtimeout_range_clock+0xbe/0x140
Ebr 29 15:57:34 MyComputer kernel:  ? __pfx_hrtimer_wakeup+0x10/0x10
Ebr 29 15:57:34 MyComputer kernel:  usleep_range_state+0x64/0x90
Ebr 29 15:57:34 MyComputer kernel:  e1000e_read_phy_reg_mdic+0x87/0x280 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Ebr 29 15:57:34 MyComputer kernel:  e1000e_update_stats+0x513/0x730 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Ebr 29 15:57:34 MyComputer kernel:  e1000e_get_stats64+0x2e/0x120 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Ebr 29 15:57:34 MyComputer kernel:  dev_get_stats+0x5d/0x110
Ebr 29 15:57:34 MyComputer kernel:  rtnl_fill_stats+0x3b/0x130
Ebr 29 15:57:34 MyComputer kernel:  rtnl_fill_ifinfo+0x872/0x1580
Ebr 29 15:57:34 MyComputer kernel:  ? __alloc_skb+0x8c/0x1a0
Ebr 29 15:57:34 MyComputer kernel:  rtmsg_ifinfo_build_skb+0xae/0x120
Ebr 29 15:57:34 MyComputer kernel:  rtmsg_ifinfo+0x3c/0x90
Ebr 29 15:57:34 MyComputer kernel:  netdev_state_change+0x89/0x90
Ebr 29 15:57:34 MyComputer kernel:  linkwatch_do_dev+0x4f/0x60
Ebr 29 15:57:34 MyComputer kernel:  __linkwatch_run_queue+0xe1/0x260
Ebr 29 15:57:34 MyComputer kernel:  linkwatch_event+0x31/0x40
Ebr 29 15:57:34 MyComputer kernel:  process_one_work+0x178/0x350
Ebr 29 15:57:34 MyComputer kernel:  worker_thread+0x30f/0x450
Ebr 29 15:57:34 MyComputer kernel:  ? __pfx_worker_thread+0x10/0x10
Ebr 29 15:57:34 MyComputer kernel:  kthread+0xe5/0x120
Ebr 29 15:57:34 MyComputer kernel:  ? __pfx_kthread+0x10/0x10
Ebr 29 15:57:34 MyComputer kernel:  ret_from_fork+0x31/0x50
Ebr 29 15:57:34 MyComputer kernel:  ? __pfx_kthread+0x10/0x10
Ebr 29 15:57:34 MyComputer kernel:  ret_from_fork_asm+0x1b/0x30
Ebr 29 15:57:34 MyComputer kernel:  </TASK>

I then eventually get a complete lockup:

Ebr 30 04:52:04 MyComputer kernel: iwlwifi 0000:03:00.0: Queue 10 is active on fifo 1 and stuck for 10000 ms. SW [131, 136] HW [136, 136] FH TRB=0x0c010a087
Ebr 30 04:52:14 MyComputer kernel: watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [kworker/0:1:13315]
Ebr 30 04:52:14 MyComputer kernel: Modules linked in: 8021q garp mrp stp llc ccm uvcvideo videobuf2_vmalloc uvc videobuf2_memops ip6t_REJECT btusb videobuf2_v4l2 nf_reject_ipv6 ip6table_filter btrtl ip6_tables videodev btintel btbcm videobuf2_common btmtk ipt_REJECT mc nf_reject_ipv4 xt_LOG nf_log_syslog bluetooth xt_limit xt_recent xt_tcpudp xt_conntrack ecdh_generic nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c iptable_filter joydev mousedev intel_rapl_msr snd_soc_avs snd_soc_hda_codec intel_rapl_common snd_soc_skl intel_uncore_frequency intel_uncore_frequency_common snd_soc_hdac_hda snd_hda_ext_core intel_pmc_core_pltdrv snd_soc_sst_ipc intel_pmc_core snd_soc_sst_dsp intel_vsec pmt_telemetry snd_soc_acpi_intel_match snd_hda_codec_hdmi snd_soc_acpi pmt_class intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp snd_soc_core coretemp snd_ctl_led snd_compress ac97_bus snd_hda_codec_realtek iwlmvm tps6598x kvm_intel snd_pcm_dmaengine snd_hda_codec_generic iTCO_wdt intel_pmc_bxt typec ee1004 mei_pxp mei_hdcp iTCO_vendor_support
Ebr 30 04:52:14 MyComputer kernel:  snd_hda_intel mac80211 kvm libarc4 irqbypass rapl snd_intel_dspcfg snd_intel_sdw_acpi intel_cstate think_lmi nls_iso8859_1 intel_uncore snd_hda_codec iwlwifi intel_wmi_thunderbolt vfat wmi_bmof firmware_attributes_class e1000e pcspkr fat psmouse i2c_i801 thinkpad_acpi snd_hda_core ptp cfg80211 ledtrig_audio snd_hwdep i2c_smbus pps_core platform_profile snd_pcm snd_timer rfkill intel_lpss_pci mei_me intel_lpss intel_pch_thermal idma64 intel_xhci_usb_role_switch mei roles serial_multi_instantiate snd soundcore acpi_pad mac_hid i2c_dev crypto_user acpi_call(OE) loop fuse nfnetlink ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted asn1_encoder tee dm_mod crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni polyval_generic gf128mul ghash_clmulni_intel sha512_ssse3 serio_raw atkbd sha256_ssse3 rtsx_pci_sdmmc sha1_ssse3 libps2 mmc_core vivaldi_fmap aesni_intel nvme crypto_simd cryptd nvme_core xhci_pci rtsx_pci nvme_auth xhci_pci_renesas i8042 serio i915
Ebr 30 04:52:14 MyComputer kernel:  i2c_algo_bit drm_buddy video wmi ttm intel_gtt drm_display_helper cec
Ebr 30 04:52:14 MyComputer kernel: CPU: 0 PID: 13315 Comm: kworker/0:1 Tainted: G     U  W  OE      6.8.7-arch1-1 #1 cb8440eaa48704794690ea311c777c18c4e95af9
Ebr 30 04:52:14 MyComputer kernel: Hardware name: LENOVO 20HNCTO1WW/20HNCTO1WW, BIOS R0IET68W (1.46 ) 07/04/2022
Ebr 30 04:52:14 MyComputer kernel: Workqueue: pm pm_runtime_work
Ebr 30 04:52:14 MyComputer kernel: RIP: 0010:native_queued_spin_lock_slowpath+0x223/0x2e0
Ebr 30 04:52:14 MyComputer kernel: Code: c5 01 41 c1 e4 10 41 c1 e5 12 45 09 ec 44 89 e0 c1 e8 10 66 87 43 02 89 c2 c1 e2 10 81 fa ff ff 00 00 77 5e 31 d2 eb02 f3 90 <8b> 03 66 85 c0 75 f7 44 39 e0 0f 84 8e 00 00 00 c6 03 01 48 85 d2
Ebr 30 04:52:14 MyComputer kernel: RSP: 0018:ffff9f69cc0f7cd0 EFLAGS: 00000202
Ebr 30 04:52:14 MyComputer kernel: RAX: 0000000000040101 RBX: ffff8f8e0897f428 RCX: ffff8f8e071c6408
Ebr 30 04:52:14 MyComputer kernel: RDX: 0000000000000000 RSI: 0000000000000101 RDI: ffff8f8e0897f428
Ebr 30 04:52:14 MyComputer kernel: RBP: ffff8f8f37435640 R08: ffff8f8e071c6400 R09: 0000000000000000
Ebr 30 04:52:14 MyComputer kernel: R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000040000
Ebr 30 04:52:14 MyComputer kernel: R13: 0000000000040000 R14: 0000000004008000 R15: ffff8f8effed2400
Ebr 30 04:52:14 MyComputer kernel: FS:  0000000000000000(0000) GS:ffff8f8f37400000(0000) knlGS:0000000000000000
Ebr 30 04:52:14 MyComputer kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Ebr 30 04:52:14 MyComputer kernel: CR2: 00007d4534b5f000 CR3: 0000000146220006 CR4: 00000000003706f0
Ebr 30 04:52:14 MyComputer kernel: Call Trace:
Ebr 30 04:52:14 MyComputer kernel:  <IRQ>
Ebr 30 04:52:14 MyComputer kernel:  ? watchdog_timer_fn+0x1e6/0x270
Ebr 30 04:52:14 MyComputer kernel:  ? __pfx_watchdog_timer_fn+0x10/0x10
Ebr 30 04:52:14 MyComputer kernel:  ? __hrtimer_run_queues+0x10f/0x2b0
Ebr 30 04:52:14 MyComputer kernel:  ? hrtimer_interrupt+0xf8/0x230
Ebr 30 04:52:14 MyComputer kernel:  ? __sysvec_apic_timer_interrupt+0x4d/0x140
Ebr 30 04:52:14 MyComputer kernel:  ? sysvec_apic_timer_interrupt+0x6d/0x90
Ebr 30 04:52:14 MyComputer kernel:  </IRQ>
Ebr 30 04:52:14 MyComputer kernel:  <TASK>
Ebr 30 04:52:14 MyComputer kernel:  ? asm_sysvec_apic_timer_interrupt+0x1a/0x20
Ebr 30 04:52:14 MyComputer kernel:  ? native_queued_spin_lock_slowpath+0x223/0x2e0
Ebr 30 04:52:14 MyComputer kernel:  _raw_spin_lock+0x29/0x30
Ebr 30 04:52:14 MyComputer kernel:  e1000e_down+0xf2/0x290 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Ebr 30 04:52:14 MyComputer kernel:  e1000e_pm_runtime_suspend+0x73/0xd0 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Ebr 30 04:52:14 MyComputer kernel:  pci_pm_runtime_suspend+0x67/0x1e0
Ebr 30 04:52:14 MyComputer kernel:  ? __pfx_pci_pm_runtime_suspend+0x10/0x10
Ebr 30 04:52:14 MyComputer kernel:  __rpm_callback+0x41/0x170
Ebr 30 04:52:14 MyComputer kernel:  rpm_callback+0x35/0x70
Ebr 30 04:52:14 MyComputer kernel:  ? __pfx_pci_pm_runtime_suspend+0x10/0x10
Ebr 30 04:52:14 MyComputer kernel:  rpm_suspend+0x120/0x6a0
Ebr 30 04:52:14 MyComputer kernel:  ? __intel_display_power_put_async+0x9a/0x120 [i915 a122c14c472c7e351ecb2fafc8151904ddc54c07]
Ebr 30 04:52:14 MyComputer kernel:  pm_runtime_work+0x57/0xb0
Ebr 30 04:52:14 MyComputer kernel:  process_one_work+0x178/0x350
Ebr 30 04:52:14 MyComputer kernel:  worker_thread+0x30f/0x450
Ebr 30 04:52:14 MyComputer kernel:  ? __pfx_worker_thread+0x10/0x10
Ebr 30 04:52:14 MyComputer kernel:  kthread+0xe5/0x120
Ebr 30 04:52:14 MyComputer kernel:  ? __pfx_kthread+0x10/0x10
Ebr 30 04:52:14 MyComputer kernel:  ret_from_fork+0x31/0x50
Ebr 30 04:52:14 MyComputer kernel:  ? __pfx_kthread+0x10/0x10
Ebr 30 04:52:14 MyComputer kernel:  ret_from_fork_asm+0x1b/0x30
Ebr 30 04:52:14 MyComputer kernel:  </TASK>
Ebr 30 04:52:15 MyComputer kernel: watchdog: BUG: soft lockup - CPU#3 stuck for 26s! [kworker/3:2:13107]
Ebr 30 04:52:15 MyComputer kernel: Modules linked in: 8021q garp mrp stp llc ccm uvcvideo videobuf2_vmalloc uvc videobuf2_memops ip6t_REJECT btusb videobuf2_v4l2 nf_reject_ipv6 ip6table_filter btrtl ip6_tables videodev btintel btbcm videobuf2_common btmtk ipt_REJECT mc nf_reject_ipv4 xt_LOG nf_log_syslog bluetooth xt_limit xt_recent xt_tcpudp xt_conntrack ecdh_generic nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c iptable_filter joydev mousedev intel_rapl_msr snd_soc_avs snd_soc_hda_codec intel_rapl_common snd_soc_skl intel_uncore_frequency intel_uncore_frequency_common snd_soc_hdac_hda snd_hda_ext_core intel_pmc_core_pltdrv snd_soc_sst_ipc intel_pmc_core snd_soc_sst_dsp intel_vsec pmt_telemetry snd_soc_acpi_intel_match snd_hda_codec_hdmi snd_soc_acpi pmt_class intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp snd_soc_core coretemp snd_ctl_led snd_compress ac97_bus snd_hda_codec_realtek iwlmvm tps6598x kvm_intel snd_pcm_dmaengine snd_hda_codec_generic iTCO_wdt intel_pmc_bxt typec ee1004 mei_pxp mei_hdcp iTCO_vendor_support
Ebr 30 04:52:15 MyComputer kernel:  snd_hda_intel mac80211 kvm libarc4 irqbypass rapl snd_intel_dspcfg snd_intel_sdw_acpi intel_cstate think_lmi nls_iso8859_1 intel_uncore snd_hda_codec iwlwifi intel_wmi_thunderbolt vfat wmi_bmof firmware_attributes_class e1000e pcspkr fat psmouse i2c_i801 thinkpad_acpi snd_hda_core ptp cfg80211 ledtrig_audio snd_hwdep i2c_smbus pps_core platform_profile snd_pcm snd_timer rfkill intel_lpss_pci mei_me intel_lpss intel_pch_thermal idma64 intel_xhci_usb_role_switch mei roles serial_multi_instantiate snd soundcore acpi_pad mac_hid i2c_dev crypto_user acpi_call(OE) loop fuse nfnetlink ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted asn1_encoder tee dm_mod crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni polyval_generic gf128mul ghash_clmulni_intel sha512_ssse3 serio_raw atkbd sha256_ssse3 rtsx_pci_sdmmc sha1_ssse3 libps2 mmc_core vivaldi_fmap aesni_intel nvme crypto_simd cryptd nvme_core xhci_pci rtsx_pci nvme_auth xhci_pci_renesas i8042 serio i915
Ebr 30 04:52:15 MyComputer kernel:  i2c_algo_bit drm_buddy video wmi ttm intel_gtt drm_display_helper cec
Ebr 30 04:52:15 MyComputer kernel: CPU: 3 PID: 13107 Comm: kworker/3:2 Tainted: G     U  W  OEL     6.8.7-arch1-1 #1 cb8440eaa48704794690ea311c777c18c4e95af9
Ebr 30 04:52:15 MyComputer kernel: Hardware name: LENOVO 20HNCTO1WW/20HNCTO1WW, BIOS R0IET68W (1.46 ) 07/04/2022
Ebr 30 04:52:15 MyComputer kernel: Workqueue: events linkwatch_event
Ebr 30 04:52:15 MyComputer kernel: RIP: 0010:native_queued_spin_lock_slowpath+0x6e/0x2e0
Ebr 30 04:52:15 MyComputer kernel: Code: 77 7f f0 0f ba 2b 08 0f 92 c2 8b 03 0f b6 d2 c1 e2 08 30 e4 09 d0 3d ff 00 00 00 77 5b 85 c0 74 10 0f b6 03 84 c0 7409 f3 90 <0f> b6 03 84 c0 75 f7 b8 01 00 00 00 66 89 03 65 48 ff 05 c3 48 05
Ebr 30 04:52:15 MyComputer kernel: RSP: 0018:ffff9f69cbb0bbc0 EFLAGS: 00000202
Ebr 30 04:52:15 MyComputer kernel: RAX: 0000000000000001 RBX: ffff8f8e0897f428 RCX: 0000000000000000
Ebr 30 04:52:15 MyComputer kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff8f8e0897f428
Ebr 30 04:52:15 MyComputer kernel: RBP: ffff8f8e28a8f134 R08: 0000000000000000 R09: 0000000000000004
Ebr 30 04:52:15 MyComputer kernel: R10: ffff8f8e28a8f1fc R11: 0000000000000010 R12: ffff8f8e0897f428
Ebr 30 04:52:15 MyComputer kernel: R13: ffff8f8e28a8f000 R14: 0000000000000000 R15: 0000000000000000
Ebr 30 04:52:15 MyComputer kernel: FS:  0000000000000000(0000) GS:ffff8f8f37580000(0000) knlGS:0000000000000000
Ebr 30 04:52:15 MyComputer kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Ebr 30 04:52:15 MyComputer kernel: CR2: 00003a6c0213c000 CR3: 00000001dd200002 CR4: 00000000003706f0
Ebr 30 04:52:15 MyComputer kernel: Call Trace:
Ebr 30 04:52:15 MyComputer kernel:  <IRQ>
Ebr 30 04:52:15 MyComputer kernel:  ? watchdog_timer_fn+0x1e6/0x270
Ebr 30 04:52:15 MyComputer kernel:  ? __pfx_watchdog_timer_fn+0x10/0x10
Ebr 30 04:52:15 MyComputer kernel:  ? __hrtimer_run_queues+0x10f/0x2b0
Ebr 30 04:52:15 MyComputer kernel:  ? hrtimer_interrupt+0xf8/0x230
Ebr 30 04:52:15 MyComputer kernel:  ? __sysvec_apic_timer_interrupt+0x4d/0x140
Ebr 30 04:52:15 MyComputer kernel:  ? sysvec_apic_timer_interrupt+0x6d/0x90
Ebr 30 04:52:15 MyComputer kernel:  </IRQ>
Ebr 30 04:52:15 MyComputer kernel:  <TASK>
Ebr 30 04:52:15 MyComputer kernel:  ? asm_sysvec_apic_timer_interrupt+0x1a/0x20
Ebr 30 04:52:15 MyComputer kernel:  ? native_queued_spin_lock_slowpath+0x6e/0x2e0
Ebr 30 04:52:15 MyComputer kernel:  _raw_spin_lock+0x29/0x30
Ebr 30 04:52:15 MyComputer kernel:  e1000e_get_stats64+0x22/0x120 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Ebr 30 04:52:15 MyComputer kernel:  dev_get_stats+0x5d/0x110
Ebr 30 04:52:15 MyComputer kernel:  rtnl_fill_stats+0x3b/0x130
Ebr 30 04:52:15 MyComputer kernel:  rtnl_fill_ifinfo+0x872/0x1580
Ebr 30 04:52:15 MyComputer kernel:  ? __alloc_skb+0x8c/0x1a0
Ebr 30 04:52:15 MyComputer kernel:  rtmsg_ifinfo_build_skb+0xae/0x120
Ebr 30 04:52:15 MyComputer kernel:  rtmsg_ifinfo+0x3c/0x90
Ebr 30 04:52:15 MyComputer kernel:  netdev_state_change+0x89/0x90
Ebr 30 04:52:15 MyComputer kernel:  linkwatch_do_dev+0x4f/0x60
Ebr 30 04:52:15 MyComputer kernel:  __linkwatch_run_queue+0xe1/0x260
Ebr 30 04:52:15 MyComputer kernel:  linkwatch_event+0x31/0x40
Ebr 30 04:52:15 MyComputer kernel:  process_one_work+0x178/0x350
Ebr 30 04:52:15 MyComputer kernel:  worker_thread+0x30f/0x450
Ebr 30 04:52:15 MyComputer kernel:  ? __pfx_worker_thread+0x10/0x10
Ebr 30 04:52:15 MyComputer kernel:  kthread+0xe5/0x120
Ebr 30 04:52:15 MyComputer kernel:  ? __pfx_kthread+0x10/0x10
Ebr 30 04:52:15 MyComputer kernel:  ret_from_fork+0x31/0x50
Ebr 30 04:52:15 MyComputer kernel:  ? __pfx_kthread+0x10/0x10
Ebr 30 04:52:15 MyComputer kernel:  ret_from_fork_asm+0x1b/0x30
Ebr 30 04:52:15 MyComputer kernel:  </TASK>
Ebr 30 04:52:42 MyComputer kernel: watchdog: BUG: soft lockup - CPU#0 stuck for 48s! [kworker/0:1:13315]
Ebr 30 04:52:42 MyComputer kernel: Modules linked in: 8021q garp mrp stp llc ccm uvcvideo videobuf2_vmalloc uvc videobuf2_memops ip6t_REJECT btusb videobuf2_v4l2 nf_reject_ipv6 ip6table_filter btrtl ip6_tables videodev btintel btbcm videobuf2_common btmtk ipt_REJECT mc nf_reject_ipv4 xt_LOG nf_log_syslog bluetooth xt_limit xt_recent xt_tcpudp xt_conntrack ecdh_generic nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c iptable_filter joydev mousedev intel_rapl_msr snd_soc_avs snd_soc_hda_codec intel_rapl_common snd_soc_skl intel_uncore_frequency intel_uncore_frequency_common snd_soc_hdac_hda snd_hda_ext_core intel_pmc_core_pltdrv snd_soc_sst_ipc intel_pmc_core snd_soc_sst_dsp intel_vsec pmt_telemetry snd_soc_acpi_intel_match snd_hda_codec_hdmi snd_soc_acpi pmt_class intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp snd_soc_core coretemp snd_ctl_led snd_compress ac97_bus snd_hda_codec_realtek iwlmvm tps6598x kvm_intel snd_pcm_dmaengine snd_hda_codec_generic iTCO_wdt intel_pmc_bxt typec ee1004 mei_pxp mei_hdcp iTCO_vendor_support
Ebr 30 04:52:42 MyComputer kernel:  snd_hda_intel mac80211 kvm libarc4 irqbypass rapl snd_intel_dspcfg snd_intel_sdw_acpi intel_cstate think_lmi nls_iso8859_1 intel_uncore snd_hda_codec iwlwifi intel_wmi_thunderbolt vfat wmi_bmof firmware_attributes_class e1000e pcspkr fat psmouse i2c_i801 thinkpad_acpi snd_hda_core ptp cfg80211 ledtrig_audio snd_hwdep i2c_smbus pps_core platform_profile snd_pcm snd_timer rfkill intel_lpss_pci mei_me intel_lpss intel_pch_thermal idma64 intel_xhci_usb_role_switch mei roles serial_multi_instantiate snd soundcore acpi_pad mac_hid i2c_dev crypto_user acpi_call(OE) loop fuse nfnetlink ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys trusted asn1_encoder tee dm_mod crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni polyval_generic gf128mul ghash_clmulni_intel sha512_ssse3 serio_raw atkbd sha256_ssse3 rtsx_pci_sdmmc sha1_ssse3 libps2 mmc_core vivaldi_fmap aesni_intel nvme crypto_simd cryptd nvme_core xhci_pci rtsx_pci nvme_auth xhci_pci_renesas i8042 serio i915
Ebr 30 04:52:42 MyComputer kernel:  i2c_algo_bit drm_buddy video wmi ttm intel_gtt drm_display_helper cec
Ebr 30 04:52:42 MyComputer kernel: CPU: 0 PID: 13315 Comm: kworker/0:1 Tainted: G     U  W  OEL     6.8.7-arch1-1 #1 cb8440eaa48704794690ea311c777c18c4e95af9
Ebr 30 04:52:42 MyComputer kernel: Hardware name: LENOVO 20HNCTO1WW/20HNCTO1WW, BIOS R0IET68W (1.46 ) 07/04/2022
Ebr 30 04:52:42 MyComputer kernel: Workqueue: pm pm_runtime_work
Ebr 30 04:52:42 MyComputer kernel: RIP: 0010:native_queued_spin_lock_slowpath+0x223/0x2e0
Ebr 30 04:52:42 MyComputer kernel: Code: c5 01 41 c1 e4 10 41 c1 e5 12 45 09 ec 44 89 e0 c1 e8 10 66 87 43 02 89 c2 c1 e2 10 81 fa ff ff 00 00 77 5e 31 d2 eb02 f3 90 <8b> 03 66 85 c0 75 f7 44 39 e0 0f 84 8e 00 00 00 c6 03 01 48 85 d2
Ebr 30 04:52:42 MyComputer kernel: RSP: 0018:ffff9f69cc0f7cd0 EFLAGS: 00000202
Ebr 30 04:52:42 MyComputer kernel: RAX: 0000000000040101 RBX: ffff8f8e0897f428 RCX: ffff8f8e071c6408
Ebr 30 04:52:42 MyComputer kernel: RDX: 0000000000000000 RSI: 0000000000000101 RDI: ffff8f8e0897f428
Ebr 30 04:52:42 MyComputer kernel: RBP: ffff8f8f37435640 R08: ffff8f8e071c6400 R09: 0000000000000000
Ebr 30 04:52:42 MyComputer kernel: R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000040000
Ebr 30 04:52:42 MyComputer kernel: R13: 0000000000040000 R14: 0000000004008000 R15: ffff8f8effed2400
Ebr 30 04:52:42 MyComputer kernel: FS:  0000000000000000(0000) GS:ffff8f8f37400000(0000) knlGS:0000000000000000
Ebr 30 04:52:42 MyComputer kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Ebr 30 04:52:42 MyComputer kernel: CR2: 00007d4534b5f000 CR3: 0000000146220006 CR4: 00000000003706f0
Ebr 30 04:52:42 MyComputer kernel: Call Trace:
Ebr 30 04:52:42 MyComputer kernel:  <IRQ>
Ebr 30 04:52:42 MyComputer kernel:  ? watchdog_timer_fn+0x1e6/0x270
Ebr 30 04:52:42 MyComputer kernel:  ? __pfx_watchdog_timer_fn+0x10/0x10
Ebr 30 04:52:42 MyComputer kernel:  ? __hrtimer_run_queues+0x10f/0x2b0
Ebr 30 04:52:42 MyComputer kernel:  ? hrtimer_interrupt+0xf8/0x230
Ebr 30 04:52:42 MyComputer kernel:  ? __sysvec_apic_timer_interrupt+0x4d/0x140
Ebr 30 04:52:42 MyComputer kernel:  ? sysvec_apic_timer_interrupt+0x6d/0x90
Ebr 30 04:52:42 MyComputer kernel:  </IRQ>
Ebr 30 04:52:42 MyComputer kernel:  <TASK>
Ebr 30 04:52:42 MyComputer kernel:  ? asm_sysvec_apic_timer_interrupt+0x1a/0x20
Ebr 30 04:52:42 MyComputer kernel:  ? native_queued_spin_lock_slowpath+0x223/0x2e0
Ebr 30 04:52:42 MyComputer kernel:  _raw_spin_lock+0x29/0x30
Ebr 30 04:52:42 MyComputer kernel:  e1000e_down+0xf2/0x290 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Ebr 30 04:52:42 MyComputer kernel:  e1000e_pm_runtime_suspend+0x73/0xd0 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Ebr 30 04:52:42 MyComputer kernel:  pci_pm_runtime_suspend+0x67/0x1e0
Ebr 30 04:52:42 MyComputer kernel:  ? __pfx_pci_pm_runtime_suspend+0x10/0x10
Ebr 30 04:52:42 MyComputer kernel:  __rpm_callback+0x41/0x170
Ebr 30 04:52:42 MyComputer kernel:  rpm_callback+0x35/0x70
Ebr 30 04:52:42 MyComputer kernel:  ? __pfx_pci_pm_runtime_suspend+0x10/0x10
Ebr 30 04:52:42 MyComputer kernel:  rpm_suspend+0x120/0x6a0
Ebr 30 04:52:42 MyComputer kernel:  ? __intel_display_power_put_async+0x9a/0x120 [i915 a122c14c472c7e351ecb2fafc8151904ddc54c07]
Ebr 30 04:52:42 MyComputer kernel:  pm_runtime_work+0x57/0xb0
Ebr 30 04:52:42 MyComputer kernel:  process_one_work+0x178/0x350
Ebr 30 04:52:42 MyComputer kernel:  worker_thread+0x30f/0x450
Ebr 30 04:52:42 MyComputer kernel:  ? __pfx_worker_thread+0x10/0x10
Ebr 30 04:52:42 MyComputer kernel:  kthread+0xe5/0x120
Ebr 30 04:52:42 MyComputer kernel:  ? __pfx_kthread+0x10/0x10
Ebr 30 04:52:42 MyComputer kernel:  ret_from_fork+0x31/0x50
Ebr 30 04:52:42 MyComputer kernel:  ? __pfx_kthread+0x10/0x10
Ebr 30 04:52:42 MyComputer kernel:  ret_from_fork_asm+0x1b/0x30
Ebr 30 04:52:42 MyComputer kernel:  </TASK>
Ebr 30 04:52:43 MyComputer kernel: watchdog: BUG: soft lockup - CPU#3 stuck for 52s! [kworker/3:2:13107]
...

which repeats with minor variations until I hard reboot the machine. The machine is *not* totally unresponsive at this point. I was able to see these lines in the journal, could use some commands successfully and switch tabs or applications in some cases. But `ip addr` hung and could not be interrupted (even using `kill` etc.) and `reboot` failed with the complaint that it could not connect to the `init daemon`. (I'm sorry - I forgot to copy or note the exact text. I remembered partly because I thought `systemd` was my `init` daemon.)

Right before the first occurrence of `BUG` in the journal, an attempt to update `fangfrisch` goes haywire. This fills the journal with errors. However, this is not especially unusual: the same thing happens whenever the timer triggers in the absence of a network connection. The code is apparently unable to handle this case and its attempts to handle the error just spawn yet further errors, which its attempts to handle spawn yet more errors and so on - not quite *ad infinitum*, but not far off. This is a snippet, but I can post more if that might be informative.

...
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: During handling of the above exception, another exception occurred:
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: Traceback (most recent call last):
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/fangfrisch/refresh.py", line 120, in refresh
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     digest = get_digest(ci)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:              ^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/fangfrisch/download.py", line 73, in get_digest
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     download = _download(f'{ci.url}.{ci.check}', max_size, ci.connection_timeout)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/fangfrisch/download.py", line 60, in _download
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     response = _session.get(url, stream=True, timeout=timeout)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/requests/sessions.py", line 602, in get
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     return self.request("GET", url, **kwargs)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/requests/sessions.py", line 589, in request
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     resp = self.send(prep, **send_kwargs)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/requests/sessions.py", line 703, in send
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     r = adapter.send(request, **kwargs)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/requests/adapters.py", line 519, in send
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     raise ConnectionError(e, request=request)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: requests.exceptions.ConnectionError: HTTPConnectionPool(host='mirror.sentries.org', port=80): Max retries exceed
ed with url: /sanesecurity/sanesecurity.ftm.sha256 (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7114d7cddd50>: Failed to est
ablish a new connection: [Errno -3] Temporary failure in name resolution'))
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: ERROR: HTTPConnectionPool(host='mirror.sentries.org', port=80): Max retries exceeded with url: /sanesecurity/win
now.attachments.hdb.sha256 (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7114d7cdec50>: Failed to establish a new connection:
[Errno -3] Temporary failure in name resolution'))
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: Traceback (most recent call last):
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/urllib3/connection.py", line 174, in _new_conn
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     conn = connection.create_connection(
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/urllib3/util/connection.py", line 72, in create_connection
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM):
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/socket.py", line 962, in getaddrinfo
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: socket.gaierror: [Errno -3] Temporary failure in name resolution
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: During handling of the above exception, another exception occurred:
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: Traceback (most recent call last):
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/urllib3/connectionpool.py", line 715, in urlopen
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     httplib_response = self._make_request(
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:                        ^^^^^^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/urllib3/connectionpool.py", line 416, in _make_request
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     conn.request(method, url, **httplib_request_kw)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/urllib3/connection.py", line 244, in request
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     super(HTTPConnection, self).request(method, url, body=body, headers=headers)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/http/client.py", line 1298, in request
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     self._send_request(method, url, body, headers, encode_chunked)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/http/client.py", line 1344, in _send_request
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     self.endheaders(body, encode_chunked=encode_chunked)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/http/client.py", line 1293, in endheaders
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     self._send_output(message_body, encode_chunked=encode_chunked)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/http/client.py", line 1052, in _send_output
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     self.send(msg)
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/http/client.py", line 990, in send
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     self.connect()
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/urllib3/connection.py", line 205, in connect
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     conn = self._new_conn()
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:            ^^^^^^^^^^^^^^^^
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/urllib3/connection.py", line 186, in _new_conn
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     raise NewConnectionError(
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPConnection object at 0x7114d7cdec50>: Failed to establish a new connection: [Errno -3] Temporary failure in name resolution
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: During handling of the above exception, another exception occurred:
Ebr 29 15:31:22 MyComputer fangfrisch[5467]: Traceback (most recent call last):
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:   File "/usr/lib/python3.11/site-packages/requests/adapters.py", line 486, in send
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:     resp = conn.urlopen(
Ebr 29 15:31:22 MyComputer fangfrisch[5467]:            ^^^^^^^^^^^^^
...

Given that this problem has appeared in my logs regularly, I'm assuming it isn't directly related to the kernel lockups, but mention it in case something has changed which causes it to have new knock-on effects.


I'm currently booting LTS to see if that avoids the problem, but it is too soon to tell one way or another.


CLI Paste | How To Ask Questions

Arch Linux | x86_64 | GPT | EFI boot | refind | stub loader | systemd | LVM2 on LUKS
Lenovo x270 | Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz | Intel Wireless 8265/8275 | US keyboard w/ Euro | 512G NVMe INTEL SSDPEKKF512G7L

Offline

#2 2024-04-30 05:44:03

loqs
Member
Registered: 2014-03-06
Posts: 17,581

Re: 6.8.7.arch1-1 BUG: scheduling while atomic; watchdog: BUG: soft lockup

Offline

#3 2024-05-01 02:02:41

cfr
Member
From: Cymru
Registered: 2011-11-27
Posts: 7,143

Re: 6.8.7.arch1-1 BUG: scheduling while atomic; watchdog: BUG: soft lockup

That definitely sounds like it - thanks!

Did I misunderstand that thread? I understood from what was said that the commit causing the bug in the standard kernel had also been applied to Arch's LTS kernel. I've not yet seen the bug with the LTS one, but maybe it is just a matter of time?

Would you recommend downgrading in that case?

I'm also seeing a strange issue on the LTS kernel, but it seems unrelated. The LTS kernel says my battery is at 33% capacity i.e. dead, but at the weekend, the regular kernel said it was at 66% capacity i.e. alive. Now maybe it lost 33% of its life in a couple of days, but it seems a bit sudden even for an old battery. (But, then, I know very little about these things.) I was going to ask a new question, but just realised the reports are from different kernel lines.


CLI Paste | How To Ask Questions

Arch Linux | x86_64 | GPT | EFI boot | refind | stub loader | systemd | LVM2 on LUKS
Lenovo x270 | Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz | Intel Wireless 8265/8275 | US keyboard w/ Euro | 512G NVMe INTEL SSDPEKKF512G7L

Offline

#4 2024-05-03 02:45:17

cfr
Member
From: Cymru
Registered: 2011-11-27
Posts: 7,143

Re: 6.8.7.arch1-1 BUG: scheduling while atomic; watchdog: BUG: soft lockup

Does 'resolved' in the status mean it is fixed? In the past there's usually been a link to a patch (which is here, but seems to be provided by the reporter rather than a developer) and a link to the commit when the patch gets applied (maybe in a provisional branch or whatever), but the discussion here doesn't seem to include any of that.

I wanted to make a note of which kernel might get a fix since that's a useful thing to know. Am I missing something or is there no way of telling?


CLI Paste | How To Ask Questions

Arch Linux | x86_64 | GPT | EFI boot | refind | stub loader | systemd | LVM2 on LUKS
Lenovo x270 | Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz | Intel Wireless 8265/8275 | US keyboard w/ Euro | 512G NVMe INTEL SSDPEKKF512G7L

Offline

#5 2024-05-03 03:06:48

loqs
Member
Registered: 2014-03-06
Posts: 17,581

Re: 6.8.7.arch1-1 BUG: scheduling while atomic; watchdog: BUG: soft lockup

cfr wrote:

Does 'resolved' in the status mean it is fixed? In the past there's usually been a link to a patch (which is here, but seems to be provided by the reporter rather than a developer) and a link to the commit when the patch gets applied (maybe in a provisional branch or whatever), but the discussion here doesn't seem to include any of that.

I wanted to make a note of which kernel might get a fix since that's a useful thing to know. Am I missing something or is there no way of telling?

As far as I can determine the patch has not been accepted by the upstream subsystem maintainer https://lore.kernel.org/intel-wired-lan … huis.info/

Offline

#6 2024-05-03 05:21:55

cfr
Member
From: Cymru
Registered: 2011-11-27
Posts: 7,143

Re: 6.8.7.arch1-1 BUG: scheduling while atomic; watchdog: BUG: soft lockup

loqs wrote:

As far as I can determine the patch has not been accepted by the upstream subsystem maintainer https://lore.kernel.org/intel-wired-lan … huis.info/

That doesn't look good: not an objection to the patch, but a rejection of the problem. Is there anything I can do to help provide the evidence currently deemed lacking? The next message in the thread basically points to what I would have thought would be evidence ....

Do you think there's any chance of persuading Arch to at least remove it from the LTS kernel since I think Arch is adding it in that case? [But I've probably misunderstood.]


CLI Paste | How To Ask Questions

Arch Linux | x86_64 | GPT | EFI boot | refind | stub loader | systemd | LVM2 on LUKS
Lenovo x270 | Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz | Intel Wireless 8265/8275 | US keyboard w/ Euro | 512G NVMe INTEL SSDPEKKF512G7L

Offline

#7 2024-05-03 13:23:32

kifuna
Member
Registered: 2015-01-29
Posts: 40

Re: 6.8.7.arch1-1 BUG: scheduling while atomic; watchdog: BUG: soft lockup

As someone who is not familiar with the way bugs are resolved in the kernel, all these public inboxes on lore.kernel.org look quite confusing.

- https://bugzilla.kernel.org/show_bug.cgi?id=218740 shows the bug as "RESOLVED CODE_FIX".
- https://lore.kernel.org/netdev/17145282 … ernel.org/ refers to the application of this patch: https://git.kernel.org/pub/scm/linux/ke … 7f295cb215.
- https://lore.kernel.org/lkml/95e3e9af-a … intel.com/ refers to the above public inbox (https://lore.kernel.org/netdev/17145282 … ernel.org/).
- https://lore.kernel.org/intel-wired-lan … ernel.org/ (linked by loqs above) seems to be still open? The messages, however, seem to be the same as on https://patchwork.ozlabs.org/project/in … intel.com/, where the state is "accepted".

Could maybe someone with more insight briefly explain, how all these e-mails are related and what has been fixed where (linux and/or linux-lts) and in what version? Many thanks in advance!

Last edited by kifuna (2024-05-03 13:24:45)

Offline

#8 2024-05-03 13:54:05

loqs
Member
Registered: 2014-03-06
Posts: 17,581

Re: 6.8.7.arch1-1 BUG: scheduling while atomic; watchdog: BUG: soft lockup

The fix is now in mainline https://git.kernel.org/pub/scm/linux/ke … cbd3621778 so should be picked up by upstream stable in one of the next two upstream stable releases.
You can open an issue on the Arch gitlab instance asking for that commit to be backported to linux/linux-lts.

Offline

#9 2024-05-03 18:08:23

cfr
Member
From: Cymru
Registered: 2011-11-27
Posts: 7,143

Re: 6.8.7.arch1-1 BUG: scheduling while atomic; watchdog: BUG: soft lockup

Thanks! I got thoroughly lost in the maze.

loqs wrote:

You can open an issue on the Arch gitlab instance asking for that commit to be backported to linux/linux-lts.

This took quite some doing ...

https://gitlab.archlinux.org/archlinux/ … -/issues/7


CLI Paste | How To Ask Questions

Arch Linux | x86_64 | GPT | EFI boot | refind | stub loader | systemd | LVM2 on LUKS
Lenovo x270 | Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz | Intel Wireless 8265/8275 | US keyboard w/ Euro | 512G NVMe INTEL SSDPEKKF512G7L

Offline

Board footer

Powered by FluxBB