You are not logged in.

#1 2024-04-17 16:23:05

rsaavedra
Member
Registered: 2021-10-13
Posts: 12

[Solved] Soft lockup due to ethernet connection

I noticed my system hangs after I switch off wifi while having connected ethernet cable.

Here is the relevant journalctl log:

Apr 17 09:02:10 t480 NetworkManager[436]: <info>  [1713369730.5114] manager: rfkill: Wi-Fi hardware radio set disabled
Apr 17 09:02:10 t480 NetworkManager[436]: <info>  [1713369730.5115] device (wlp3s0): state change: activated -> unavailable (reason 'none', sys-iface-state: 'managed')
Apr 17 09:02:10 t480 NetworkManager[436]: <info>  [1713369730.5119] dhcp4 (wlp3s0): canceled DHCP transaction
Apr 17 09:02:10 t480 NetworkManager[436]: <info>  [1713369730.5120] dhcp4 (wlp3s0): activation: beginning transaction (timeout in 45 seconds)
Apr 17 09:02:10 t480 NetworkManager[436]: <info>  [1713369730.5120] dhcp4 (wlp3s0): state changed no lease
Apr 17 09:02:10 t480 wpa_supplicant[1147]: rfkill: WLAN soft blocked
Apr 17 09:02:10 t480 wpa_supplicant[1147]: rfkill: WLAN soft blocked
Apr 17 09:02:10 t480 systemd[1]: Starting Load/Save RF Kill Switch Status...
Apr 17 09:02:10 t480 systemd[1]: Started Load/Save RF Kill Switch Status.
Apr 17 09:02:10 t480 NetworkManager[436]: <info>  [1713369730.5793] device (wlp3s0): set-hw-addr: set MAC address to 16:E8:F9:0D:C5:A3 (scanning)
Apr 17 09:02:10 t480 NetworkManager[436]: <info>  [1713369730.5802] manager: NetworkManager state is now DISCONNECTED
Apr 17 09:02:10 t480 wpa_supplicant[1147]: p2p-dev-wlp3s0: CTRL-EVENT-DSCP-POLICY clear_all
Apr 17 09:02:10 t480 wpa_supplicant[1147]: p2p-dev-wlp3s0: CTRL-EVENT-DSCP-POLICY clear_all
Apr 17 09:02:10 t480 wpa_supplicant[1147]: nl80211: deinit ifname=p2p-dev-wlp3s0 disabled_11b_rates=0
Apr 17 09:02:10 t480 wpa_supplicant[1147]: wlp3s0: CTRL-EVENT-DSCP-POLICY clear_all
Apr 17 09:02:10 t480 NetworkManager[436]: <info>  [1713369730.6229] audit: op="radio-control" arg="wireless-enabled:off" pid=1597 uid=1000 result="success"
Apr 17 09:02:10 t480 NetworkManager[436]: <info>  [1713369730.6240] manager: rfkill: Wi-Fi now disabled by radio killswitch
Apr 17 09:02:10 t480 NetworkManager[436]: <info>  [1713369730.6270] device (p2p-dev-wlp3s0): state change: disconnected -> unavailable (reason 'supplicant-failed', sys-iface-state: 'managed')
Apr 17 09:02:10 t480 wpa_supplicant[1147]: wlp3s0: CTRL-EVENT-DSCP-POLICY clear_all
Apr 17 09:02:10 t480 wpa_supplicant[1147]: nl80211: deinit ifname=wlp3s0 disabled_11b_rates=0
Apr 17 09:02:15 t480 systemd[1]: systemd-rfkill.service: Deactivated successfully.
Apr 17 09:02:20 t480 systemd[1]: NetworkManager-dispatcher.service: Deactivated successfully.
Apr 17 09:02:25 t480 kernel: e1000e 0000:00:1f.6 enp0s31f6: NIC Link is Up 1000 Mbps Half Duplex, Flow Control: Rx/Tx
Apr 17 09:02:25 t480 kernel: BUG: scheduling while atomic: kworker/3:1/68/0x00000002
Apr 17 09:02:25 t480 kernel: Modules linked in: rfcomm vfat fat ccm cmac algif_hash algif_skcipher af_alg r8153_ecm cdc_ether usbnet r8152 mii bnep uvcvideo btusb btrtl videobuf2_vmalloc btintel uvc videobuf2_memops btbcm videobuf2_v4l2 btmtk videodev bluetooth videobuf2_common mc ecdh_generic mousedev joydev rmi_smbus rmi_core intel_uncore_frequency intel_uncore_frequency_common intel_pmc_core_pltdrv intel_pmc_core intel_vsec pmt_telemetry pmt_class snd_hda_codec_hdmi intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp snd_soc_avs coretemp snd_soc_hda_codec snd_soc_skl kvm_intel snd_soc_hdac_hda snd_hda_ext_core iwlmvm snd_soc_sst_ipc kvm snd_soc_sst_dsp snd_soc_acpi_intel_match snd_ctl_led irqbypass snd_soc_acpi snd_hda_codec_realtek crct10dif_pclmul mac80211 crc32_pclmul snd_hda_codec_generic snd_soc_core polyval_clmulni polyval_generic snd_compress gf128mul ghash_clmulni_intel ac97_bus sha512_ssse3 snd_pcm_dmaengine sha256_ssse3 sha1_ssse3 libarc4 snd_hda_intel aesni_intel snd_intel_dspcfg crypto_simd snd_intel_sdw_acpi cryptd
Apr 17 09:02:25 t480 kernel:  processor_thermal_device_pci_legacy snd_hda_codec processor_thermal_device intel_rapl_msr snd_hda_core rapl processor_thermal_wt_hint iwlwifi iTCO_wdt processor_thermal_rfim thinkpad_acpi ucsi_acpi snd_hwdep intel_pmc_bxt intel_cstate processor_thermal_rapl typec_ucsi mei_hdcp mei_pxp intel_rapl_common mei_wdt iTCO_vendor_support ee1004 ledtrig_audio snd_pcm intel_uncore e1000e processor_thermal_wt_req think_lmi typec intel_xhci_usb_role_switch psmouse mei_me processor_thermal_power_floor cfg80211 snd_timer ptp platform_profile wmi_bmof pcspkr processor_thermal_mbox firmware_attributes_class thunderbolt i2c_i801 intel_wmi_thunderbolt i2c_smbus mei rfkill intel_pch_thermal intel_soc_dts_iosf roles int3403_thermal pps_core snd int3400_thermal soundcore acpi_thermal_rel int340x_thermal_zone acpi_pad mac_hid crypto_user fuse loop dm_mod nfnetlink ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_multitouch hid_generic usbhid uas usb_storage i915 serio_raw i2c_algo_bit atkbd drm_buddy libps2
Apr 17 09:02:25 t480 kernel:  vivaldi_fmap ttm nvme crc32c_intel intel_gtt nvme_core drm_display_helper xhci_pci nvme_auth cec xhci_pci_renesas video i8042 wmi serio
Apr 17 09:02:25 t480 kernel: CPU: 3 PID: 68 Comm: kworker/3:1 Tainted: G S                 6.8.5-arch1-1 #1 5f12b795066ab8d27a5fe9971245067df4fb99ed
Apr 17 09:02:25 t480 kernel: Hardware name: LENOVO 20L6S9R500/20L6S9R500, BIOS N24ET76W (1.51 ) 02/27/2024
Apr 17 09:02:25 t480 kernel: Workqueue: events linkwatch_event
Apr 17 09:02:25 t480 kernel: Call Trace:
Apr 17 09:02:25 t480 kernel:  <TASK>
Apr 17 09:02:25 t480 kernel:  dump_stack_lvl+0x47/0x60
Apr 17 09:02:25 t480 kernel:  __schedule_bug+0x56/0x70
Apr 17 09:02:25 t480 kernel:  __schedule+0x10f0/0x1520
Apr 17 09:02:25 t480 kernel:  ? update_sd_lb_stats.constprop.0+0x12d/0xa10
Apr 17 09:02:25 t480 kernel:  ? ktime_get+0x3c/0xa0
Apr 17 09:02:25 t480 kernel:  ? lapic_next_deadline+0x2c/0x40
Apr 17 09:02:25 t480 kernel:  ? hrtimer_start_range_ns+0x25b/0x350
Apr 17 09:02:25 t480 kernel:  schedule+0x32/0xd0
Apr 17 09:02:25 t480 kernel:  schedule_hrtimeout_range_clock+0xbe/0x140
Apr 17 09:02:25 t480 kernel:  ? __pfx_hrtimer_wakeup+0x10/0x10
Apr 17 09:02:25 t480 kernel:  usleep_range_state+0x64/0x90
Apr 17 09:02:25 t480 kernel:  e1000e_read_phy_reg_mdic+0x87/0x280 [e1000e fb07978d01503861146aa67f737f75ae3542f3bf]
Apr 17 09:02:25 t480 kernel:  e1000e_update_stats+0x513/0x730 [e1000e fb07978d01503861146aa67f737f75ae3542f3bf]
Apr 17 09:02:25 t480 kernel:  e1000e_get_stats64+0x2e/0x120 [e1000e fb07978d01503861146aa67f737f75ae3542f3bf]
Apr 17 09:02:25 t480 kernel:  dev_get_stats+0x5d/0x110
Apr 17 09:02:25 t480 kernel:  rtnl_fill_stats+0x3b/0x130
Apr 17 09:02:25 t480 kernel:  rtnl_fill_ifinfo+0x872/0x1580
Apr 17 09:02:25 t480 kernel:  ? __alloc_skb+0x8c/0x1a0
Apr 17 09:02:25 t480 kernel:  rtmsg_ifinfo_build_skb+0xae/0x120
Apr 17 09:02:25 t480 kernel:  rtmsg_ifinfo+0x3c/0x90
Apr 17 09:02:25 t480 kernel:  netdev_state_change+0x89/0x90
Apr 17 09:02:25 t480 kernel:  linkwatch_do_dev+0x4f/0x60
Apr 17 09:02:25 t480 kernel:  __linkwatch_run_queue+0xe1/0x260
Apr 17 09:02:25 t480 kernel:  linkwatch_event+0x31/0x40
Apr 17 09:02:25 t480 kernel:  process_one_work+0x178/0x350
Apr 17 09:02:25 t480 kernel:  worker_thread+0x30f/0x450
Apr 17 09:02:25 t480 kernel:  ? __pfx_worker_thread+0x10/0x10
Apr 17 09:02:25 t480 kernel:  kthread+0xe5/0x120
Apr 17 09:02:25 t480 kernel:  ? __pfx_kthread+0x10/0x10
Apr 17 09:02:25 t480 kernel:  ret_from_fork+0x31/0x50
Apr 17 09:02:25 t480 kernel:  ? __pfx_kthread+0x10/0x10
Apr 17 09:02:25 t480 kernel:  ret_from_fork_asm+0x1b/0x30
Apr 17 09:02:25 t480 kernel:  </TASK>
Apr 17 09:02:49 t480 kernel: watchdog: BUG: soft lockup - CPU#3 stuck for 26s! [kworker/3:3:177]
Apr 17 09:02:49 t480 kernel: Modules linked in: rfcomm vfat fat ccm cmac algif_hash algif_skcipher af_alg r8153_ecm cdc_ether usbnet r8152 mii bnep uvcvideo btusb btrtl videobuf2_vmalloc btintel uvc videobuf2_memops btbcm videobuf2_v4l2 btmtk videodev bluetooth videobuf2_common mc ecdh_generic mousedev joydev rmi_smbus rmi_core intel_uncore_frequency intel_uncore_frequency_common intel_pmc_core_pltdrv intel_pmc_core intel_vsec pmt_telemetry pmt_class snd_hda_codec_hdmi intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp snd_soc_avs coretemp snd_soc_hda_codec snd_soc_skl kvm_intel snd_soc_hdac_hda snd_hda_ext_core iwlmvm snd_soc_sst_ipc kvm snd_soc_sst_dsp snd_soc_acpi_intel_match snd_ctl_led irqbypass snd_soc_acpi snd_hda_codec_realtek crct10dif_pclmul mac80211 crc32_pclmul snd_hda_codec_generic snd_soc_core polyval_clmulni polyval_generic snd_compress gf128mul ghash_clmulni_intel ac97_bus sha512_ssse3 snd_pcm_dmaengine sha256_ssse3 sha1_ssse3 libarc4 snd_hda_intel aesni_intel snd_intel_dspcfg crypto_simd snd_intel_sdw_acpi cryptd
Apr 17 09:02:49 t480 kernel:  processor_thermal_device_pci_legacy snd_hda_codec processor_thermal_device intel_rapl_msr snd_hda_core rapl processor_thermal_wt_hint iwlwifi iTCO_wdt processor_thermal_rfim thinkpad_acpi ucsi_acpi snd_hwdep intel_pmc_bxt intel_cstate processor_thermal_rapl typec_ucsi mei_hdcp mei_pxp intel_rapl_common mei_wdt iTCO_vendor_support ee1004 ledtrig_audio snd_pcm intel_uncore e1000e processor_thermal_wt_req think_lmi typec intel_xhci_usb_role_switch psmouse mei_me processor_thermal_power_floor cfg80211 snd_timer ptp platform_profile wmi_bmof pcspkr processor_thermal_mbox firmware_attributes_class thunderbolt i2c_i801 intel_wmi_thunderbolt i2c_smbus mei rfkill intel_pch_thermal intel_soc_dts_iosf roles int3403_thermal pps_core snd int3400_thermal soundcore acpi_thermal_rel int340x_thermal_zone acpi_pad mac_hid crypto_user fuse loop dm_mod nfnetlink ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_multitouch hid_generic usbhid uas usb_storage i915 serio_raw i2c_algo_bit atkbd drm_buddy libps2
Apr 17 09:02:49 t480 kernel:  vivaldi_fmap ttm nvme crc32c_intel intel_gtt nvme_core drm_display_helper xhci_pci nvme_auth cec xhci_pci_renesas video i8042 wmi serio
Apr 17 09:02:49 t480 kernel: CPU: 3 PID: 177 Comm: kworker/3:3 Tainted: G S      W          6.8.5-arch1-1 #1 5f12b795066ab8d27a5fe9971245067df4fb99ed
Apr 17 09:02:49 t480 kernel: Hardware name: LENOVO 20L6S9R500/20L6S9R500, BIOS N24ET76W (1.51 ) 02/27/2024
Apr 17 09:02:49 t480 kernel: Workqueue: events e1000_watchdog_task [e1000e]
Apr 17 09:02:49 t480 kernel: RIP: 0010:native_queued_spin_lock_slowpath+0x6e/0x2e0
Apr 17 09:02:49 t480 kernel: Code: 77 7f f0 0f ba 2b 08 0f 92 c2 8b 03 0f b6 d2 c1 e2 08 30 e4 09 d0 3d ff 00 00 00 77 5b 85 c0 74 10 0f b6 03 84 c0 74 09 f3 90 <0f> b6 03 84 c0 75 f7 b8 01 00 00 00 66 89 03 65 48 ff 05 c3 48 e5
Apr 17 09:02:49 t480 kernel: RSP: 0018:ffffb2d902383df0 EFLAGS: 00000202
Apr 17 09:02:49 t480 kernel: RAX: 0000000000000001 RBX: ffff99da13573428 RCX: 0000000000000000
Apr 17 09:02:49 t480 kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff99da13573428
Apr 17 09:02:49 t480 kernel: RBP: ffff99da13570f78 R08: 00000000fffee47d R09: 0000000000000002
Apr 17 09:02:49 t480 kernel: R10: ffff99dd516f4910 R11: 00000000001e28d4 R12: ffff99da135709c0
Apr 17 09:02:49 t480 kernel: R13: ffff99da013d1280 R14: ffff99da13573428 R15: ffff99da13570a58
Apr 17 09:02:49 t480 kernel: FS:  0000000000000000(0000) GS:ffff99dd516c0000(0000) knlGS:0000000000000000
Apr 17 09:02:49 t480 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 17 09:02:49 t480 kernel: CR2: 00007496a8ba1508 CR3: 0000000039820006 CR4: 00000000003706f0
Apr 17 09:02:49 t480 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Apr 17 09:02:49 t480 kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Apr 17 09:02:49 t480 kernel: Call Trace:
Apr 17 09:02:49 t480 kernel:  <IRQ>
Apr 17 09:02:49 t480 kernel:  ? watchdog_timer_fn+0x1e6/0x270
Apr 17 09:02:49 t480 kernel:  ? __pfx_watchdog_timer_fn+0x10/0x10
Apr 17 09:02:49 t480 kernel:  ? __hrtimer_run_queues+0x10f/0x2b0
Apr 17 09:02:49 t480 kernel:  ? hrtimer_interrupt+0xf8/0x230
Apr 17 09:02:49 t480 kernel:  ? __sysvec_apic_timer_interrupt+0x4d/0x140
Apr 17 09:02:49 t480 kernel:  ? sysvec_apic_timer_interrupt+0x6d/0x90
Apr 17 09:02:49 t480 kernel:  </IRQ>
Apr 17 09:02:49 t480 kernel:  <TASK>
Apr 17 09:02:49 t480 kernel:  ? asm_sysvec_apic_timer_interrupt+0x1a/0x20
Apr 17 09:02:49 t480 kernel:  ? native_queued_spin_lock_slowpath+0x6e/0x2e0
Apr 17 09:02:49 t480 kernel:  _raw_spin_lock+0x29/0x30
Apr 17 09:02:49 t480 kernel:  e1000_watchdog_task+0xdf/0xab0 [e1000e fb07978d01503861146aa67f737f75ae3542f3bf]
Apr 17 09:02:49 t480 kernel:  process_one_work+0x178/0x350
Apr 17 09:02:49 t480 kernel:  worker_thread+0x30f/0x450
Apr 17 09:02:49 t480 kernel:  ? __pfx_worker_thread+0x10/0x10
Apr 17 09:02:49 t480 kernel:  kthread+0xe5/0x120
Apr 17 09:02:49 t480 kernel:  ? __pfx_kthread+0x10/0x10
Apr 17 09:02:49 t480 kernel:  ret_from_fork+0x31/0x50
Apr 17 09:02:49 t480 kernel:  ? __pfx_kthread+0x10/0x10
Apr 17 09:02:49 t480 kernel:  ret_from_fork_asm+0x1b/0x30
Apr 17 09:02:49 t480 kernel:  </TASK>

As you can see there are two bugs:

Apr 17 09:02:25 t480 kernel: BUG: scheduling while atomic: kworker/3:1/68/0x00000002
Apr 17 09:02:49 t480 kernel: watchdog: BUG: soft lockup - CPU#3 stuck for 26s! [kworker/3:3:177]

I have recently updated firmware using fwupd.
I found the same issue with kernels linux 6.8.6.arch1-1and linux-lts 6.6.27-1.

---

Apparently there was an issue related to systemd bluetooth.service.
I found a pacnew file for /etc/bluetooth/main.conf with the default option `autoenable = true`.
I had it set to `autoenable = false`, so commenting this line solved the issue.

Last edited by rsaavedra (2024-04-23 22:15:19)

Offline

#2 2024-04-26 07:56:30

patroclos
Member
Registered: 2024-04-26
Posts: 1

Re: [Solved] Soft lockup due to ethernet connection

I'm having a similar issue, only it happens when I plugin an ethernet cable.

Here's the `journalctl -k -b -1`:

Apr 25 16:37:44.905014 x kernel: e1000e 0000:00:1f.6 enp0s31f6: NIC Link is Down
Apr 25 16:38:00.851640 x kernel: e1000e 0000:00:1f.6 enp0s31f6: NIC Link is Up 100 Mbps Half Duplex, Flow Control: Rx/Tx
Apr 25 16:38:00.851921 x kernel: BUG: scheduling while atomic: kworker/3:0/39/0x00000002
Apr 25 16:38:00.851941 x kernel: Modules linked in: ...
Apr 25 16:38:00.852056 x kernel: CPU: 3 PID: 39 Comm: kworker/3:0 Not tainted 6.8.7-arch1-1 #1 cb8440eaa48704794690ea311c777c18c4e95af9
Apr 25 16:38:00.852069 x kernel: Hardware name: LENOVO 20L8S5LK0M/20L8S5LK0M, BIOS N22ET76W (1.53 ) 01/04/2023
Apr 25 16:38:00.852083 x kernel: Workqueue: events linkwatch_event
Apr 25 16:38:00.852096 x kernel: Call Trace:
Apr 25 16:38:00.852109 x kernel:  <TASK>
Apr 25 16:38:00.852120 x kernel:  dump_stack_lvl+0x64/0x80
Apr 25 16:38:00.852133 x kernel:  __schedule_bug+0x56/0x70
Apr 25 16:38:00.852148 x kernel:  __schedule+0x10f0/0x1520
Apr 25 16:38:00.852161 x kernel:  ? update_sd_lb_stats.constprop.0+0x12d/0xa10
Apr 25 16:38:00.852176 x kernel:  ? ktime_get+0x3c/0xa0
Apr 25 16:38:00.852189 x kernel:  ? lapic_next_deadline+0x2c/0x40
Apr 25 16:38:00.852203 x kernel:  ? hrtimer_start_range_ns+0x25b/0x350
Apr 25 16:38:00.852214 x kernel:  schedule+0x32/0xd0
Apr 25 16:38:00.852226 x kernel:  schedule_hrtimeout_range_clock+0xbe/0x140
Apr 25 16:38:00.852239 x kernel:  ? __pfx_hrtimer_wakeup+0x10/0x10
Apr 25 16:38:00.852252 x kernel:  usleep_range_state+0x64/0x90
Apr 25 16:38:00.852265 x kernel:  e1000e_read_phy_reg_mdic+0x87/0x280 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Apr 25 16:38:00.852278 x kernel:  e1000e_update_stats+0x513/0x730 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Apr 25 16:38:00.852291 x kernel:  e1000e_get_stats64+0x2e/0x120 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Apr 25 16:38:00.852304 x kernel:  dev_get_stats+0x5d/0x110
Apr 25 16:38:00.852317 x kernel:  rtnl_fill_stats+0x3b/0x130
Apr 25 16:38:00.852330 x kernel:  rtnl_fill_ifinfo+0x872/0x1580
Apr 25 16:38:00.852343 x kernel:  ? __alloc_skb+0x8c/0x1a0
Apr 25 16:38:00.852356 x kernel:  rtmsg_ifinfo_build_skb+0xae/0x120
Apr 25 16:38:00.852368 x kernel:  rtmsg_ifinfo+0x3c/0x90
Apr 25 16:38:00.852381 x kernel:  netdev_state_change+0x89/0x90
Apr 25 16:38:00.852394 x kernel:  linkwatch_do_dev+0x4f/0x60
Apr 25 16:38:00.852407 x kernel:  __linkwatch_run_queue+0xe1/0x260
Apr 25 16:38:00.852418 x kernel:  linkwatch_event+0x31/0x40
Apr 25 16:38:00.852431 x kernel:  process_one_work+0x178/0x350
Apr 25 16:38:00.852447 x kernel:  worker_thread+0x30f/0x450
Apr 25 16:38:00.852463 x kernel:  ? __pfx_worker_thread+0x10/0x10
Apr 25 16:38:00.852475 x kernel:  kthread+0xe5/0x120
Apr 25 16:38:00.852488 x kernel:  ? __pfx_kthread+0x10/0x10
Apr 25 16:38:00.852501 x kernel:  ret_from_fork+0x31/0x50
Apr 25 16:38:00.852513 x kernel:  ? __pfx_kthread+0x10/0x10
Apr 25 16:38:00.852524 x kernel:  ret_from_fork_asm+0x1b/0x30
Apr 25 16:38:00.852536 x kernel:  </TASK>
Apr 25 16:38:27.013951 x kernel: watchdog: BUG: soft lockup - CPU#3 stuck for 26s! [kworker/3:1:79]

I took a look at /etc/bluetooth/main.conf like suggested above, but that was just the default configuration (I never touched that file)
However I found this kernel bug-report:
Bug 218740 - 6.8.6 BUG: scheduling while atomic: kworker/5:0/84234/0x00000002 (e1000e?)

There is a patch linked in that bug.  I followed the wiki instructions in the Kernel/Arch Build system to incorporate
the patch into the linux kernel and it solved my problem.

They also mention that the issue seems to be with kernels >=6.8.5, which is curious because found this issue also with linux-lts-6.6.27-1.

Offline

#3 2024-04-30 20:47:47

rsaavedra
Member
Registered: 2021-10-13
Posts: 12

Re: [Solved] Soft lockup due to ethernet connection

patroclos wrote:

I'm having a similar issue, only it happens when I plugin an ethernet cable.

Here's the `journalctl -k -b -1`:

Apr 25 16:37:44.905014 x kernel: e1000e 0000:00:1f.6 enp0s31f6: NIC Link is Down
Apr 25 16:38:00.851640 x kernel: e1000e 0000:00:1f.6 enp0s31f6: NIC Link is Up 100 Mbps Half Duplex, Flow Control: Rx/Tx
Apr 25 16:38:00.851921 x kernel: BUG: scheduling while atomic: kworker/3:0/39/0x00000002
Apr 25 16:38:00.851941 x kernel: Modules linked in: ...
Apr 25 16:38:00.852056 x kernel: CPU: 3 PID: 39 Comm: kworker/3:0 Not tainted 6.8.7-arch1-1 #1 cb8440eaa48704794690ea311c777c18c4e95af9
Apr 25 16:38:00.852069 x kernel: Hardware name: LENOVO 20L8S5LK0M/20L8S5LK0M, BIOS N22ET76W (1.53 ) 01/04/2023
Apr 25 16:38:00.852083 x kernel: Workqueue: events linkwatch_event
Apr 25 16:38:00.852096 x kernel: Call Trace:
Apr 25 16:38:00.852109 x kernel:  <TASK>
Apr 25 16:38:00.852120 x kernel:  dump_stack_lvl+0x64/0x80
Apr 25 16:38:00.852133 x kernel:  __schedule_bug+0x56/0x70
Apr 25 16:38:00.852148 x kernel:  __schedule+0x10f0/0x1520
Apr 25 16:38:00.852161 x kernel:  ? update_sd_lb_stats.constprop.0+0x12d/0xa10
Apr 25 16:38:00.852176 x kernel:  ? ktime_get+0x3c/0xa0
Apr 25 16:38:00.852189 x kernel:  ? lapic_next_deadline+0x2c/0x40
Apr 25 16:38:00.852203 x kernel:  ? hrtimer_start_range_ns+0x25b/0x350
Apr 25 16:38:00.852214 x kernel:  schedule+0x32/0xd0
Apr 25 16:38:00.852226 x kernel:  schedule_hrtimeout_range_clock+0xbe/0x140
Apr 25 16:38:00.852239 x kernel:  ? __pfx_hrtimer_wakeup+0x10/0x10
Apr 25 16:38:00.852252 x kernel:  usleep_range_state+0x64/0x90
Apr 25 16:38:00.852265 x kernel:  e1000e_read_phy_reg_mdic+0x87/0x280 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Apr 25 16:38:00.852278 x kernel:  e1000e_update_stats+0x513/0x730 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Apr 25 16:38:00.852291 x kernel:  e1000e_get_stats64+0x2e/0x120 [e1000e 5998468c47e3397a0a0c74928bb3f670f3bf9b64]
Apr 25 16:38:00.852304 x kernel:  dev_get_stats+0x5d/0x110
Apr 25 16:38:00.852317 x kernel:  rtnl_fill_stats+0x3b/0x130
Apr 25 16:38:00.852330 x kernel:  rtnl_fill_ifinfo+0x872/0x1580
Apr 25 16:38:00.852343 x kernel:  ? __alloc_skb+0x8c/0x1a0
Apr 25 16:38:00.852356 x kernel:  rtmsg_ifinfo_build_skb+0xae/0x120
Apr 25 16:38:00.852368 x kernel:  rtmsg_ifinfo+0x3c/0x90
Apr 25 16:38:00.852381 x kernel:  netdev_state_change+0x89/0x90
Apr 25 16:38:00.852394 x kernel:  linkwatch_do_dev+0x4f/0x60
Apr 25 16:38:00.852407 x kernel:  __linkwatch_run_queue+0xe1/0x260
Apr 25 16:38:00.852418 x kernel:  linkwatch_event+0x31/0x40
Apr 25 16:38:00.852431 x kernel:  process_one_work+0x178/0x350
Apr 25 16:38:00.852447 x kernel:  worker_thread+0x30f/0x450
Apr 25 16:38:00.852463 x kernel:  ? __pfx_worker_thread+0x10/0x10
Apr 25 16:38:00.852475 x kernel:  kthread+0xe5/0x120
Apr 25 16:38:00.852488 x kernel:  ? __pfx_kthread+0x10/0x10
Apr 25 16:38:00.852501 x kernel:  ret_from_fork+0x31/0x50
Apr 25 16:38:00.852513 x kernel:  ? __pfx_kthread+0x10/0x10
Apr 25 16:38:00.852524 x kernel:  ret_from_fork_asm+0x1b/0x30
Apr 25 16:38:00.852536 x kernel:  </TASK>
Apr 25 16:38:27.013951 x kernel: watchdog: BUG: soft lockup - CPU#3 stuck for 26s! [kworker/3:1:79]

I took a look at /etc/bluetooth/main.conf like suggested above, but that was just the default configuration (I never touched that file)
However I found this kernel bug-report:
Bug 218740 - 6.8.6 BUG: scheduling while atomic: kworker/5:0/84234/0x00000002 (e1000e?)

There is a patch linked in that bug.  I followed the wiki instructions in the Kernel/Arch Build system to incorporate
the patch into the linux kernel and it solved my problem.

They also mention that the issue seems to be with kernels >=6.8.5, which is curious because found this issue also with linux-lts-6.6.27-1.

I notice the same issue with kernel 6.8.8.
The only way around is to first disconnect wifi, then connect the ethernet.
The bug appears when you connect the ethernet while the wifi is still on.

Offline

#4 2024-05-01 00:23:47

loqs
Member
Registered: 2014-03-06
Posts: 17,438

Re: [Solved] Soft lockup due to ethernet connection

@rsaavedra if building the kernel yourself is preventing you from testing the proposed fix you can find a prebuilt kernel with it applied in https://bbs.archlinux.org/viewtopic.php … 1#p2165871

Offline

Board footer

Powered by FluxBB