#1 2020-09-06 09:49:50

wtq119
Member
Registered: 2010-01-16
Posts: 2

watchdog: BUG: soft lockup - CPU stuck for 22s!

Hello everyone!
Sorry for my English.
I installed the NFS service a few months ago but never used it. When I started using it this week, the kernel occasionally reported errors like this:

Sep 06 14:29:31 amd kernel: watchdog: BUG: soft lockup - CPU#57 stuck for 22s! [nfsd:1219]
Sep 06 14:29:31 amd kernel: Modules linked in: rpcsec_gss_krb5 loop joydev veth iptable_nat nf_nat nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo snd_usb_audio s>
Sep 06 14:29:31 amd kernel:  jbd2 crc32c_intel xhci_pci xhci_pci_renesas xhci_hcd
Sep 06 14:29:31 amd kernel: CPU: 57 PID: 1219 Comm: nfsd Tainted: G      D      L    5.8.5-arch1-1 #1
Sep 06 14:29:31 amd kernel: Hardware name: Gigabyte Technology Co., Ltd. TRX40 AORUS XTREME/TRX40 AORUS XTREME, BIOS F4d 03/05/2020
Sep 06 14:29:31 amd kernel: RIP: 0010:native_queued_spin_lock_slowpath+0x6b/0x230
Sep 06 14:29:31 amd kernel: Code: 0f 92 c0 0f b6 c0 c1 e0 08 89 c2 8b 03 30 e4 09 d0 a9 00 01 ff ff 0f 85 37 01 00 00 85 c0 74 0e 8b 03 84 c0 74 08 f3 90 8b 03 <84> >
Sep 06 14:29:31 amd kernel: RSP: 0018:ffffa10302d47ce8 EFLAGS: 00000202
Sep 06 14:29:31 amd kernel: RAX: 0000000000000101 RBX: ffff973491c0f460 RCX: 0000000000000000
Sep 06 14:29:31 amd kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff973491c0f460
Sep 06 14:29:31 amd kernel: RBP: ffffa10302d47d38 R08: ffff973473ece360 R09: ffffa10302d47d48
Sep 06 14:29:31 amd kernel: R10: 00000000000007ff R11: 0000000000000002 R12: ffff973491c0f460
Sep 06 14:29:31 amd kernel: R13: ffff973473ece360 R14: ffffa10302d47da8 R15: ffff9734645cc000
Sep 06 14:29:31 amd kernel: FS:  0000000000000000(0000) GS:ffff9734bde40000(0000) knlGS:0000000000000000
Sep 06 14:29:31 amd kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 06 14:29:31 amd kernel: CR2: 00007fafb8016718 CR3: 0000000b0ec0a000 CR4: 0000000000340ee0
Sep 06 14:29:31 amd kernel: Call Trace:
Sep 06 14:29:31 amd kernel:  _raw_spin_lock+0x2c/0x30
Sep 06 14:29:31 amd kernel:  lease_get_mtime+0x36/0x90
Sep 06 14:29:31 amd kernel:  encode_post_op_attr+0xca/0xf0 [nfsd]
Sep 06 14:29:31 amd kernel:  nfs3svc_encode_diropres+0x6b/0x70 [nfsd]
Sep 06 14:29:31 amd kernel:  nfsd_dispatch+0x13b/0x200 [nfsd]
Sep 06 14:29:31 amd kernel:  svc_process_common+0x3cd/0x740 [sunrpc]
Sep 06 14:29:31 amd kernel:  ? svc_xprt_received+0x55/0xc0 [sunrpc]
Sep 06 14:29:31 amd kernel:  ? svc_sock_secure_port+0x12/0x30 [sunrpc]
Sep 06 14:29:31 amd kernel:  ? nfsd_svc+0x330/0x330 [nfsd]
Sep 06 14:29:31 amd kernel:  ? nfsd_destroy+0x60/0x60 [nfsd]
Sep 06 14:29:31 amd kernel:  svc_process+0xb7/0xf0 [sunrpc]
Sep 06 14:29:31 amd kernel:  nfsd+0xed/0x150 [nfsd]
Sep 06 14:29:31 amd kernel:  kthread+0x142/0x160
Sep 06 14:29:31 amd kernel:  ? __kthread_bind_mask+0x60/0x60
Sep 06 14:29:31 amd kernel:  ret_from_fork+0x22/0x30

and like this:

Sep 06 16:56:58 amd kernel: rcu: INFO: rcu_preempt self-detected stall on CPU
Sep 06 16:56:58 amd kernel: rcu:         39-....: (180008 ticks this GP) idle=b9e/1/0x4000000000000000 softirq=138864/138864 fqs=60001 last_accelerate: 91bc/50e6 dyn>
Sep 06 16:56:58 amd kernel:         (t=180009 jiffies g=348157 q=219812)
Sep 06 16:56:58 amd kernel: NMI backtrace for cpu 39
Sep 06 16:56:58 amd kernel: CPU: 39 PID: 1228 Comm: nfsd Tainted: G      D      L    5.8.5-arch1-1 #1
Sep 06 16:56:58 amd kernel: Hardware name: Gigabyte Technology Co., Ltd. TRX40 AORUS XTREME/TRX40 AORUS XTREME, BIOS F4d 03/05/2020
Sep 06 16:56:58 amd kernel: Call Trace:
Sep 06 16:56:58 amd kernel:  <IRQ>
Sep 06 16:56:58 amd kernel:  dump_stack+0x6b/0x88
Sep 06 16:56:58 amd kernel:  ? lapic_can_unplug_cpu.cold+0x40/0x40
Sep 06 16:56:58 amd kernel:  nmi_cpu_backtrace.cold+0x13/0x51
Sep 06 16:56:58 amd kernel:  ? lapic_can_unplug_cpu.cold+0x40/0x40
Sep 06 16:56:58 amd kernel:  nmi_trigger_cpumask_backtrace+0x10a/0x129
Sep 06 16:56:58 amd kernel:  rcu_dump_cpu_stacks+0xa2/0xd0
Sep 06 16:56:58 amd kernel:  rcu_sched_clock_irq.cold+0x1a7/0x59f
Sep 06 16:56:58 amd kernel:  ? timekeeping_update+0xde/0x120
Sep 06 16:56:58 amd kernel:  update_process_times+0x24/0x60
Sep 06 16:56:58 amd kernel:  tick_sched_handle+0x22/0x60
Sep 06 16:56:58 amd kernel:  tick_sched_timer+0x5b/0xc0
Sep 06 16:56:58 amd kernel:  ? can_stop_idle_tick+0xd0/0xd0
Sep 06 16:56:58 amd kernel:  __hrtimer_run_queues+0x128/0x2f0
Sep 06 16:56:58 amd kernel:  hrtimer_interrupt+0x118/0x280
Sep 06 16:56:58 amd kernel:  __sysvec_apic_timer_interrupt+0x83/0x190
Sep 06 16:56:58 amd kernel:  asm_call_on_stack+0x12/0x20
Sep 06 16:56:58 amd kernel:  </IRQ>
Sep 06 16:56:58 amd kernel:  sysvec_apic_timer_interrupt+0xa8/0xe0
Sep 06 16:56:58 amd kernel:  asm_sysvec_apic_timer_interrupt+0x12/0x20
Sep 06 16:56:58 amd kernel: RIP: 0010:native_queued_spin_lock_slowpath+0x69/0x230
Sep 06 16:56:58 amd kernel: Code: 2b 08 0f 92 c0 0f b6 c0 c1 e0 08 89 c2 8b 03 30 e4 09 d0 a9 00 01 ff ff 0f 85 37 01 00 00 85 c0 74 0e 8b 03 84 c0 74 08 f3 90 <8b> >
Sep 06 16:56:58 amd kernel: RSP: 0018:ffffb2f683a73cc8 EFLAGS: 00000202
Sep 06 16:56:58 amd kernel: RAX: 0000000000000101 RBX: ffff915345cd85e8 RCX: 0000000000000000
Sep 06 16:56:58 amd kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff915345cd85e8
Sep 06 16:56:58 amd kernel: RBP: 0000000000000002 R08: ffff9153055c7830 R09: ffff9150b2596078
Sep 06 16:56:58 amd kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffff9153055c7830
Sep 06 16:56:58 amd kernel: R13: ffff915345cd85e8 R14: ffff91535b209208 R15: ffff91535c7e1900
Sep 06 16:56:58 amd kernel:  _raw_spin_lock+0x2c/0x30
Sep 06 16:56:58 amd kernel:  generic_setlease+0x10c/0x790
Sep 06 16:56:58 amd kernel:  destroy_unhashed_deleg+0x5a/0xc0 [nfsd]
Sep 06 16:56:58 amd kernel:  nfsd4_delegreturn+0x123/0x130 [nfsd]
Sep 06 16:56:58 amd kernel:  nfsd4_proc_compound+0x3b5/0x760 [nfsd]
Sep 06 16:56:58 amd kernel:  nfsd_dispatch+0xcc/0x200 [nfsd]
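Entries like the two above can be pulled out of the kernel log with a simple filter. This is only a sketch, assuming the log comes from systemd's journal (journalctl -k -b prints kernel messages for the current boot); the function name filter_lockups is just illustrative:

```shell
# Keep only the headline lines of soft-lockup and RCU-stall reports.
# Reads log text on stdin, so it works on a saved log file as well.
filter_lockups() {
    grep -E 'soft lockup|self-detected stall'
}

# Example usage (assumes journalctl is available):
#   journalctl -k -b | filter_lockups
```

Each matching line carries the timestamp and, for soft lockups, the CPU number and the stuck task, which makes it easier to see how often the problem occurs.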

When this happened, I tried 'kill' and 'kill -9', but neither worked.
I also tried 'systemctl restart nfs-server', but that didn't work either.

I run 'pacman -Syu' every week, so I don't know which update introduced the failure.
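In case it helps narrow this down, the upgrade history of the relevant packages can be read from pacman's log. A minimal sketch, assuming the default log path /var/log/pacman.log and the usual "[ALPM] upgraded NAME (OLD -> NEW)" line format; the function name list_upgrades is only illustrative:

```shell
# Print the upgrade entries for the kernel, microcode and NFS packages.
# Optional first argument: path to the pacman log
# (assumed default: /var/log/pacman.log).
list_upgrades() {
    logfile="${1:-/var/log/pacman.log}"
    # The trailing space after the name keeps e.g. linux-headers out.
    grep -E '\[ALPM\] upgraded (linux|amd-ucode|nfs-utils) ' "$logfile"
}

# Example usage:
#   list_upgrades
```

Comparing the timestamps of these entries with the first occurrence of the lockup in the journal should show which upgrades are candidates.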



My computer and software:
AMD 3970X, Gigabyte TRX40 AORUS XTREME
64 GB RAM
kernel: 5.8.5-arch1-1
amd-ucode: 20200817.7a30af1-1
nfs-utils: 2.5.1-1

Thanks!

Last edited by wtq119 (2020-09-06 09:51:48)
