You are not logged in.
System Configuration:
- Everything up-to-date
- Kernel: extra/linux-zen 6.6.4.zen1-1
- Using unofficial repos archlinuxcn and chaotic-aur along with the official repos, but I don't think I've installed any software that will cause this problem from unofficial repos.
- Removed or blacklisted all kernel modules that will taint the kernel.
- Tried to reproduce multiple times and succeeded.
- GPU: AMD 5700 founders edition
The Problem:
Recently (I'm not certain which update caused the problem), attempting to start LibreOffice results in the application getting stuck on the splash screen. The progress bar moves initially but then freezes. Subsequently, the system might remain usable, or it could freeze entirely.
When the system remains usable, the process soffice.bin becomes a zombie, and terminating it becomes impossible. Even when I try to shutdown, the kernel fails to kill the zombie process, necessitating a wait for the systemd shutdown watchdog to timeout and force a power-off. Despite the presence of this zombie process, the system remains operational and somewhat stable. However, I'm able to spot some error messages in the kernel log:
12月 08 13:56:28 kernel: amdgpu 0000:0a:00.0: amdgpu: bo 000000009faa37dc va 0x0800000000-0x0800000001 conflict with 0x0800000000-0x0800000200
12月 08 13:56:28 kernel: amdgpu: Failed to map VA 0x800000000000 in vm. ret -22
12月 08 13:56:28 kernel: amdgpu: Failed to map bo to gpuvm
12月 08 13:56:28 kernel: BUG: kernel NULL pointer dereference, address: 0000000000000008
12月 08 13:56:28 kernel: #PF: supervisor read access in kernel mode
12月 08 13:56:28 kernel: #PF: error_code(0x0000) - not-present page
12月 08 13:56:28 kernel: PGD 0 P4D 0
12月 08 13:56:28 kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
12月 08 13:56:28 kernel: CPU: 18 PID: 11646 Comm: soffice.bin Not tainted 6.6.4-zen1-1-zen #1 26da708843d24c87db72b38a656d6f6e482ad152
12月 08 13:56:28 kernel: Hardware name: ASUS System Product Name/TUF GAMING B550M-PLUS (WI-FI), BIOS 2803 04/27/2022
12月 08 13:56:28 kernel: RIP: 0010:dma_resv_add_fence+0x47/0x1e0
12月 08 13:56:28 kernel: Code: 89 54 24 04 48 85 f6 74 21 48 8d 7e 38 b8 01 00 00 00 f0 0f c1 46 38 85 c0 0f 84 49 01 00 00 8d 50 01 09 c2 0f 88 4d 01 00 00 <49> 8b 45 08 48 3d 40 3c 3a 9d 0f 84 c9 00 00 00 48 3d e>
12月 08 13:56:28 kernel: RSP: 0018:ffffbbe450b87c60 EFLAGS: 00010246
12月 08 13:56:28 kernel: RAX: ffff9b8fce740000 RBX: ffff9b8fce740158 RCX: 0000000000000003
12月 08 13:56:28 kernel: RDX: 0000000000000003 RSI: 0000000000000000 RDI: ffff9b8fce740158
12月 08 13:56:28 kernel: RBP: ffff9b8e81469000 R08: 0000000000000002 R09: ffff9b8e81469000
12月 08 13:56:28 kernel: R10: 0000000007ffffff R11: 0000000004000800 R12: ffff9b8eb6079338
12月 08 13:56:28 kernel: R13: 0000000000000000 R14: ffff9b8e81469730 R15: ffff9b8fce740000
12月 08 13:56:28 kernel: FS: 00007f629cbcd000(0000) GS:ffff9ba15ee80000(0000) knlGS:0000000000000000
12月 08 13:56:28 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
12月 08 13:56:28 kernel: CR2: 0000000000000008 CR3: 000000016e800000 CR4: 0000000000f50ee0
12月 08 13:56:28 kernel: PKRU: 55555554
12月 08 13:56:28 kernel: Call Trace:
12月 08 13:56:28 kernel: <TASK>
12月 08 13:56:28 kernel: ? __die+0x10f/0x120
12月 08 13:56:28 kernel: ? page_fault_oops+0x171/0x4e0
12月 08 13:56:28 kernel: ? exc_page_fault+0x7f/0x180
12月 08 13:56:28 kernel: ? asm_exc_page_fault+0x26/0x30
12月 08 13:56:28 kernel: ? dma_resv_add_fence+0x47/0x1e0
12月 08 13:56:28 kernel: amdgpu_amdkfd_gpuvm_acquire_process_vm+0x212/0x530 [amdgpu f35041c0b426873870f068374b5ce57ca7b35096]
12月 08 13:56:28 kernel: kfd_process_device_init_vm+0xb0/0x390 [amdgpu f35041c0b426873870f068374b5ce57ca7b35096]
12月 08 13:56:28 kernel: ? kfd_ioctl_get_process_apertures_new+0x47/0x330 [amdgpu f35041c0b426873870f068374b5ce57ca7b35096]
12月 08 13:56:28 kernel: kfd_ioctl_acquire_vm+0x89/0xc0 [amdgpu f35041c0b426873870f068374b5ce57ca7b35096]
12月 08 13:56:28 kernel: kfd_ioctl+0x3cb/0x4e0 [amdgpu f35041c0b426873870f068374b5ce57ca7b35096]
12月 08 13:56:28 kernel: ? __pfx_kfd_ioctl_acquire_vm+0x10/0x10 [amdgpu f35041c0b426873870f068374b5ce57ca7b35096]
12月 08 13:56:28 kernel: __x64_sys_ioctl+0x97/0xd0
12月 08 13:56:28 kernel: do_syscall_64+0x60/0x90
12月 08 13:56:28 kernel: ? syscall_exit_to_user_mode+0x2b/0x40
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: ? do_syscall_64+0x6c/0x90
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: ? __x64_sys_close+0x8c/0x110
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: ? syscall_exit_to_user_mode+0x2b/0x40
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: ? do_syscall_64+0x6c/0x90
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: ? syscall_exit_to_user_mode+0x2b/0x40
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: ? do_syscall_64+0x6c/0x90
12月 08 13:56:28 kernel: ? do_syscall_64+0x6c/0x90
12月 08 13:56:28 kernel: ? do_syscall_64+0x6c/0x90
12月 08 13:56:28 kernel: entry_SYSCALL_64_after_hwframe+0x6e/0xd8
12月 08 13:56:28 kernel: RIP: 0033:0x7f62a6aef3af
12月 08 13:56:28 kernel: Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 18 48 8b 44 24 18 64 48 2b 04 2>
12月 08 13:56:28 kernel: RSP: 002b:00007fffb7524960 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
12月 08 13:56:28 kernel: RAX: ffffffffffffffda RBX: 00007fffb7524a50 RCX: 00007f62a6aef3af
12月 08 13:56:28 kernel: RDX: 00007fffb7524ad0 RSI: 0000000040084b15 RDI: 0000000000000023
12月 08 13:56:28 kernel: RBP: 00007fffb7524ad0 R08: 000000000000000e R09: 0000000000000006
12月 08 13:56:28 kernel: R10: 0000000000000001 R11: 0000000000000246 R12: 0000000040084b15
12月 08 13:56:28 kernel: R13: 0000000000000023 R14: 000055cab482cf90 R15: 00007f6238724180
12月 08 13:56:28 kernel: </TASK>
12月 08 13:56:28 kernel: Modules linked in: nfnetlink rfcomm ntfs3 ccm cmac algif_hash algif_skcipher af_alg bnep intel_rapl_msr intel_rapl_common iwlmvm edac_mce_amd vfat fat amdgpu kvm_amd snd_hda_codec_realtek m>
12月 08 13:56:28 kernel: ccp acpi_cpufreq pcspkr k10temp soundcore i2c_piix4 gpio_amdpt wmi gpio_generic mac_hid tcp_bbr sch_cake vmw_vmci usbip_host usbip_core pkcs8_key_parser i2c_dev crypto_user fuse loop dm_mo>
12月 08 13:56:28 kernel: CR2: 0000000000000008
12月 08 13:56:28 kernel: ---[ end trace 0000000000000000 ]---
12月 08 13:56:28 kernel: RIP: 0010:dma_resv_add_fence+0x47/0x1e0
12月 08 13:56:28 kernel: Code: 89 54 24 04 48 85 f6 74 21 48 8d 7e 38 b8 01 00 00 00 f0 0f c1 46 38 85 c0 0f 84 49 01 00 00 8d 50 01 09 c2 0f 88 4d 01 00 00 <49> 8b 45 08 48 3d 40 3c 3a 9d 0f 84 c9 00 00 00 48 3d e>
12月 08 13:56:28 kernel: RSP: 0018:ffffbbe450b87c60 EFLAGS: 00010246
12月 08 13:56:28 kernel: RAX: ffff9b8fce740000 RBX: ffff9b8fce740158 RCX: 0000000000000003
12月 08 13:56:28 kernel: RDX: 0000000000000003 RSI: 0000000000000000 RDI: ffff9b8fce740158
12月 08 13:56:28 kernel: RBP: ffff9b8e81469000 R08: 0000000000000002 R09: ffff9b8e81469000
12月 08 13:56:28 kernel: R10: 0000000007ffffff R11: 0000000004000800 R12: ffff9b8eb6079338
12月 08 13:56:28 kernel: R13: 0000000000000000 R14: ffff9b8e81469730 R15: ffff9b8fce740000
12月 08 13:56:28 kernel: FS: 00007f629cbcd000(0000) GS:ffff9ba15ee80000(0000) knlGS:0000000000000000
12月 08 13:56:28 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
12月 08 13:56:28 kernel: CR2: 0000000000000008 CR3: 000000016e800000 CR4: 0000000000f50ee0
12月 08 13:56:28 kernel: PKRU: 55555554
12月 08 13:56:28 kernel: note: soffice.bin[11646] exited with irqs disabled
12月 08 13:56:28 kernel: BUG: kernel NULL pointer dereference, address: 0000000000000001
12月 08 13:56:28 kernel: #PF: supervisor read access in kernel mode
12月 08 13:56:28 kernel: #PF: error_code(0x0000) - not-present page
12月 08 13:56:28 kernel: PGD 1b9559067 P4D 1b9559067 PUD 1bb211067 PMD 0
12月 08 13:56:28 kernel: Oops: 0000 [#2] PREEMPT SMP NOPTI
12月 08 13:56:28 kernel: CPU: 6 PID: 12 Comm: kworker/u64:1 Tainted: G D 6.6.4-zen1-1-zen #1 26da708843d24c87db72b38a656d6f6e482ad152
12月 08 13:56:28 kernel: Hardware name: ASUS System Product Name/TUF GAMING B550M-PLUS (WI-FI), BIOS 2803 04/27/2022
12月 08 13:56:28 kernel: Workqueue: kfd_restore_wq restore_process_worker [amdgpu]
12月 08 13:56:28 kernel: RIP: 0010:__list_add_valid_or_report+0x1a/0xa0
12月 08 13:56:28 kernel: Code: 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 f3 0f 1e fa 48 89 d0 48 85 f6 74 2a 48 85 d2 74 3a 48 8b 52 08 48 39 f2 75 41 <4c> 8b 02 49 39 c0 75 4c 48 39 fa 74 60 49 39 f8 74 5b b>
12月 08 13:56:28 kernel: RSP: 0018:ffffbbe440157c18 EFLAGS: 00010246
12月 08 13:56:28 kernel: RAX: ffff9b8fd554d050 RBX: 0000000000000001 RCX: 0000000000000002
12月 08 13:56:28 kernel: RDX: 0000000000000001 RSI: 0000000000000001 RDI: ffffbbe440157c40
12月 08 13:56:28 kernel: RBP: ffffbbe440157ca0 R08: 0000000000000000 R09: 0000000000000000
12月 08 13:56:28 kernel: R10: 0000000000000001 R11: ffff9b8e80b1a0c0 R12: ffffbbe440157c40
12月 08 13:56:28 kernel: R13: ffff9b8fd554d048 R14: ffff9b8fd554d040 R15: ffff9b8eb60793a8
12月 08 13:56:28 kernel: FS: 0000000000000000(0000) GS:ffff9ba15eb80000(0000) knlGS:0000000000000000
12月 08 13:56:28 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
12月 08 13:56:28 kernel: CR2: 0000000000000001 CR3: 0000000195e96000 CR4: 0000000000f50ee0
12月 08 13:56:28 kernel: PKRU: 55555554
12月 08 13:56:28 kernel: Call Trace:
12月 08 13:56:28 kernel: <TASK>
12月 08 13:56:28 kernel: ? __die+0x10f/0x120
12月 08 13:56:28 kernel: ? page_fault_oops+0x171/0x4e0
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: ? sched_clock_cpu+0xf/0x1b0
12月 08 13:56:28 kernel: ? exc_page_fault+0x7f/0x180
12月 08 13:56:28 kernel: ? asm_exc_page_fault+0x26/0x30
12月 08 13:56:28 kernel: ? __list_add_valid_or_report+0x1a/0xa0
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: __mutex_lock.constprop.0+0x29c/0x770
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: amdgpu_amdkfd_gpuvm_restore_process_bos+0x71/0x730 [amdgpu f35041c0b426873870f068374b5ce57ca7b35096]
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: ? srso_alias_return_thunk+0x5/0x7f
12月 08 13:56:28 kernel: ? psi_group_change+0x274/0x430
12月 08 13:56:28 kernel: restore_process_worker+0x34/0x170 [amdgpu f35041c0b426873870f068374b5ce57ca7b35096]
12月 08 13:56:28 kernel: process_one_work+0x174/0x330
12月 08 13:56:28 kernel: worker_thread+0x3ef/0x580
12月 08 13:56:28 kernel: ? __pfx_worker_thread+0x10/0x10
12月 08 13:56:28 kernel: kthread+0xe8/0x120
12月 08 13:56:28 kernel: ? __pfx_kthread+0x10/0x10
12月 08 13:56:28 kernel: ret_from_fork+0x34/0x50
12月 08 13:56:28 kernel: ? __pfx_kthread+0x10/0x10
12月 08 13:56:28 kernel: ret_from_fork_asm+0x1b/0x30
12月 08 13:56:28 kernel: </TASK>
12月 08 13:56:28 kernel: Modules linked in: nfnetlink rfcomm ntfs3 ccm cmac algif_hash algif_skcipher af_alg bnep intel_rapl_msr intel_rapl_common iwlmvm edac_mce_amd vfat fat amdgpu kvm_amd snd_hda_codec_realtek m>
12月 08 13:56:28 kernel: ccp acpi_cpufreq pcspkr k10temp soundcore i2c_piix4 gpio_amdpt wmi gpio_generic mac_hid tcp_bbr sch_cake vmw_vmci usbip_host usbip_core pkcs8_key_parser i2c_dev crypto_user fuse loop dm_mo>
12月 08 13:56:28 kernel: CR2: 0000000000000001
12月 08 13:56:28 kernel: ---[ end trace 0000000000000000 ]---
12月 08 13:56:28 kernel: RIP: 0010:dma_resv_add_fence+0x47/0x1e0
12月 08 13:56:28 kernel: Code: 89 54 24 04 48 85 f6 74 21 48 8d 7e 38 b8 01 00 00 00 f0 0f c1 46 38 85 c0 0f 84 49 01 00 00 8d 50 01 09 c2 0f 88 4d 01 00 00 <49> 8b 45 08 48 3d 40 3c 3a 9d 0f 84 c9 00 00 00 48 3d e>
12月 08 13:56:28 kernel: RSP: 0018:ffffbbe450b87c60 EFLAGS: 00010246
12月 08 13:56:28 kernel: RAX: ffff9b8fce740000 RBX: ffff9b8fce740158 RCX: 0000000000000003
12月 08 13:56:28 kernel: RDX: 0000000000000003 RSI: 0000000000000000 RDI: ffff9b8fce740158
12月 08 13:56:28 kernel: RBP: ffff9b8e81469000 R08: 0000000000000002 R09: ffff9b8e81469000
12月 08 13:56:28 kernel: R10: 0000000007ffffff R11: 0000000004000800 R12: ffff9b8eb6079338
12月 08 13:56:28 kernel: R13: 0000000000000000 R14: ffff9b8e81469730 R15: ffff9b8fce740000
12月 08 13:56:28 kernel: FS: 0000000000000000(0000) GS:ffff9ba15eb80000(0000) knlGS:0000000000000000
12月 08 13:56:28 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
12月 08 13:56:28 kernel: CR2: 0000000000000001 CR3: 0000000195e96000 CR4: 0000000000f50ee0
12月 08 13:56:28 kernel: PKRU: 55555554
12月 08 13:56:28 kernel: note: kworker/u64:1[12] exited with irqs disabled
12月 08 13:56:28 kernel: note: kworker/u64:1[12] exited with preempt_count 2In cases where my system completely freezes, I am compelled to forcibly cut off power and restart. During these occurrences, there is no any response from the system, SSH is also non-functional, and no logs are saved to disk so I can't investigate the root cause after I restart.
Any ideas? Is this a driver bug and is there any way to fix it?
Last edited by Shidao (2023-12-10 07:09:41)
Offline
I don't know how to fix your hardware problems or wheather you should change gpu driver, but does your system hang only with this application?
You may try to start LibreOffice in safe mode to check if it will work; start in terminal: 'soffice --safe-mode'. If it not hang you may reset its profile, disable the HW acceleration, etc:
https://www.libreofficehelp.com/fix-can … breoffice/
https://ask.libreoffice.org/t/libreoffi … -why/64173
Offline
I don't know how to fix your hardware problems or wheather you should change gpu driver, but does your system hang only with this application?
You may try to start LibreOffice in safe mode to check if it will work; start in terminal: 'soffice --safe-mode'. If it not hang you may reset its profile, disable the HW acceleration, etc:
https://www.libreofficehelp.com/fix-can … breoffice/
https://ask.libreoffice.org/t/libreoffi … -why/64173
Though it sounds strange, my system hangs only with this application.
Disabling hardware acceleration worked for me, thanks!
Offline