You are not logged in.

#1 2024-03-05 07:49:41

allencch
Member
Registered: 2011-03-25
Posts: 118

[SOLVED] Laptop frequently freeze recently

Environment:

Linux version 6.7.6
NVidia DKMS 550

I have faced a serious laptop freeze problem since last night. The browsers freeze after some amount of work, and I can't do `killall`. `killall` has no respond.
When I try `ps aux|less`, it shows nothing.
Then I tried to restart, and I saw the error messages of failed to unmount /home and /tmp.

I get these errors from `journalctl -b -1`

Mar 05 15:23:36 archnovo systemd[1]: Unmounting /boot...
Mar 05 15:23:36 archnovo systemd[1]: Unmounting /home...
Mar 05 15:23:36 archnovo systemd[1]: Unmounting Temporary Directory /tmp...
Mar 05 15:23:36 archnovo systemd[1]: boot.mount: Deactivated successfully.
Mar 05 15:23:36 archnovo umount[39571]: umount: /home: target is busy.
Mar 05 15:23:36 archnovo umount[39572]: umount: /tmp: target is busy.
Mar 05 15:23:36 archnovo systemd[1]: Unmounted /boot.
Mar 05 15:23:36 archnovo systemd[1]: home.mount: Mount process exited, code=exited, status=32/n/a
Mar 05 15:23:36 archnovo systemd[1]: Failed unmounting /home.
Mar 05 15:23:36 archnovo systemd[1]: tmp.mount: Mount process exited, code=exited, status=32/n/a
Mar 05 15:23:36 archnovo systemd[1]: Failed unmounting Temporary Directory /tmp.
Mar 05 15:23:36 archnovo systemd[1]: Stopped target Swaps.
Mar 05 15:23:37 archnovo systemd[1]: Deactivating swap /swapfile...
Mar 05 15:23:37 archnovo systemd[1]: systemd-fsck@dev-disk-by\x2duuid-18CC\x2d21B3.service: Deactivated successfully.
Mar 05 15:23:37 archnovo systemd[1]: Stopped File System Check on /dev/disk/by-uuid/18CC-21B3.
Mar 05 15:23:37 archnovo systemd[1]: systemd-fsck@dev-disk-by\x2duuid-41740d33\x2d46ab\x2d45a1\x2daa8c\x2d269c47eb082a.service: Deactivated successfully.
Mar 05 15:23:37 archnovo systemd[1]: Stopped File System Check on /dev/disk/by-uuid/41740d33-46ab-45a1-aa8c-269c47eb082a.
Mar 05 15:23:37 archnovo systemd[1]: Removed slice Slice /system/systemd-fsck.
Mar 05 15:23:37 archnovo systemd[1]: Stopped target Preparation for Local File Systems.
Mar 05 15:23:37 archnovo systemd[1]: Stopping Monitoring of LVM2 mirrors, snapshots etc. using dmeventd or progress polling...
Mar 05 15:23:37 archnovo systemd[1]: systemd-tmpfiles-setup-dev.service: Deactivated successfully.
Mar 05 15:23:37 archnovo systemd[1]: Stopped Create Static Device Nodes in /dev.
Mar 05 15:23:37 archnovo systemd[1]: systemd-tmpfiles-setup-dev-early.service: Deactivated successfully.
Mar 05 15:23:37 archnovo systemd[1]: Stopped Create Static Device Nodes in /dev gracefully.
Mar 05 15:23:37 archnovo systemd[1]: lvm2-monitor.service: Deactivated successfully.
Mar 05 15:23:37 archnovo systemd[1]: Stopped Monitoring of LVM2 mirrors, snapshots etc. using dmeventd or progress polling.

Besides that, there are also some kernel error messages, not sure whether they are related.

Mar 05 14:32:40 archnovo kernel: BUG: unable to handle page fault for address: 0000000000003686
Mar 05 14:32:40 archnovo kernel: #PF: supervisor read access in kernel mode
Mar 05 14:32:40 archnovo kernel: #PF: error_code(0x0000) - not-present page
Mar 05 14:32:40 archnovo kernel: PGD 0 P4D 0 
Mar 05 14:32:40 archnovo kernel: Oops: 0000 [#2] PREEMPT SMP PTI
Mar 05 14:32:40 archnovo kernel: CPU: 5 PID: 13059 Comm: ThreadPoolForeg Tainted: P      D W  OE      6.7.6-arch1-2 #1 36a1d137df2a95849ad6b7232a6023837991924e
Mar 05 14:32:40 archnovo kernel: Hardware name: LENOVO 81T0/LNVNB161216, BIOS BHCN29WW 06/19/2019
Mar 05 14:32:40 archnovo kernel: RIP: 0010:zswap_load+0x30e/0x4b0
Mar 05 14:32:40 archnovo kernel: Code: 2c 0c 00 00 01 65 ff 0d 58 a3 c9 77 0f 84 9d 01 00 00 65 48 ff 05 1a 6d c9 77 4d 8b 67 38 4d 85 e4 74 1b 66 90 e8 f2 4e dd ff <49> 8b 7c 24 10 be 6f 00 00 00 e8 d3 d8 ff ff e8 ee 86 dd ff 48 89
Mar 05 14:32:40 archnovo kernel: RSP: 0000:ffffad8d497a7bc8 EFLAGS: 00010202
Mar 05 14:32:40 archnovo kernel: RAX: 0000000000000001 RBX: ffffdbe981e016c0 RCX: 0000000004b5b405
Mar 05 14:32:40 archnovo kernel: RDX: ffff97db62c7a400 RSI: 0000000000039160 RDI: 0062a762db97ffff
Mar 05 14:32:40 archnovo kernel: RBP: ffff97db016fa2b8 R08: 0000000000001000 R09: ffffad8d497a7b30
Mar 05 14:32:40 archnovo kernel: R10: ffffad8d40529000 R11: 00000000000000f0 R12: 0000000000003676
Mar 05 14:32:40 archnovo kernel: R13: ffff97db01032090 R14: ffff97db016fa2b0 R15: ffff97da0df76140
Mar 05 14:32:40 archnovo kernel: FS:  000072d45c2006c0(0000) GS:ffff97dc55d40000(0000) knlGS:0000000000000000
Mar 05 14:32:40 archnovo kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 05 14:32:40 archnovo kernel: CR2: 0000000000003686 CR3: 000000012bc08006 CR4: 00000000003706f0
Mar 05 14:32:40 archnovo kernel: Call Trace:
Mar 05 14:32:40 archnovo kernel:  <TASK>
Mar 05 14:32:40 archnovo kernel:  ? __die+0x23/0x70
Mar 05 14:32:40 archnovo kernel:  ? page_fault_oops+0x171/0x4e0
Mar 05 14:32:40 archnovo kernel:  ? exc_page_fault+0x7f/0x180
Mar 05 14:32:40 archnovo kernel:  ? asm_exc_page_fault+0x26/0x30
Mar 05 14:32:40 archnovo kernel:  ? zswap_load+0x30e/0x4b0
Mar 05 14:32:40 archnovo kernel:  ? zswap_load+0x30e/0x4b0
Mar 05 14:32:40 archnovo kernel:  swap_readpage+0x81/0x460
Mar 05 14:32:40 archnovo kernel:  swapin_readahead+0x4c8/0x560
Mar 05 14:32:40 archnovo kernel:  do_swap_page+0x279/0xc00
Mar 05 14:32:40 archnovo kernel:  ? __pte_offset_map+0x1b/0x180
Mar 05 14:32:40 archnovo kernel:  __handle_mm_fault+0x7e7/0xda0
Mar 05 14:32:40 archnovo kernel:  handle_mm_fault+0x17f/0x360
Mar 05 14:32:40 archnovo kernel:  do_user_addr_fault+0x15b/0x670
Mar 05 14:32:40 archnovo kernel:  exc_page_fault+0x7f/0x180
Mar 05 14:32:40 archnovo kernel:  asm_exc_page_fault+0x26/0x30
Mar 05 14:32:40 archnovo kernel: RIP: 0033:0x5d7aad3bafe8

Can someone help me on this? How to diagnose the issue?

Last edited by allencch (2024-03-09 16:30:33)

Offline

#2 2024-03-05 08:42:41

seth
Member
Registered: 2012-09-03
Posts: 52,011

Re: [SOLVED] Laptop frequently freeze recently

Target busy means some process is still holding open FDs on those mounts.
Probably relates to the killall failures.

When I try `ps aux|less`, it shows nothing.

What do you mean by "nothing"? Not even the ps header??

Is the freeze limited to the browser?
What browser?

There seems to be an issue accessing the zswap - do you have physical swayp space?
Does this also happen when disabling zswap, https://wiki.archlinux.org/title/Zswap ?
Are you simply running OOM?
can you kill -9 the browser process(es)?

Offline

#3 2024-03-05 12:38:31

allencch
Member
Registered: 2011-03-25
Posts: 118

Re: [SOLVED] Laptop frequently freeze recently

What do you mean by "nothing"? Not even the ps header??

The command just no respond, even run `ps aux`, it just stuck there.

Is the freeze limited to the browser?
What browser?

Not only the browser, but mostly freeze while I was still browsing. Both Firefox and Brave, which I run both sometimes.

There seems to be an issue accessing the zswap - do you have physical swayp space?

I am using a `swapfile` instead of swap partition.

Does this also happen when disabling zswap, https://wiki.archlinux.org/title/Zswap ?

Didn't try yet.

Are you simply running OOM?

Yes, I am running systemd-oomd.

can you kill -9 the browser process(es)?

Not able. The kill command just stuck there.

By the way, I suspect it is similar or related to this issue (https://bbs.archlinux.org/viewtopic.php?id=293400).

P/S: I have upgraded the kernel to 6.7.8, and also disabled the zswap. Hopefully this solves the issue.

Last edited by allencch (2024-03-05 12:48:31)

Offline

#4 2024-03-05 15:37:06

seth
Member
Registered: 2012-09-03
Posts: 52,011

Re: [SOLVED] Laptop frequently freeze recently

Yes, I am running systemd-oomd.

No, meant whether you're using more RAM that you physically and swap…ically have - but likely not when you've a swapfile on disk.

I suspect it is similar or related to this issue

The backtraces are wildly different, though.
But there's a 3rd thread that has no windows/ntfs involvement.

also disabled the zswap. Hopefully this solves the issue

W/ zswap disabled you should absolutely not get that specific oops with that specific backtrace - maybe something else flares up, hinting at deeper issues.

Also

I have faced a serious laptop freeze problem since last night.

Ok, so what happened last night… errrr… to be clear: I mean, did you update the kernel or other packages or was there a power surge or anything that could explain why this started *now*?

Offline

#5 2024-03-05 15:43:49

torvic9
Member
Registered: 2022-08-26
Posts: 8

Re: [SOLVED] Laptop frequently freeze recently

Can you try booting with

`numa=off` 

or alternatively try this patch (coming soon to 6.7.9):

https://lore.kernel.org/all/20240216111 … hul@sk.com

Offline

#6 2024-03-05 16:20:05

allencch
Member
Registered: 2011-03-25
Posts: 118

Re: [SOLVED] Laptop frequently freeze recently

Ok, so what happened last night… errrr… to be clear: I mean, did you update the kernel or other packages or was there a power surge or anything that could explain why this started *now*?

My last upgrade was done on 2nd March (linux 6.7.6.arch1-1 -> 6.7.6.arch1-2). But recently (can't remember since when), I experienced computer freeze a few times, which I had to force shutdown with the power button.
A special thing I faced last night is that my working git repo has corrupted commits. Possibly caused by the freeze issue with force shutdown.

W/ zswap disabled you should absolutely not get that specific oops with that specific backtrace - maybe something else flares up, hinting at deeper issues.

If without zswap can resolve the freeze issue, I would prefer to continue without zswap.

Offline

#7 2024-03-05 16:21:46

allencch
Member
Registered: 2011-03-25
Posts: 118

Re: [SOLVED] Laptop frequently freeze recently

torvic9 wrote:

Can you try booting with

`numa=off` 

or alternatively try this patch (coming soon to 6.7.9):

https://lore.kernel.org/all/20240216111 … hul@sk.com

Thank you. I will try `numa=off` if still facing the freeze issue or failed to unmount issue.

Offline

#8 2024-03-05 16:24:26

seth
Member
Registered: 2012-09-03
Posts: 52,011

Re: [SOLVED] Laptop frequently freeze recently

You should probably just try the numa=off kernel parameter, the related bug is very deep in memory management and will affect all sorts of things (and would confirm your previous suspicion)

Offline

#9 2024-03-05 16:58:41

allencch
Member
Registered: 2011-03-25
Posts: 118

Re: [SOLVED] Laptop frequently freeze recently

seth wrote:

You should probably just try the numa=off kernel parameter, the related bug is very deep in memory management and will affect all sorts of things (and would confirm your previous suspicion)

Thank you for the advice. I will add `numa=off` to the kernel parameter.

Thanks @torvic9 for the suggestion as well.

Hopefully this can solve.

Offline

#10 2024-03-06 01:01:56

godman180
Member
Registered: 2024-03-06
Posts: 3

Re: [SOLVED] Laptop frequently freeze recently

I was getting similar issues and believe I solved it here: https://bbs.archlinux.org/viewtopic.php … 9#p2155079

Basically, try installing nvidia-open-dkms and adding numa=off to your kernel command line.

Offline

#11 2024-03-06 03:50:52

allencch
Member
Registered: 2011-03-25
Posts: 118

Re: [SOLVED] Laptop frequently freeze recently

godman180 wrote:

I was getting similar issues and believe I solved it here: https://bbs.archlinux.org/viewtopic.php … 9#p2155079

Basically, try installing nvidia-open-dkms and adding numa=off to your kernel command line.


I will try this. So far I have no oops issue and unmount issue, but this morning my computer just suddenly freeze. Probably have to try nvidia-open-dkms instead.

Offline

#12 2024-03-07 02:38:57

allencch
Member
Registered: 2011-03-25
Posts: 118

Re: [SOLVED] Laptop frequently freeze recently

Am using nvidia-dkms-open. So far no sudden freeze. Will observe for more.

However, there is another problem with NVidia, not related to the freeze. If I switch between TTY, the TTY7 will become a black screen but the mouse is still movable.

There is another thread discussing this issue (black screen with functional mouse) https://bbs.archlinux.org/viewtopic.php?id=290126 . Unlike that, switching TTY doesn't solve the black screen problem, but triggers the black screen problem. sad

Offline

#13 2024-03-07 09:38:36

seth
Member
Registered: 2012-09-03
Posts: 52,011

Re: [SOLVED] Laptop frequently freeze recently

The other thread revolves around waking from hibernation and the nvidia sleep hooks.

become a black screen but the mouse is still movable

means that the compositor (which one) lost the textures (this /is/ a problem w/ nvidia and framebuffer switches as well as S3 resumes, latter because of VRAM decay what the services in the other thread mitigate)

Do you use either of "nvidia_drm.modeset=1" or "nvidia_drm.fbdev=1"?
Did you consider revertingt to the 535xx drivers (-dkms and -utils) from the https://wiki.archlinux.org/title/Ala ?

Offline

#14 2024-03-07 12:45:26

allencch
Member
Registered: 2011-03-25
Posts: 118

Re: [SOLVED] Laptop frequently freeze recently

Do you use either of "nvidia_drm.modeset=1" or "nvidia_drm.fbdev=1"?
Did you consider revertingt to the 535xx drivers (-dkms and -utils) from the https://wiki.archlinux.org/title/Ala ?

Neither set "nvidia_drm.modeset=1" nor "nvidia_drm.fbdev=1".

Not intend to revert to 535xx drivers for the moment, unless TTY black screen issue affects the working experience. I failed to build the module (DKMS) previously, not 535xx drivers, but 545xx drivers.

Error! Bad return status for module build on kernel: 6.7.3-arch1-1 (x86_64)

Offline

#15 2024-03-07 15:24:12

seth
Member
Registered: 2012-09-03
Posts: 52,011

Re: [SOLVED] Laptop frequently freeze recently

You very most certainly want "nvidia_drm.modeset=1" w/ the blob (and try for the nvidia-open drivers as well)
"nvidia_drm.fbdev=0/1" would be something to experiment with itr (it's certainly relevant for the framebuffer handling, but idk at hand whether nvidia-open implements that)

Offline

#16 2024-03-08 16:25:07

allencch
Member
Registered: 2011-03-25
Posts: 118

Re: [SOLVED] Laptop frequently freeze recently

With nvidia-open-dkms, I tried "nvidia_drm.modeset=1" and "nvidia_drm.fbdev=1" (on another boot), both didn't solve changing TTY causes black screen with mouse on TTY7.

Offline

#17 2024-03-09 16:22:00

allencch
Member
Registered: 2011-03-25
Posts: 118

Re: [SOLVED] Laptop frequently freeze recently

I have resolved the black screen issue as well. Since the "compositor" was mentioned, I found that "picom" is the root cause of the black screen when switching TTY. Kill picom or not running picom, switching TTY works fine.

As a result, I used "vsync = true;" in picom.conf, and it works.

As a conclusion of this thread, I

1. Added "zswap.enabled=0" and "numa=off" to the kernel parameter
2. Changed the nvidia to nvidia-open-dkms
3. Added "vsync = true;" to picom.conf

Offline

#18 2024-03-11 10:58:02

dartfira
Member
Registered: 2010-08-23
Posts: 104

Re: [SOLVED] Laptop frequently freeze recently

Is there any track / bug report open about that issue?
Just to know when will be resolved.

Last edited by dartfira (2024-03-11 10:58:26)

Offline

Board footer

Powered by FluxBB