You are not logged in.

#1 2021-01-29 07:01:33

omernaveedxyz
Guest

[Solved] Recurring crashes on DWM, unknown reason

Hello,

I have been using DWM for around three months now, and have just recently (~2 weeks) noticed recurring crashing on my laptop. I have not updated my version of dwm so I am not sure as to what is causing this issue. It seems to happen sometimes when I open a new window on any application, the screen will freeze but the audio will continue playing. The keyboard will however be unresponsive and after ~30 seconds the fans will spin up to full speed only to go back down after ~10 seconds. I am not sure what logs or such to send so please let me know any further details you will need to help me diagnose this issue. I have not tested any other window manager or desktop environment so I am not sure if this issue is specific to DWM either.

Initially I thought it may have been the package "cronie" that I was using to run cron jobs (as this was my latest change), but even after disabling all of my jobs the issue still occurs.

This is issue is also very hard to replicate as it seems to happen at random times, but I will see if I can diagnose what exactly causes it.

Update (Feb 16, 2021): Had a feeling that my browser (Brave) may have been causing this. After uninstalling Brave-bin and moving to chromium, have not had a system crash in the last week. Not sure why Brave would be causing this issue (can't find any posts with similar issues), but I am concluding that Brave was the issue with my system.

Last edited by omernaveedxyz (2021-02-16 19:36:03)

#2 2021-01-29 08:34:36

seth
Member
From: Don't DM me only for attention
Registered: 2012-09-03
Posts: 73,185

Re: [Solved] Recurring crashes on DWM, unknown reason

Does it "unfreeze" after a while? Can you trigger a "clean" shutdown?
Sound like a halted kernel (audio continuation is normal) - did you check your journal?

Offline

#3 2021-01-29 08:39:26

omernaveedxyz
Guest

Re: [Solved] Recurring crashes on DWM, unknown reason

seth wrote:

Does it "unfreeze" after a while? Can you trigger a "clean" shutdown?
Sound like a halted kernel (audio continuation is normal) - did you check your journal?

Haven't left if frozen for more than 5 minutes, but it did not unfreeze within that time. I also am not sure what you mean by a "clean" shutdown, but I have to hold down the power button and I think that is a "hard" shutdown. When I close the lid to put the device to hibernate and reopen however, the screen wakes but is still frozen and my bluetooth headset disconnects, which I assume means it enters and exits hibernate correctly. Also, am not sure what specific you want me to journal, but running

journalctl -b -1

doesn't give anything useful.

...
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9394] dhcp4 (wlp59s0): option d>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9394] dhcp4 (wlp59s0): option d>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9395] dhcp4 (wlp59s0): option d>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9395] dhcp4 (wlp59s0): option e>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9395] dhcp4 (wlp59s0): option h>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9395] dhcp4 (wlp59s0): option i>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9396] dhcp4 (wlp59s0): option n>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9396] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9396] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9396] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9396] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9397] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9397] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9397] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9397] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9397] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9398] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9398] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9398] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9398] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9398] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9399] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9399] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9399] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9399] dhcp4 (wlp59s0): option r>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9399] dhcp4 (wlp59s0): option s>
Jan 29 00:16:45 Arch NetworkManager[403]: <info>  [1611901005.9400] dhcp4 (wlp59s0): state ch>
Jan 29 00:16:55 Arch systemd[1]: NetworkManager-dispatcher.service: Succeeded.
Jan 29 00:16:55 Arch audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 msg='u>
Jan 29 00:16:55 Arch kernel: audit: type=1131 audit(1611901015.207:94): pid=1 uid=0 auid=4294>
Jan 29 00:50:53 Arch systemd-logind[405]: Power key pressed.

Last edited by omernaveedxyz (2021-01-29 08:40:34)

#4 2021-01-29 08:45:30

seth
Member
From: Don't DM me only for attention
Registered: 2012-09-03
Posts: 73,185

Re: [Solved] Recurring crashes on DWM, unknown reason

It cah't - the hard reboot prevents the journal from being sync'd to disk.
You'll need a crash kernel, https://wiki.archlinux.org/index.php/Kdump

Wild guess: https://wiki.archlinux.org/index.php/In … ete_freeze

Also record

lscpu
lspci
lsusb

Maybe there's a known "problematic" device in there.

Offline

#5 2021-01-29 08:54:24

omernaveedxyz
Guest

Re: [Solved] Recurring crashes on DWM, unknown reason

Have included some ouputs below, I'll try to figure out how to setup kernel dumps. Gonna guess that

intel_idle.max_cstate=1

won't work since I'm not running a compatible cpu.

[omer@Arch ~] $ lscpu
Architecture:                    x86_64
CPU op-mode(s):                  32-bit, 64-bit
Byte Order:                      Little Endian
Address sizes:                   39 bits physical, 48 bits virtual
CPU(s):                          12
On-line CPU(s) list:             0-11
Thread(s) per core:              2
Core(s) per socket:              6
Socket(s):                       1
NUMA node(s):                    1
Vendor ID:                       GenuineIntel
CPU family:                      6
Model:                           158
Model name:                      Intel(R) Core(TM) i7-9750H CPU @ 2.60GHz
Stepping:                        10
CPU MHz:                         845.743
CPU max MHz:                     4500.0000
CPU min MHz:                     800.0000
BogoMIPS:                        5202.65
Virtualization:                  VT-x
L1d cache:                       192 KiB
L1i cache:                       192 KiB
L2 cache:                        1.5 MiB
L3 cache:                        12 MiB
NUMA node0 CPU(s):               0-11
Vulnerability Itlb multihit:     KVM: Mitigation: VMX disabled
Vulnerability L1tf:              Mitigation; PTE Inversion; VMX conditional cache flushes, SMT
                                  vulnerable
Vulnerability Mds:               Mitigation; Clear CPU buffers; SMT vulnerable
Vulnerability Meltdown:          Mitigation; PTI
Vulnerability Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl and s
                                 eccomp
Vulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanit
                                 ization
Vulnerability Spectre v2:        Mitigation; Full generic retpoline, IBPB conditional, IBRS_FW
                                 , STIBP conditional, RSB filling
Vulnerability Srbds:             Mitigation; Microcode
Vulnerability Tsx async abort:   Not affected
Flags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
                                  pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe sy
                                 scall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs
                                  bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni
                                  pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma c
                                 x16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_dead
                                 line_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetc
                                 h cpuid_fault epb invpcid_single pti ssbd ibrs ibpb stibp tpr
                                 _shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust
                                  bmi1 avx2 smep bmi2 erms invpcid mpx rdseed adx smap clflush
                                 opt intel_pt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat p
                                 ln pts hwp hwp_notify hwp_act_window hwp_epp md_clear flush_l
                                 1d
[omer@Arch ~] $ lspci
00:00.0 Host bridge: Intel Corporation 8th Gen Core Processor Host Bridge/DRAM Registers (rev
07)
00:01.0 PCI bridge: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor PCIe Controller (x16) (rev 07)
00:02.0 VGA compatible controller: Intel Corporation UHD Graphics 630 (Mobile)
00:04.0 Signal processing controller: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Thermal Subsystem (rev 07)
00:08.0 System peripheral: Intel Corporation Xeon E3-1200 v5/v6 / E3-1500 v5 / 6th/7th/8th Gen Core Processor Gaussian Mixture Model
00:12.0 Signal processing controller: Intel Corporation Cannon Lake PCH Thermal Controller (rev 10)
00:14.0 USB controller: Intel Corporation Cannon Lake PCH USB 3.1 xHCI Host Controller (rev 10)
00:14.2 RAM memory: Intel Corporation Cannon Lake PCH Shared SRAM (rev 10)
00:15.0 Serial bus controller [0c80]: Intel Corporation Cannon Lake PCH Serial IO I2C Controller #0 (rev 10)
00:15.1 Serial bus controller [0c80]: Intel Corporation Cannon Lake PCH Serial IO I2C Controller #1 (rev 10)
00:16.0 Communication controller: Intel Corporation Cannon Lake PCH HECI Controller (rev 10)
00:17.0 SATA controller: Intel Corporation Cannon Lake Mobile PCH SATA AHCI Controller (rev 10)
00:1b.0 PCI bridge: Intel Corporation Cannon Lake PCH PCI Express Root Port #17 (rev f0)
00:1c.0 PCI bridge: Intel Corporation Cannon Lake PCH PCI Express Root Port #1 (rev f0)
00:1c.4 PCI bridge: Intel Corporation Cannon Lake PCH PCI Express Root Port #5 (rev f0)
00:1d.0 PCI bridge: Intel Corporation Cannon Lake PCH PCI Express Root Port #9 (rev f0)
00:1f.0 ISA bridge: Intel Corporation Cannon Lake LPC Controller (rev 10)
00:1f.3 Audio device: Intel Corporation Cannon Lake PCH cAVS (rev 10)
00:1f.4 SMBus: Intel Corporation Cannon Lake PCH SMBus Controller (rev 10)
00:1f.5 Serial bus controller [0c80]: Intel Corporation Cannon Lake PCH SPI Controller (rev 10)
01:00.0 3D controller: NVIDIA Corporation TU117M [GeForce GTX 1650 Mobile / Max-Q] (rev a1)
02:00.0 PCI bridge: Intel Corporation JHL6340 Thunderbolt 3 Bridge (C step) [Alpine Ridge 2C 2016] (rev 02)
03:00.0 PCI bridge: Intel Corporation JHL6340 Thunderbolt 3 Bridge (C step) [Alpine Ridge 2C 2016] (rev 02)
03:01.0 PCI bridge: Intel Corporation JHL6340 Thunderbolt 3 Bridge (C step) [Alpine Ridge 2C 2016] (rev 02)
03:02.0 PCI bridge: Intel Corporation JHL6340 Thunderbolt 3 Bridge (C step) [Alpine Ridge 2C 2016] (rev 02)
04:00.0 System peripheral: Intel Corporation JHL6340 Thunderbolt 3 NHI (C step) [Alpine Ridge
2C 2016] (rev 02)
3a:00.0 USB controller: Intel Corporation JHL6340 Thunderbolt 3 USB 3.1 Controller (C step) [Alpine Ridge 2C 2016] (rev 02)
3b:00.0 Network controller: Intel Corporation Wi-Fi 6 AX200 (rev 1a)
3c:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS525A PCI Express Card Reader (rev 01)
3d:00.0 Non-Volatile memory controller: Samsung Electronics Co Ltd NVMe SSD Controller SM981/PM981/PM983
[omer@Arch ~] $ lsusb
Bus 004 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
Bus 003 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
Bus 001 Device 003: ID 27c6:5395 Shenzhen Goodix Technology Co.,Ltd. Fingerprint Reader
Bus 001 Device 002: ID 8087:0029 Intel Corp. AX200 Bluetooth
Bus 001 Device 004: ID 0c45:6723 Microdia Integrated_Webcam_HD
Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub

Last edited by omernaveedxyz (2021-01-29 08:55:19)

#6 2021-01-29 12:27:51

seth
Member
From: Don't DM me only for attention
Registered: 2012-09-03
Posts: 73,185

Re: [Solved] Recurring crashes on DWM, unknown reason

Later generations might be affected as well, though the upstream bug claims a fix in 5.3…
There were recently several reports reg. your wifi chip, but none amounted to a kernel halt.

How do you use the hybrid graphics (do you use the intel chip only as crtc or as your main rendering device)?

Offline

#7 2021-01-29 19:35:18

omernaveedxyz
Guest

Re: [Solved] Recurring crashes on DWM, unknown reason

Yeah I was looking at that patch, apparently some users are still complaining of the issue being present. I'll test it out then.

I only have my intel Graphics enabled (don't have any drivers installed for nvidia gpu). Also followed this guide to fully disable the dedicated gpu. Used acpi_call method.

https://wiki.archlinux.org/index.php/Hy … screte_GPU

Last edited by omernaveedxyz (2021-01-29 20:11:27)

#8 2021-01-29 20:41:39

omernaveedxyz
Guest

Re: [Solved] Recurring crashes on DWM, unknown reason

Tried to setup Kdumps, but I'm struggling. Follow this guide: https://wiki.archlinux.org/index.php/Kdump, the default kernel config was fine so I did not edit that (meaning I did not setup a separate kernel for kdumps). I set crashkernel=512M

[omer@Arch ~] $ cat /proc/cmdline
ro root=/dev/nvme0n1p4 crashkernel=512M initrd=\initramfs-linux.img

And then setup kexec according to the guide

[omer@Arch ~] $ cat /sys/kernel/kexec_crash_loaded
1

Tried to test using # echo c > /proc/sysrq-trigger but get error. Makedumpfile also gives a similar error

[omer@Arch ~] $ sudo cp /proc/vmcore /root/crash.dump
cp: cannot stat '/proc/vmcore': No such file or directory

Last edited by omernaveedxyz (2021-01-29 20:42:59)

#9 2021-01-29 21:00:26

seth
Member
From: Don't DM me only for attention
Registered: 2012-09-03
Posts: 73,185

Re: [Solved] Recurring crashes on DWM, unknown reason

but get error

Get what error?
Do you have sysreq enabled?
https://wiki.archlinux.org/index.php/Ke … el_(SysRq)

Offline

#10 2021-01-29 21:25:39

omernaveedxyz
Guest

Re: [Solved] Recurring crashes on DWM, unknown reason

seth wrote:

but get error

Get what error?
Do you have sysreq enabled?
https://wiki.archlinux.org/index.php/Ke … el_(SysRq)

The code I provided in the reply. there is not file /proc/vmcore. I do not have Sysreq enabled.

#11 2021-01-29 21:27:58

seth
Member
From: Don't DM me only for attention
Registered: 2012-09-03
Posts: 73,185

Re: [Solved] Recurring crashes on DWM, unknown reason

Ah, you're not getting an error message from echo'ing into /proc/sysrq-trigger but meant the missing vmcore.
You'll require sysrq support to make "echo c > /proc/sysrq-trigger" artificially crash the system.

Offline

#12 2021-01-29 21:32:03

omernaveedxyz
Guest

Re: [Solved] Recurring crashes on DWM, unknown reason

Same issue, but now on hard reboot after artificial crash, one of my cpu cores is pegged at 100% usage.

#13 2021-01-29 21:56:06

seth
Member
From: Don't DM me only for attention
Registered: 2012-09-03
Posts: 73,185

Re: [Solved] Recurring crashes on DWM, unknown reason

"same issue" like there's no vmcore or "same issue" like the system freezes and you need to perform a hard reset?
Did you run kexec … before triggering the crash?

What process occupies a complete core?

Offline

#14 2021-01-29 22:04:11

omernaveedxyz
Guest

Re: [Solved] Recurring crashes on DWM, unknown reason

seth wrote:

"same issue" like there's no vmcore or "same issue" like the system freezes and you need to perform a hard reset?
Did you run kexec … before triggering the crash?

What process occupies a complete core?

Nevermind, I managed to get the dump file. Now I am not sure how to read it. I have crash installed, but I'm not sure what to specify for vmlinux.

$ crash vmlinux path/crash.dump

Last edited by omernaveedxyz (2021-01-29 22:05:39)

#15 2021-01-30 07:37:23

seth
Member
From: Don't DM me only for attention
Registered: 2012-09-03
Posts: 73,185

Re: [Solved] Recurring crashes on DWM, unknown reason

"vmlinux" is your kernel, resp. the one that generated the crash.
If that's the currently running one - and the /boot directory mounted -  you can usually omit that parameter.

Offline

#16 2021-02-01 04:19:03

omernaveedxyz
Guest

Re: [Solved] Recurring crashes on DWM, unknown reason

seth wrote:

"vmlinux" is your kernel, resp. the one that generated the crash.
If that's the currently running one - and the /boot directory mounted -  you can usually omit that parameter.

Just had another crash. The issue is that my screen never unfroze so I never managed to load the capture kernel. Is there a way around this? I left my device for 10 minutes but it was still unresponsive with the fans spinning at full speed... Had to hard reboot in the end, which means I couldn't get the kernel dump

Last edited by omernaveedxyz (2021-02-01 04:19:31)

#17 2021-02-01 09:25:07

seth
Member
From: Don't DM me only for attention
Registered: 2012-09-03
Posts: 73,185

Re: [Solved] Recurring crashes on DWM, unknown reason

If you had setup the crash kernel correctly this sounds more like a HW issue than a kernel bug.
Check the RAM first, https://wiki.archlinux.org/index.php/St … ing_memory and inspect the HDD, https://wiki.archlinux.org/index.php/Smart

Offline

#18 2021-02-02 00:18:01

omernaveedxyz
Guest

Re: [Solved] Recurring crashes on DWM, unknown reason

I have just recently upgraded to 32GB of memory. I don't think that is the issue since it is all detected correctly, but I will test it out. As for my HDD, it is a m.2 nvme Samsung 2TB drive but it apparently does not support SMART, or at least Arch isn't able to detect it. If this is a hardware issue that is quite problematic. I have Windows dual-booted on my laptop, so I think I will use Windows and see if I get the same crash.

#19 2021-02-02 07:45:45

seth
Member
From: Don't DM me only for attention
Registered: 2012-09-03
Posts: 73,185

Re: [Solved] Recurring crashes on DWM, unknown reason

I have just recently upgraded to 32GB of memory.

Replaced or added?
If you mix different RAM modules, the slot positioning becomes very important (consult the board manual) and of course there's a chance that a module isn't properly seated.
You may also have to pick more conservative timings (BIOS)

If the story is "I changed the HW and ever since then there're random hard crashes", that's self explanatory ;-)

Offline

#20 2021-02-10 00:17:56

omernaveedxyz
Guest

Re: [Solved] Recurring crashes on DWM, unknown reason

Update: I have since downgraded back to the RAM that came with my laptop and have made a clean install of Arch. Two days in, I once again experienced the same crash (screen freeze, keyboard freeze, audio continues). I'm at a complete loss for what to do. Gonna run Windows 10 for a week (since I'm dual-booted) and will see if this issue is reproduceable in Windows 10.

Last edited by omernaveedxyz (2021-02-10 01:25:07)

Board footer

Powered by FluxBB