You are not logged in.

#26 2018-05-18 09:35:48

pkejr
Member
Registered: 2018-04-30
Posts: 12

Re: System freeze randomly

Hi,
I am very frustrated because I didn't have a single freeze since more than a week and I don't know why.

 11:22:37 up 8 days, 20:28,  2 users,  load average: 0.68, 0.61, 0.54

One thing I did though (didn't reboot since, or restarted X though) is to add this to a file in /etc/X11/Xorg.conf.d

Section "Device"
    Identifier      "Intel Graphics"
    Driver          "intel"
    Option          "DRI" "false"
    Option          "AccelMethod"   "uxa"
    Option          "NoAccel"       "True"
EndSection

I got those information from here: http://www.thinkwiki.org/wiki/Category:X250
But I don't think it got actually applied, unless Xorg reloads the files in /etc/X11/xorg.conf.d from time to time?
I tried commenting it out to see if that was the thing that fixed it.
But yeah as said very frustrated because I'm quite sure the things I tried before (disabling TLP, swap and updating microcode) didn't fix it, and I'm pretty sure that this Xorg config is not applied (no reboot or restart).

Last edited by pkejr (2018-05-18 09:37:45)

Offline

#27 2018-05-19 03:42:12

CarbonChauvinist
Member
Registered: 2012-06-16
Posts: 50

Re: System freeze randomly

I too have been plagued by this problem to the point where my laptop is almost unusable. Here are some full journals from recent occurences: one, and two, and three. And a /var/log/Xorg.0.log and a /var/log/Xorg.0.log.old from a boot after a freeze.

Unsure if should open my own thread, but the initial description in this thread is spot on and exactly what I experience. I run LVM on LUKS on EXT4. I have a Dell Precision 5510 and it has a buggy ACPI, not sure if that's the root cause:

$ journalctl -b -p err | ag ACPI
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.XHC.RHUB.HS11], AE_NOT_FOUND (20180105/dswload-211)
May 18 18:30:23 archlinux kernel: ACPI Error: AE_NOT_FOUND, During name lookup/catalog (20180105/psobject-252)
May 18 18:30:23 archlinux kernel: ACPI Error: AE_NOT_FOUND, (SSDT:xh_rvp10) while loading table (20180105/tbxfload-228)
May 18 18:30:23 archlinux kernel: ACPI Error: 1 table load failures, 10 successful (20180105/tbxfload-246)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.PEG0.PEGP.TDGC], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_SB.PCI0.PEG0.PG00._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FNCL, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FN00._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FNCL, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FN00._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FNCL, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FN01._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FNCL, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FN01._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FNCL, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FN02._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FNCL, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FN02._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FNCL, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FN03._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FNCL, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FN03._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FNCL, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FN04._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FNCL, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.FN04._ON, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.TZ00._TMP, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.TZ00._TMP, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.TZ01._TMP, AE_NOT_FOUND (20180105/psparse-550)
May 18 18:30:23 archlinux kernel: ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.LPCB.HEC.ECAV], AE_NOT_FOUND (20180105/psargs-364)
May 18 18:30:23 archlinux kernel: ACPI Error: Method parse/execution failed \_TZ.TZ01._TMP, AE_NOT_FOUND (20180105/psparse-550)

My system:

$ uname -a
Linux 5510 4.16.9-1-ARCH #1 SMP PREEMPT Thu May 17 02:10:09 UTC 2018 x86_64 GNU/Linux
$ lspci -nnv
00:00.0 Host bridge [0600]: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Host Bridge/DRAM Registers [8086:1910] (rev 07)
        Subsystem: Dell Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Host Bridge/DRAM Registers [1028:06e5]
        Flags: bus master, fast devsel, latency 0
        Capabilities: <access denied>
        Kernel driver in use: skl_uncore

00:02.0 VGA compatible controller [0300]: Intel Corporation HD Graphics 530 [8086:191b] (rev 06) (prog-if 00 [VGA controller])
        Subsystem: Dell HD Graphics 530 [1028:06e5]
        Flags: bus master, fast devsel, latency 0, IRQ 123
        Memory at db000000 (64-bit, non-prefetchable) [size=16M]
        Memory at 90000000 (64-bit, prefetchable) [size=256M]
        I/O ports at f000 [size=64]
        [virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
        Capabilities: <access denied>
        Kernel driver in use: i915
        Kernel modules: i915

00:04.0 Signal processing controller [1180]: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Thermal Subsystem [8086:1903] (rev 07)
        Subsystem: Dell Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Thermal Subsystem [1028:06e5]
        Flags: fast devsel, IRQ 16
        Memory at dcc20000 (64-bit, non-prefetchable) [size=32K]
        Capabilities: <access denied>
        Kernel driver in use: proc_thermal
        Kernel modules: processor_thermal_device

00:14.0 USB controller [0c03]: Intel Corporation Sunrise Point-H USB 3.0 xHCI Controller [8086:a12f] (rev 31) (prog-if 30 [XHCI])
        Subsystem: Dell Sunrise Point-H USB 3.0 xHCI Controller [1028:06e5]
        Flags: bus master, medium devsel, latency 0, IRQ 125
        Memory at dcc10000 (64-bit, non-prefetchable) [size=64K]
        Capabilities: <access denied>
        Kernel driver in use: xhci_hcd
        Kernel modules: xhci_pci

00:14.2 Signal processing controller [1180]: Intel Corporation Sunrise Point-H Thermal subsystem [8086:a131] (rev 31)
        Subsystem: Dell Sunrise Point-H Thermal subsystem [1028:06e5]
        Flags: bus master, fast devsel, latency 0, IRQ 18
        Memory at dcc39000 (64-bit, non-prefetchable) [size=4K]
        Capabilities: <access denied>
        Kernel driver in use: intel_pch_thermal
        Kernel modules: intel_pch_thermal

00:15.0 Signal processing controller [1180]: Intel Corporation Sunrise Point-H Serial IO I2C Controller #0 [8086:a160] (rev 31)
        Subsystem: Dell Sunrise Point-H Serial IO I2C Controller [1028:06e5]
        Flags: bus master, fast devsel, latency 0, IRQ 16
        Memory at dcc38000 (64-bit, non-prefetchable) [size=4K]
        Capabilities: <access denied>
        Kernel driver in use: intel-lpss
        Kernel modules: intel_lpss_pci

00:15.1 Signal processing controller [1180]: Intel Corporation Sunrise Point-H Serial IO I2C Controller #1 [8086:a161] (rev 31)
        Subsystem: Dell Sunrise Point-H Serial IO I2C Controller [1028:06e5]
        Flags: bus master, fast devsel, latency 0, IRQ 17
        Memory at dcc37000 (64-bit, non-prefetchable) [size=4K]
        Capabilities: <access denied>
        Kernel driver in use: intel-lpss
        Kernel modules: intel_lpss_pci

00:16.0 Communication controller [0780]: Intel Corporation Sunrise Point-H CSME HECI #1 [8086:a13a] (rev 31)
        Subsystem: Dell Sunrise Point-H CSME HECI [1028:06e5]
        Flags: bus master, fast devsel, latency 0, IRQ 130
        Memory at dcc36000 (64-bit, non-prefetchable) [size=4K]
        Capabilities: <access denied>
        Kernel driver in use: mei_me
        Kernel modules: mei_me

00:16.3 Serial controller [0700]: Intel Corporation Sunrise Point-H KT Redirection [8086:a13d] (rev 31) (prog-if 02 [16550])
        Subsystem: Dell Sunrise Point-H KT Redirection [1028:06e5]
        Flags: 66MHz, fast devsel, IRQ 19
        I/O ports at f0a0 [size=8]
        Memory at dcc35000 (32-bit, non-prefetchable) [size=4K]
        Capabilities: <access denied>
        Kernel driver in use: serial

00:17.0 SATA controller [0106]: Intel Corporation Sunrise Point-H SATA controller [AHCI mode] [8086:a102] (rev 31) (prog-if 01 [AHCI 1.0])
        Subsystem: Dell Sunrise Point-H SATA controller [AHCI mode] [1028:06e5]
        Flags: bus master, 66MHz, medium devsel, latency 0, IRQ 126
        Memory at dcc30000 (32-bit, non-prefetchable) [size=8K]
        Memory at dcc34000 (32-bit, non-prefetchable) [size=256]
        I/O ports at f090 [size=8]
        I/O ports at f080 [size=4]
        I/O ports at f060 [size=32]
        Memory at dcc33000 (32-bit, non-prefetchable) [size=2K]
        Capabilities: <access denied>
        Kernel driver in use: ahci
        Kernel modules: ahci

00:1c.0 PCI bridge [0604]: Intel Corporation Sunrise Point-H PCI Express Root Port #1 [8086:a110] (rev f1) (prog-if 00 [Normal decode])
        Flags: bus master, fast devsel, latency 0, IRQ 16
        Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
        I/O behind bridge: None
        Memory behind bridge: dcb00000-dcbfffff [size=1M]
        Prefetchable memory behind bridge: None
        Capabilities: <access denied>
        Kernel driver in use: pcieport
        Kernel modules: shpchp

00:1c.1 PCI bridge [0604]: Intel Corporation Sunrise Point-H PCI Express Root Port #2 [8086:a111] (rev f1) (prog-if 00 [Normal decode])
        Flags: bus master, fast devsel, latency 0, IRQ 17
        Bus: primary=00, secondary=02, subordinate=02, sec-latency=0
        I/O behind bridge: 0000e000-0000efff [size=4K]
        Memory behind bridge: dc000000-dc9fffff [size=10M]
        Prefetchable memory behind bridge: 00000000c2100000-00000000c2afffff [size=10M]
        Capabilities: <access denied>
        Kernel driver in use: pcieport
        Kernel modules: shpchp

00:1d.0 PCI bridge [0604]: Intel Corporation Sunrise Point-H PCI Express Root Port #9 [8086:a118] (rev f1) (prog-if 00 [Normal decode])
        Flags: bus master, fast devsel, latency 0, IRQ 16
        Bus: primary=00, secondary=03, subordinate=03, sec-latency=0
        I/O behind bridge: None
        Memory behind bridge: dca00000-dcafffff [size=1M]
        Prefetchable memory behind bridge: None
        Capabilities: <access denied>
        Kernel driver in use: pcieport
        Kernel modules: shpchp

00:1d.4 PCI bridge [0604]: Intel Corporation Sunrise Point-H PCI Express Root Port #13 [8086:a11c] (rev f1) (prog-if 00 [Normal decode])
        Flags: bus master, fast devsel, latency 0, IRQ 16
        Bus: primary=00, secondary=04, subordinate=04, sec-latency=0
        I/O behind bridge: None
        Memory behind bridge: None
        Prefetchable memory behind bridge: None
        Capabilities: <access denied>
        Kernel driver in use: pcieport
        Kernel modules: shpchp

00:1d.6 PCI bridge [0604]: Intel Corporation Sunrise Point-H PCI Express Root Port #15 [8086:a11e] (rev f1) (prog-if 00 [Normal decode])
        Flags: bus master, fast devsel, latency 0, IRQ 18
        Bus: primary=00, secondary=05, subordinate=3d, sec-latency=0
        I/O behind bridge: 00002000-00002fff [size=4K]
        Memory behind bridge: c4000000-da0fffff [size=353M]
        Prefetchable memory behind bridge: 00000000a0000000-00000000c1ffffff [size=544M]
        Capabilities: <access denied>
        Kernel driver in use: pcieport
        Kernel modules: shpchp

00:1f.0 ISA bridge [0601]: Intel Corporation Sunrise Point-H LPC Controller [8086:a150] (rev 31)
        Subsystem: Dell Sunrise Point-H LPC Controller [1028:06e5]
        Flags: bus master, medium devsel, latency 0

00:1f.2 Memory controller [0580]: Intel Corporation Sunrise Point-H PMC [8086:a121] (rev 31)
        Subsystem: Dell Sunrise Point-H PMC [1028:06e5]
        Flags: fast devsel
        Memory at dcc2c000 (32-bit, non-prefetchable) [disabled] [size=16K]

00:1f.3 Audio device [0403]: Intel Corporation Sunrise Point-H HD Audio [8086:a170] (rev 31) (prog-if 80)
        Subsystem: Dell Sunrise Point-H HD Audio [1028:06e5]
        Flags: bus master, fast devsel, latency 32, IRQ 131
        Memory at dcc28000 (64-bit, non-prefetchable) [size=16K]
        Memory at dcc00000 (64-bit, non-prefetchable) [size=64K]
        Capabilities: <access denied>
        Kernel driver in use: snd_hda_intel
        Kernel modules: snd_hda_intel

00:1f.4 SMBus [0c05]: Intel Corporation Sunrise Point-H SMBus [8086:a123] (rev 31)
        Subsystem: Dell Sunrise Point-H SMBus [1028:06e5]
        Flags: medium devsel, IRQ 16
        Memory at dcc32000 (64-bit, non-prefetchable) [size=256]
        I/O ports at f040 [size=32]
        Kernel driver in use: i801_smbus
        Kernel modules: i2c_i801

01:00.0 Network controller [0280]: Intel Corporation Wireless 8260 [8086:24f3] (rev 3a)
        Subsystem: Intel Corporation Wireless 8260 [8086:0050]
        Flags: bus master, fast devsel, latency 0, IRQ 132
        Memory at dcb00000 (64-bit, non-prefetchable) [size=8K]
        Capabilities: <access denied>
        Kernel driver in use: iwlwifi
        Kernel modules: iwlwifi

02:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS525A PCI Express Card Reader [10ec:525a] (rev 01)
        Subsystem: Dell RTS525A PCI Express Card Reader [1028:06e5]
        Flags: bus master, fast devsel, latency 0, IRQ 124
        Memory at dc000000 (32-bit, non-prefetchable) [size=4K]
        Capabilities: <access denied>
        Kernel driver in use: rtsx_pci
        Kernel modules: rtsx_pci

03:00.0 Non-Volatile memory controller [0108]: Samsung Electronics Co Ltd NVMe SSD Controller SM961/PM961 [144d:a804] (prog-if 02 [NVM Express])
        Subsystem: Samsung Electronics Co Ltd NVMe SSD Controller SM961/PM961 [144d:a801]
        Flags: bus master, fast devsel, latency 0, IRQ 16, NUMA node 0
        Memory at dca00000 (64-bit, non-prefetchable) [size=16K]
        Capabilities: <access denied>
        Kernel driver in use: nvme

My mkinitcpio.conf

MODULES=(intel_agp i915)
FILES=(/etc/modprobe.d/psmouse.conf /etc/modprobe.d/itco_wdt.conf /etc/modprobe.d/i915.conf)
HOOKS=(base systemd autodetect keyboard sd-vconsole modconf block sd-encrypt sd-lvm2 filesystems fsck)

The only module options I have explicitly enabled that may be an issue is for 1915

  1. $ cat /etc/modprobe.d/i915.conf 
    options i915 enable_guc=1
  2. $ journalctl -b | ag taint
    May 18 22:09:53 archlinux kernel: Setting dangerous option enable_guc - tainting kernel
    May 18 22:10:00 archlinux systemd[1]: System is tainted: var-run-bad
    May 18 22:10:01 5510 kernel: CPU: 1 PID: 441 Comm: systemd-backlig Tainted: G     U           4.16.9-1-ARCH #1

I've tried everything I can think of including the following:

  1. Making sure kernel parameters are as sparse and sane as possible

    $ cat /proc/cmdline 
    initrd=\intel-ucode.img initrd=\initramfs-linux.img rw rd.luks.uuid=145628bb-0138-4b8b-bc94-2d041c756539 rd.luks.name=145628bb-0138-4b8b-bc94-2d041c756539=lvm root=/dev/lvmvg/root quiet
  2. Disabling swap because of issues with LVM swap

    1. $ cat /etc/fstab
      # Static information about the filesystems.
      # See fstab(5) for details.
      
      # <file system> <dir> <type> <options> <dump> <pass>
      # /dev/mapper/lvmvg-root
      UUID=584e8363-4e34-43fa-9ce1-aa71aa05ee24       /               ext4            rw,relatime,data=ordered            0 1
      
      # /dev/sdb1
      UUID=2F98-463D          /boot           vfat            rw,relatime,fmask=0022,dmask=0022,codepage=437,iocharset=iso8859-1,shortname=mixed,errors=remount-ro    0 2
      
      # /dev/mapper/lvmvg-var
      UUID=25ba1aa0-afca-499e-833e-36ed09591444       /var            ext4            rw,relatime,data=ordered            0 2
      
      # /dev/mapper/lvmvg-home
      UUID=36fd8e5e-1099-4559-83cb-b8e9a641d6b7       /home           ext4            rw,relatime,data=ordered            0 2
      
      # /dev/mapper/lvmvg-swap
      #UUID=c5ad34f4-4b2f-46be-9836-f3b1b87571d4      none            swap            defaults,discard        0 0
    2. $  free -m
                    total        used        free      shared  buff/cache   available
      Mem:           7828        1085        5142         394        1600        6114
      Swap:             0           0           0
  3. Trying different DEs (Plasma and i3), still happens.

  4. Trying different kernels, but also happens on zen and lts.

  5. I did change from modesetting back to the intel driver in hopes that may have been the issue, but I still get the freezes/kernel-panics.

    1. $ cat /etc/X11/xorg.conf.d/20-intel.conf 
      Section "Device"
          Identifier  "Intel Graphics"
          Driver      "intel"
          Option      "DRI"          "3"   # DRI3 is now default 
          Option      "AccelMethod"  "sna" # default
          #Option      "AccelMethod"  "uxa" # fallback
          Option      "TearFree"      "true"  # SNA may cause tearing,  enable "TearFree" to fix
       EndSection
  6. I've disabled any settings from powertop being applied (no TLP etc)

    $ systemctl list-dependencies 
    default.target
    ● ├─sddm.service
    ● └─multi-user.target
    ●   ├─dbus.service
    ●   ├─lm_sensors.service
    ●   ├─man-db.timer
    ●   ├─systemd-ask-password-wall.path
    ●   ├─systemd-logind.service
    ●   ├─systemd-networkd.service
    ●   ├─systemd-resolved.service
    ●   ├─systemd-user-sessions.service
    ●   ├─updatedb.timer
    ●   ├─wpa_supplicant@wifi0.service
    ●   ├─basic.target
    ●   │ ├─-.mount
    ●   │ ├─tmp.mount
    ●   │ ├─var.mount
    ●   │ ├─paths.target
    ●   │ ├─slices.target
    ●   │ │ ├─-.slice
    ●   │ │ └─system.slice
    ●   │ ├─sockets.target
    ●   │ │ ├─dbus.socket
    ●   │ │ ├─dm-event.socket
    ●   │ │ ├─dnscrypt-proxy.socket
    ●   │ │ ├─systemd-coredump.socket
    ●   │ │ ├─systemd-initctl.socket
    ●   │ │ ├─systemd-journald-audit.socket
    ●   │ │ ├─systemd-journald-dev-log.socket
    ●   │ │ ├─systemd-journald.socket
    ●   │ │ ├─systemd-networkd.socket
    ●   │ │ ├─systemd-udevd-control.socket
    ●   │ │ └─systemd-udevd-kernel.socket
    ●   │ ├─sysinit.target
    ●   │ │ ├─dev-hugepages.mount

Additionally, there are many times the system freezes almost instantly after entering my LUKS password - actually just happened two times in a row: (Neither will leave any logs as it happens before journald is up.)

  1. First time the hang was on "A start job is running for dev-lvmvg-root.device (7s / 1min 30s)";

  2. The second time on "A start job is running for Cryptography Setup for LVM (8s / no limit)".

There doesn't appear to be any consistent behavior as to when the system freezes; though there are some actions that seem to be anecdotally linked.

  1. Attempting to use bash auto-complete on a command that has a lot of potential matches to return (for example, trying to auto complete "systemctl status"+[tab to autocomplete] )

  2. Starting the Falkon web browser

+++++++edit++++++++++++
After much pain and gnashing of teeth my issues appear to be related to IOMMU. In my case these are the kernel parameters that allow me to at least use my system without it hard-freezing every other 1-2 mins. Don't get me wrong, it still freezes, but I'm now able to go much longer in between freezes.

iommu=off intel_iommu=off acpi_osi=Linux pci=nocrs pcie_aspm=off

I also turned off intel VT-x in my laptop's BIOS/UEFI settings.

$ journalctl -b |ag kvm:
May 20 16:34:55 5510 kernel: kvm: disabled by bios
May 20 16:34:55 5510 kernel: kvm: disabled by bios

Last edited by CarbonChauvinist (Yesterday 01:03:36)

Offline

#28 Yesterday 01:10:08

qrwteyrutiyoup
Member
From: Canada
Registered: 2017-12-26
Posts: 13

Re: System freeze randomly

Out of curiosity, have you people tried out linux-lts (currently 4.14.x) instead of the regular linux (currently 4.16.x) to see if this issue is also present?

Offline

#29 Yesterday 03:27:09

CarbonChauvinist
Member
Registered: 2012-06-16
Posts: 50

Re: System freeze randomly

@qrwteyrutiyoup, yes - in my case also happens with LTS. If I boot LTS with "iommu=off" also I can at least run htop for a few minutes at a time without locking up.

But, for instance even with "iommu=off" LTS just locked up again when I was just trying to auto-complete in bash (Konsole). "systemctl status system"+[tab] resulted in kernel panic (hard freeze, cant' switch ttys, and blinking capslock - I have to long-hold the power button down to reboot)

Offline

#30 Yesterday 17:24:41

qrwteyrutiyoup
Member
From: Canada
Registered: 2017-12-26
Posts: 13

Re: System freeze randomly

CarbonChauvinist wrote:

@qrwteyrutiyoup, yes - in my case also happens with LTS. If I boot LTS with "iommu=off" also I can at least run htop for a few minutes at a time without locking up.

But, for instance even with "iommu=off" LTS just locked up again when I was just trying to auto-complete in bash (Konsole). "systemctl status system"+[tab] resulted in kernel panic (hard freeze, cant' switch ttys, and blinking capslock - I have to long-hold the power button down to reboot)


I see. I asked because I am experiencing something similar since recently: system freeze plus some kind of audio looping the last 1-2 sec. As I haven't used this laptop very often for a few months, I am guessing it's something recent, as I certainly didn't experience this when I used it heavily in past.

I currently have

intel_iommu=igfx_off

in my boot params, and right now I am testing LTS. No freezes yet (few hours already; with linux, I was having almost one freeze per hour, on average.

You may have a different issue, as yours seems to trigger very quickly. Were you able to collect logs via netconsole/serial console or something like that, to help diagnose the issue when it panics? I am going to try netconsole in a couple days, and hopefully, it can provide helpful info.

Offline

#31 Yesterday 20:29:58

The Loko
Member
From: Spain
Registered: 2014-07-23
Posts: 80

Re: System freeze randomly

I also have the same issue, I have a desktop PC with i5-2200. I'm not sure when this happened first time but it doesn't happens every day, maybe weekly or so, it usually happens on the first hours after power on. I have noticed that sometimes the USB power is cut (USB devices are powered off) and sometimes isn't. I can't notice anything relevant on the logs.
My laptop works fine without any issues.

Last edited by The Loko (Yesterday 20:30:41)

Offline

Board footer

Powered by FluxBB