You are not logged in.

#1 2023-07-14 14:47:39

bradwiggo
Member
Registered: 2023-06-05
Posts: 3

GPU Hang Issue, Laptop completely freezes except cursor, hard reset

A while ago I was having an issue on my laptop where it would experience GPU Hangs which caused the entire system to freeze up. I ended up fixing it by moving from Ubuntu which I was using at the time, to Arch, so I assumed the newer kernel updates helped as my laptop is quite new (specs at bottom of post). However today the issue suddenly happened again. It happened while I was playing a game (Terraria) and watching a video.

I will post the relevant lines from journalctl, if any other log files would be of use I can post them also.

Relevant journalctl lines:

Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] *ERROR* GT0: GUC: Engine reset failed on 0:0 (rcs0) because 0x00000000
Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] GPU HANG: ecode 12:1:84dffffb, in Main Thread [11143]
Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] Resetting chip for GuC failed to reset engine mask=0x1
Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] Main Thread[11143] context reset due to GPU hang
Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] GT0: GuC firmware i915/adlp_guc_70.bin version 70.5.1
Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] GT0: HuC firmware i915/tgl_huc.bin version 7.9.3
Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] GT0: HuC: authenticated!
Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] GT0: GUC: submission enabled
Jul 14 15:25:10 laptop kernel: i915 0000:00:02.0: [drm] GT0: GUC: SLPC enabled
Jul 14 15:25:21 laptop kernel: Asynchronous wait on fence 0000:00:02.0:Xorg[462]:376b8a timed out (hint:intel_atomic_commit_ready [i915])
Jul 14 15:25:21 laptop kernel: Asynchronous wait on fence 0000:00:02.0:Xorg[462]:376b8a timed out (hint:intel_atomic_commit_ready [i915])
Jul 14 15:25:22 laptop kernel: Asynchronous wait on fence 0000:00:02.0:Xorg[462]:376b8a timed out (hint:intel_atomic_commit_ready [i915])
Jul 14 15:25:30 laptop kernel: Fence expiration time out i915-0000:00:02.0:Main Thread<11143>:22a49e!
Jul 14 15:25:33 laptop kernel: Asynchronous wait on fence 0000:00:02.0:Xorg[462]:376b96 timed out (hint:intel_atomic_commit_ready [i915])
Jul 14 15:25:33 laptop kernel: Asynchronous wait on fence 0000:00:02.0:Xorg[462]:376b96 timed out (hint:intel_atomic_commit_ready [i915])
Jul 14 15:25:33 laptop kernel: Asynchronous wait on fence 0000:00:02.0:Xorg[462]:376b96 timed out (hint:intel_atomic_commit_ready [i915])
Jul 14 15:25:53 laptop dbus-daemon[627]: [session uid=1000 pid=627] Activating service name='org.xfce.Xfconf' requested by ':1.14' (uid=1000 pid=751 comm="xfce4-panel --display :0.0 --sm-client-id 200e9553")
Jul 14 15:25:53 laptop dbus-daemon[627]: [session uid=1000 pid=627] Successfully activated service 'org.xfce.Xfconf'
Jul 14 15:26:07 laptop kernel: Asynchronous wait on fence 0000:00:02.0:Xorg[462]:376b96 timed out (hint:intel_atomic_commit_ready [i915])
Jul 14 15:26:32 laptop kernel: Asynchronous wait on fence 0000:00:02.0:Xorg[462]:376b96 timed out (hint:intel_atomic_commit_ready [i915])

What could the issue be? Has anybody else experienced this and if so, did/how did you fix it?

Important Info:

Laptop: Lenovo Yoga 7i Slim Pro
CPU: i7-1260p
RAM: 16GB DDR5
GPU: Integrated
OS: Arch Latest (last fully updated a few days ago)
DE: Xfce 4 with Xfwm
Kernel: 6.4.2-arch1-1

Offline

#2 2023-09-10 20:24:31

Moxon
Member
Registered: 2017-01-30
Posts: 10

Re: GPU Hang Issue, Laptop completely freezes except cursor, hard reset

I have the same issue since a couple of days.  Interesstingly also while playing Terraria.  I tried a 6.4, a 6.5 and also the linux-next kernel, all locking up X after a while of gameplay.

Unfortunately I have no solution.

$ inxi -a
CPU: 14-core (6-mt/8-st) 13th Gen Intel Core i9-13900H (-MST AMCP-)
speed/min/max: 454/400/5200:5400:4100 MHz
Kernel: 6.5.0-next-20230908-1-next-git-14143-gaf3c30d33476 x86_64 Up: 30m
Mem: 4.85/30.99 GiB (15.6%) Storage: 5.5 TiB (46.5% used) Procs: 447
Shell: Bash 5.1.16 inxi: 3.3.29

journalctl:

$ journalctl  --boot -t kernel | rg i915
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] VT-d active for gfx access
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: vgaarb: deactivate vga console
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] Using Transparent Hugepages
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=io+mem
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] Finished loading DMC firmware i915/adlp_dmc.bin (v2.20)
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] GT0: GuC firmware i915/adlp_guc_70.bin version 70.5.1
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] GT0: HuC firmware i915/tgl_huc.bin version 7.9.3
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] GT0: HuC: authenticated for all workloads
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] GT0: GUC: submission enabled
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] GT0: GUC: SLPC enabled
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] GT0: GUC: RC enabled
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] Protected Xe Path (PXP) protected content support initialized
Sep 10 21:50:52 xox kernel: [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.0 on minor 1
Sep 10 21:50:52 xox kernel: fbcon: i915drmfb (fb0) is primary device
Sep 10 21:50:52 xox kernel: i915 0000:00:02.0: [drm] fb0: i915drmfb frame buffer device
Sep 10 21:50:53 xox kernel: mei_pxp 0000:00:16.0-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: bound 0000:00:02.0 (ops i915_pxp_tee_component_ops [i915])
Sep 10 21:50:53 xox kernel: mei_hdcp 0000:00:16.0-b638ab7e-94e2-4ea2-a552-d1c54b627f04: bound 0000:00:02.0 (ops i915_hdcp_ops [i915])
Sep 10 21:50:53 xox kernel: snd_hda_intel 0000:00:1f.3: bound 0000:00:02.0 (ops i915_audio_component_bind_ops [i915])
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] *ERROR* GT0: GUC: Engine reset failed on 0:0 (rcs0) because 0x00000000
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] GPU HANG: ecode 12:1:84dffffb, in Main Thread [5298]
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] Resetting chip for GuC failed to reset engine mask=0x1
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] *ERROR* rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] Main Thread[5298] context reset due to GPU hang
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] GT0: GuC firmware i915/adlp_guc_70.bin version 70.5.1
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] GT0: HuC firmware i915/tgl_huc.bin version 7.9.3
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] GT0: HuC: authenticated for all workloads
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] GT0: GUC: submission enabled
Sep 10 22:15:22 xox kernel: i915 0000:00:02.0: [drm] GT0: GUC: SLPC enabled
Sep 10 22:15:30 xox kernel: i915 0000:00:02.0: [drm] GPU HANG: ecode 12:1:85dfbfff, in Main Thread [5298]
Sep 10 22:15:30 xox kernel: i915 0000:00:02.0: [drm] Main Thread[5298] context reset due to GPU hang
Sep 10 22:15:41 xox kernel: Asynchronous wait on fence 0000:00:02.0:gnome-shell[3167]:24594 timed out (hint:intel_atomic_commit_ready [i915])

Offline

Board footer

Powered by FluxBB