You are not logged in.

#1 2015-09-30 04:45:15

Square
Member
Registered: 2008-06-11
Posts: 435

GPU Hang - i915 [Solved (I hope)]

I can't spend much time in X without getting a GPU hang. Sometimes everything will refresh and I can continue on just fine, but other times the screen will jumble and get completely nonsensical. Any ideas?

[12156.991071] WARNING: CPU: 1 PID: 11162 at drivers/gpu/drm/i915/intel_display.c:3291 intel_crtc_wait_for_pending_flips+0x16c/0x200 [i915]()
[12156.991076] WARN_ON(ret)
[12156.991080] Modules linked in:
[12156.991089]  arc4 ath9k_htc ath9k_common ath9k_hw ath mac80211 cfg80211 gpio_ich rfkill iTCO_wdt iTCO_vendor_support ppdev i915 serio_raw e100 evdev input_leds mousedev video drm_kms_helper mii snd_hda_codec_realtek snd_hda_codec_generic pcspkr led_class mac_hid i2c_i801 drm intel_agp intel_gtt snd_hda_intel lpc_ich agpgart snd_hda_codec snd_hda_core snd_hwdep snd_pcm parport_pc rng_core tpm_infineon tpm parport button snd_timer snd soundcore i2c_algo_bit shpchp acpi_cpufreq processor sch_fq_codel ip_tables x_tables ext4 crc16 mbcache jbd2 uas usb_storage hid_generic usbhid hid sr_mod cdrom sd_mod ata_generic pata_acpi atkbd libps2 ata_piix libata scsi_mod ehci_pci uhci_hcd ehci_hcd usbcore usb_common i8042 serio floppy
[12156.991227] CPU: 1 PID: 11162 Comm: kworker/u4:2 Tainted: G        W       4.2.1-1-ARCH #1
[12156.991233] Hardware name:                  /D915GAG                        , BIOS EV91510A.86A.0482.2006.0222.2350 02/22/2006
[12156.991266] Workqueue: i915-hangcheck i915_hangcheck_elapsed [i915]
[12156.991274]  c162b967 2f075528 00000000 f541fd50 c14c3c3d f541fd90 f541fd80 c10582e7
[12156.991286]  f988975a f541fdb0 00002b9a f9894afc 00000cdb f9830e3c f9830e3c f6758000
[12156.991298]  f2134434 00000001 f541fd9c c105835e 00000009 f541fd90 f988975a f541fdb0
[12156.991311] Call Trace:
[12156.991333]  [<c14c3c3d>] dump_stack+0x48/0x69
[12156.991344]  [<c10582e7>] warn_slowpath_common+0x87/0xc0
[12156.991380]  [<f9830e3c>] ? intel_crtc_wait_for_pending_flips+0x16c/0x200 [i915]
[12156.991416]  [<f9830e3c>] ? intel_crtc_wait_for_pending_flips+0x16c/0x200 [i915]
[12156.991425]  [<c105835e>] warn_slowpath_fmt+0x3e/0x60
[12156.991463]  [<f9830e3c>] intel_crtc_wait_for_pending_flips+0x16c/0x200 [i915]
[12156.991498]  [<f8351bb4>] ? drm_modeset_lock_all_crtcs+0x84/0x90 [drm]
[12156.991531]  [<f98321c4>] intel_crtc_disable_planes+0x34/0xf0 [i915]
[12156.991564]  [<f983232a>] intel_prepare_reset+0x6a/0x80 [i915]
[12156.991596]  [<f9805877>] i915_handle_error+0x147/0x6e0 [i915]
[12156.991608]  [<c10ad307>] ? vprintk_default+0x37/0x40
[12156.991638]  [<f9806093>] i915_hangcheck_elapsed+0x233/0x410 [i915]
[12156.991648]  [<c106d8ea>] process_one_work+0x11a/0x3f0
[12156.991655]  [<c106dbf7>] worker_thread+0x37/0x470
[12156.991663]  [<c106dbc0>] ? process_one_work+0x3f0/0x3f0
[12156.991672]  [<c1072c96>] kthread+0xa6/0xc0
[12156.991682]  [<c1079ea5>] ? finish_task_switch+0x55/0x190
[12156.991693]  [<c14c8b41>] ret_from_kernel_thread+0x21/0x30
[12156.991702]  [<c1072bf0>] ? kthread_worker_fn+0x140/0x140
[12156.991708] ---[ end trace 19e44a44334dce5c ]---
[12157.028199] [drm:i915_set_reset_status [i915]] *ERROR* gpu hanging too fast, banning!
[12157.028264] drm/i915: Resetting chip after gpu hang

Last edited by Square (2015-10-01 01:18:14)


 

Offline

#2 2015-09-30 06:54:29

flannelhead
Member
From: Finland
Registered: 2014-03-30
Posts: 55
Website

Re: GPU Hang - i915 [Solved (I hope)]

It would be worth it to specify your kernel version and GPU/CPU model. Are you using any kind of a compositor / accelerated desktop environment? Has the problem started to appear only recently? Does the problem disappear if you enable the UXA acceleration method?

Please also do an online search with the error code and compare the existing bug reports to your issue. If none of those match, you could file an upstream bug report.

Offline

#3 2015-09-30 19:14:31

Square
Member
Registered: 2008-06-11
Posts: 435

Re: GPU Hang - i915 [Solved (I hope)]

I've just done a fresh install on this system which, admittedly, was having some problems before Arch; so it could very well be hardware related. That said, UXA acceleration seems to have done the trick. I was hoping to find some sort of trick like that which would just make it work, considering this tower is pretty much junk anyhow.

The GPU is Intel Corporation 82915G/GV/910GL.

I will report back if the issue comes up again. Thank you for the suggestion! I've been separated from a computer for some time, so I'm a bit rusty. I must admit, I was extremely surprised by the latest install medium. Last I used was probably in 2010!

Last edited by Square (2015-09-30 19:15:29)


 

Offline

#4 2015-09-30 19:55:41

flannelhead
Member
From: Finland
Registered: 2014-03-30
Posts: 55
Website

Re: GPU Hang - i915 [Solved (I hope)]

Glad to hear you got it working. Intel GPUs are a sometimes a bit tricky... Mine freezes randomly when starting X – there could be weeks without a single hang, but then it could happen 5 times in a row. That's a known issue, though.

Offline

#5 2015-09-30 22:32:25

karol
Archivist
Registered: 2009-05-06
Posts: 25,440

Re: GPU Hang - i915 [Solved (I hope)]

Please remember to mark the thread as solved https://bbs.archlinux.org/viewtopic.php?id=130309

Offline

#6 2015-10-01 01:18:47

Square
Member
Registered: 2008-06-11
Posts: 435

Re: GPU Hang - i915 [Solved (I hope)]

Thanks karol.


 

Offline

#7 2018-08-29 08:46:33

Galaxy
Member
Registered: 2018-08-29
Posts: 2

Re: GPU Hang - i915 [Solved (I hope)]

I just meet the same display freeze bug on Xeon E-2176M today. It happeds only once among at least 4 times of restart.

Is "SNA" still not stable after 3 years ?

kernel: [drm:fw_domains_get_with_fallback [i915]] *ERROR* media: timed out waiting for forcewake ack to clear.

kernel: ------------[ cut here ]------------
kernel: WARN_ON(intel_gpu_reset(i915, (~0)))
kernel: WARNING: CPU: 4 PID: 337 at drivers/gpu/drm/i915/i915_gem.c:5005 i915_gem_sanitize+0x77/0x80 [i915]
kernel: Modules linked in: wmi_bmof(+) intel_wmi_thunderbolt nvidia(PO) iwlmvm(+) i915(+) irqbypass crct10dif_pclmul c>
kernel:  typec_ucsi tpm typec rng_core wmi intel_gtt pcc_cpufreq agpgart int3400_thermal acpi_thermal_rel syscopyarea >
kernel: CPU: 4 PID: 337 Comm: systemd-udevd Tainted: P           O      4.18.5-arch1-1-ARCH #1
kernel: Hardware name: LENOVO 20M9CTO1WW/20M9CTO1WW, BIOS N2CET27W (1.10 ) 07/19/2018
kernel: RIP: 0010:i915_gem_sanitize+0x77/0x80 [i915]
kernel: Code: 01 00 84 c0 74 cc be ff ff ff ff 48 89 df e8 80 ef 01 00 85 c0 74 bb 48 c7 c6 68 5d 47 c1 48 c7 c7 f4 62>
kernel: RSP: 0018:ffffbc6787227ae8 EFLAGS: 00010282
kernel: RAX: 0000000000000000 RBX: ffff9e76872c0000 RCX: 0000000000000001
kernel: RDX: 0000000080000001 RSI: ffffffff902811ce RDI: 00000000ffffffff
kernel: RBP: 0000000000000000 R08: 0000000e9edcfdff R09: 00000000000003a2
kernel: R10: ffffffff90a05700 R11: 0000000000000000 R12: ffffbc6787227b10
kernel: R13: ffff9e76945a6000 R14: 0000000000000048 R15: 000000000000a30e
kernel: FS:  00007f4b8d1d4d40(0000) GS:ffff9e769bd00000(0000) knlGS:0000000000000000
kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
kernel: CR2: 000055690674f000 CR3: 000000104bf6a005 CR4: 00000000003606e0
kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
kernel: Call Trace:
kernel:  i915_driver_load+0x6e3/0xda0 [i915]
kernel:  ? acpi_dev_found+0x5f/0x70
kernel:  local_pci_probe+0x41/0x90
kernel:  pci_device_probe+0x115/0x1a0
kernel:  driver_probe_device+0x2da/0x450
kernel:  __driver_attach+0xdd/0x110
kernel:  ? driver_probe_device+0x450/0x450
kernel:  bus_for_each_dev+0x76/0xc0
kernel:  bus_add_driver+0x152/0x230
kernel:  ? 0xffffffffc0f9a000
kernel:  driver_register+0x6b/0xb0
kernel:  ? 0xffffffffc0f9a000
kernel:  do_one_initcall+0x46/0x1f5
kernel:  ? kmem_cache_alloc_trace+0x181/0x1d0
kernel:  ? do_init_module+0x22/0x210
kernel:  do_init_module+0x5a/0x210
kernel:  load_module+0x22a9/0x24c0
kernel:  ? vmap_page_range_noflush+0x23f/0x350
kernel:  ? __se_sys_init_module+0x10c/0x170
kernel:  __se_sys_init_module+0x10c/0x170
kernel:  do_syscall_64+0x5b/0x170
kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xa9
kernel: RIP: 0033:0x7f4b90fe848e
kernel: Code: 48 8b 0d d5 19 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca>
kernel: RSP: 002b:00007fff7f35c818 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
kernel: RAX: ffffffffffffffda RBX: 00005569058f4c30 RCX: 00007f4b90fe848e
kernel: RDX: 00007f4b90881ecd RSI: 00000000003f5fe0 RDI: 000055690635ad20
kernel: RBP: 00007f4b90881ecd R08: 0000000000000007 R09: 0000000000000006
kernel: R10: 00005569058e6010 R11: 0000000000000246 R12: 000055690635ad20
kernel: R13: 00005569058ebfa0 R14: 0000000000020000 R15: 00005569058f4c30
kernel: ---[ end trace 7f969b4f89a9e816 ]---
kernel: checking generic (4000000000 1fa4000) vs hw (4000000000 10000000)
kernel: fb: switching to inteldrmfb from EFI VGA
kernel: Console: switching to colour dummy device 80x25
kernel: [drm] Replacing VGA console driver
kernel: [drm:gen9_set_dc_state [i915]] *ERROR* DC state mismatch (0x0 -> 0x3)
kernel: [drm:gen9_set_dc_state [i915]] *ERROR* Writing dc state to 0xfffffffc failed, now 0xffffffff
kernel: ------------[ cut here ]------------
kernel: WARN_ON((val & ((1<<((0)*6+5)) | (1<<((0)*6+4)) | (1<<((0)*6)))) != (1<<((0)*6)))
kernel: WARNING: CPU: 5 PID: 337 at drivers/gpu/drm/i915/intel_cdclk.c:834 skl_get_cdclk+0x1f2/0x260 [i915]
kernel: Modules linked in: iTCO_vendor_support(+) wmi_bmof(+) intel_wmi_thunderbolt nvidia(PO) kvm(+) iwlmvm(+) i915(+>
kernel:  media tpm_tis_core ucsi_acpi typec_ucsi tpm typec rng_core wmi intel_gtt pcc_cpufreq agpgart int3400_thermal >
kernel: CPU: 5 PID: 337 Comm: systemd-udevd Tainted: P        W  O      4.18.5-arch1-1-ARCH #1
kernel: Hardware name: LENOVO 20M9CTO1WW/20M9CTO1WW, BIOS N2CET27W (1.10 ) 07/19/2018
kernel: RIP: 0010:skl_get_cdclk+0x1f2/0x260 [i915]
kernel: Code: 5b 5d c3 8b 43 08 c7 43 04 a0 98 7b 00 89 43 0c 89 03 e9 bf fe ff ff 48 c7 c6 48 ae 47 c1 48 c7 c7 68 78>
kernel: RSP: 0018:ffffbc6787227a78 EFLAGS: 00010286
kernel: RAX: 0000000000000000 RBX: ffff9e76872c55fc RCX: 0000000000000001
kernel: RDX: 0000000080000001 RSI: ffffffff902811ce RDI: 00000000ffffffff
kernel: RBP: ffff9e76872c0000 R08: 0000000e9f772a81 R09: 00000000000003dc
kernel: R10: ffffffff90a05700 R11: 0000000000000000 R12: ffff9e76872c6958
kernel: R13: ffff9e76945a6000 R14: ffff9e76945a6000 R15: 000000000000a30e
kernel: FS:  00007f4b8d1d4d40(0000) GS:ffff9e769bd40000(0000) knlGS:0000000000000000
kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
kernel: CR2: 00007f4b8d1d4cc0 CR3: 000000104bf6a003 CR4: 00000000003606e0
kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
kernel: Call Trace:
kernel:  intel_update_cdclk+0x1c/0x60 [i915]
kernel:  skl_init_cdclk+0xdd/0x1b0 [i915]
kernel:  ? intel_power_well_enable+0x34/0x40 [i915]
kernel:  intel_power_domains_init_hw+0x704/0x910 [i915]
kernel:  i915_driver_load+0xaa8/0xda0 [i915]
kernel:  ? acpi_dev_found+0x5f/0x70
kernel:  local_pci_probe+0x41/0x90
kernel:  pci_device_probe+0x115/0x1a0
kernel:  driver_probe_device+0x2da/0x450
kernel:  __driver_attach+0xdd/0x110
kernel:  ? driver_probe_device+0x450/0x450
kernel:  bus_for_each_dev+0x76/0xc0
kernel:  bus_add_driver+0x152/0x230
kernel:  ? 0xffffffffc0f9a000
kernel:  driver_register+0x6b/0xb0
kernel:  ? 0xffffffffc0f9a000
kernel:  do_one_initcall+0x46/0x1f5
kernel:  ? kmem_cache_alloc_trace+0x181/0x1d0
kernel:  ? do_init_module+0x22/0x210
kernel:  do_init_module+0x5a/0x210
kernel:  load_module+0x22a9/0x24c0
kernel:  ? vmap_page_range_noflush+0x23f/0x350
kernel:  ? __se_sys_init_module+0x10c/0x170
kernel:  __se_sys_init_module+0x10c/0x170
kernel:  do_syscall_64+0x5b/0x170
kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xa9
kernel: RIP: 0033:0x7f4b90fe848e
kernel: Code: 48 8b 0d d5 19 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca>
kernel: RSP: 002b:00007fff7f35c818 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
kernel: RAX: ffffffffffffffda RBX: 00005569058f4c30 RCX: 00007f4b90fe848e
kernel: RDX: 00007f4b90881ecd RSI: 00000000003f5fe0 RDI: 000055690635ad20
kernel: RBP: 00007f4b90881ecd R08: 0000000000000007 R09: 0000000000000006
kernel: R10: 00005569058e6010 R11: 0000000000000246 R12: 000055690635ad20
kernel: R13: 00005569058ebfa0 R14: 0000000000020000 R15: 00005569058f4c30
kernel: ---[ end trace 7f969b4f89a9e817 ]---
kernel: [drm:skl_init_cdclk [i915]] *ERROR* Failed to inform PCU about cdclk change (-11)
kernel: [drm:gen9_set_dc_state [i915]] *ERROR* DC state mismatch (0x0 -> 0x3)
kernel: iTCO_vendor_support: vendor-support=0
kernel: e1000e 0000:00:1f.6 enp0s31f6: renamed from eth0
kernel: [drm:gen9_set_dc_state [i915]] *ERROR* Writing dc state to 0xfffffffc failed, now 0xffffffff
kernel: ------------[ cut here ]------------
kernel: WARN_ON((val & ((1<<((0)*6+5)) | (1<<((0)*6+4)) | (1<<((0)*6)))) != (1<<((0)*6)))
kernel: WARNING: CPU: 5 PID: 337 at drivers/gpu/drm/i915/intel_cdclk.c:834 skl_get_cdclk+0x1f2/0x260 [i915]
kernel: Modules linked in: arc4 8250_dw(+) iTCO_vendor_support mei_wdt(+) wmi_bmof intel_wmi_thunderbolt nvidia(PO) kv>

Offline

#8 2018-08-29 09:37:21

V1del
Forum Moderator
Registered: 2012-10-16
Posts: 21,333

Re: GPU Hang - i915 [Solved (I hope)]

The xf86-video-intel driver is not relevantly being developed anymore, use the built in modesetting driver instead.

If that is insufficient to fix the issue please open up your own thread about it as opposed to bumping a 3 year old thread.

https://wiki.archlinux.org/index.php/Co … bumping.22

Closing.

Last edited by V1del (2018-08-29 09:38:09)

Offline

Board footer

Powered by FluxBB