[skl] GPU HANG: ecode 9:0:0x85dffffb, in Xorg [836], reason: Hang on render ring, action: reset
Submitted by Bjørn Mork
Assigned to Intel 3D Bugs Mailing List
Link to original bug (#101288)
Description
Created attachment 131693 dmesg
Getting this occasionally while zooming windows or moving graphic objects (drawing etc). It happens too rarely to be reproducible, but often enough to be annoying. The hang somehow makes X restart after repeated reset attempts.
[54034.511500] [drm] GPU HANG: ecode 9:0:0x85dffffb, in Xorg [836], reason: Hang on render ring, action: reset [54034.511509] [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace. [54034.511513] [drm] Please file a new bug report on bugs.freedesktop.org against DRI -> DRM/Intel [54034.511517] [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue. [54034.511520] [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it. [54034.511524] [drm] GPU crash dump saved to /sys/class/drm/card0/error [54034.511622] drm/i915: Resetting chip after gpu hang [54034.511708] [drm] RC6 on [54034.527085] [drm] GuC firmware load skipped [[54046.478860] drm/i915: Resetting chip after gpu hang [54046.478993] [drm] RC6 on [54046.497115] [drm] GuC firmware load skipped
System is a plain Debian stretch running on a Lenovo Thinkpad X1 Carbon 4th gen.
bjorn@miraculix:~$ uname -a Linux miraculix 4.9.0-3-amd64 #1 (closed) SMP Debian 4.9.25-1 (2017-05-02) x86_64 GNU/Linux
bjorn@miraculix:~$ grep . /sys/class/dmi/id/{bios,board}* 2>/dev/null /sys/class/dmi/id/bios_date:11/28/2016 /sys/class/dmi/id/bios_vendor:LENOVO /sys/class/dmi/id/bios_version:N1FET47W (1.21 ) /sys/class/dmi/id/board_asset_tag:Not Available /sys/class/dmi/id/board_name:20FB006AMN /sys/class/dmi/id/board_vendor:LENOVO /sys/class/dmi/id/board_version:SDK0J40697 WIN
root@miraculix:/tmp# lspci -vvvnns :2
00:02.0 VGA compatible controller [0300]: Intel Corporation HD Graphics 520 [8086:1916] (rev 07) (prog-if 00 [VGA controller])
Subsystem: Lenovo HD Graphics 520 [17aa:2238]
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >
SERR- <PERR- INTx-
Latency: 0
Interrupt: pin A routed to IRQ 131
Region 0: Memory at e0000000 (64-bit, non-prefetchable) [size=16M]
Region 2: Memory at c0000000 (64-bit, prefetchable) [size=512M]
Region 4: I/O ports at e000 [size=64]
[virtual] Expansion ROM at 000c0000 [disabled] [size=128K]
Capabilities: [40] Vendor Specific Information: Len=0c <?>
Capabilities: [70] Express (v2) Root Complex Integrated Endpoint, MSI 00
DevCap: MaxPayload 128 bytes, PhantFunc 0
ExtTag- RBE+
DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop-
MaxPayload 128 bytes, MaxReadReq 128 bytes
DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
DevCap2: Completion Timeout: Not Supported, TimeoutDis-, LTR-, OBFF Not Supported
DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
Capabilities: [ac] MSI: Enable+ Count=1/1 Maskable- 64bit-
Address: fee00018 Data: 0000
Capabilities: [d0] Power Management version 2
Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [100 v1] Process Address Space ID (PASID)
PASIDCap: Exec+ Priv-, Max PASID Width: 14
PASIDCtl: Enable- Exec- Priv-
Capabilities: [200 v1] Address Translation Service (ATS)
ATSCap: Invalidate Queue Depth: 00
ATSCtl: Enable-, Smallest Translation Unit: 00
Capabilities: [300 v1] Page Request Interface (PRI)
PRICtl: Enable- Reset-
PRISta: RF- UPRGI- Stopped-
Page Request Capacity: 00008000, Page Request Allocation: 00000000
Kernel driver in use: i915
Kernel modules: i915
Using modesetting driver. Xorg.log before and after X restart is attached, as well as full dmesg and the /sys/class/drm/card0/error dump.
Attachment 131693, "dmesg":
dmesg