GPU hang when playing Terraria: ecode 12:1:84dffffb, in Main Thread [129278]
Playing Terraria (launched through Steam) often causes my system to freeze after a few minutes (or sometimes more like 30 mins) of playing, with a GPU HANG message in the system log. Here's the relevant part:
Mar 23 08:45:03 mish-arch kwin_wayland[836]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 23 08:45:06 mish-arch kernel: i915 0000:00:02.0: [drm] Resetting rcs0 for preemption time out
Mar 23 08:45:06 mish-arch kernel: i915 0000:00:02.0: [drm] *ERROR* GT0: rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Mar 23 08:45:06 mish-arch kernel: i915 0000:00:02.0: [drm] GPU HANG: ecode 12:1:84dffffb, in Main Thread [129278]
Mar 23 08:45:06 mish-arch kernel: GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
Mar 23 08:45:06 mish-arch kernel: Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.
Mar 23 08:45:06 mish-arch kernel: Please see https://drm.pages.freedesktop.org/intel-docs/how-to-file-i915-bugs.html for details.
Mar 23 08:45:06 mish-arch kernel: drm/i915 developers can then reassign to the right component if it's not a kernel issue.
Mar 23 08:45:06 mish-arch kernel: The GPU crash dump is required to analyze GPU hangs, so please always attach it.
Mar 23 08:45:06 mish-arch kernel: GPU crash dump saved to /sys/class/drm/card1/error
Mar 23 08:45:08 mish-arch kwin_wayland[836]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 23 08:45:18 mish-arch kernel: Fence expiration time out i915-0000:00:02.0:Main Thread[129278]:6d516!
Mar 23 08:45:18 mish-arch kernel: Fence expiration time out i915-0000:00:02.0:kwin_wayland[836]:833298!
Mar 23 08:45:18 mish-arch kernel: Fence expiration time out i915-0000:00:02.0:Main Thread[129278]:6d518!
Mar 23 08:45:18 mish-arch kernel: Fence expiration time out i915-0000:00:02.0:QSGRenderThread[128275]:58d4!
Mar 23 08:45:18 mish-arch kernel: Fence expiration time out i915-0000:00:02.0:QSGRenderThread[128275]:58d6!
Mar 23 08:45:18 mish-arch kernel: Fence expiration time out i915-0000:00:02.0:Main Thread[129278]:6d51c!
Mar 23 08:45:18 mish-arch kernel: Fence expiration time out i915-0000:00:02.0:Main Thread[129278]:6d51a!
Mar 23 08:45:18 mish-arch kernel: Fence expiration time out i915-0000:00:02.0:steamwebhelper[2134]:8afed2!
Mar 23 08:45:18 mish-arch kernel: Fence expiration time out i915-0000:00:02.0:steamwebhelper[2134]:8afed4!
Mar 23 08:45:27 mish-arch kernel: i915 0000:00:02.0: [drm] GPU HANG: ecode 12:1:84dffffb, in Main Thread [129278]
Mar 23 08:45:27 mish-arch kernel: i915 0000:00:02.0: [drm] Resetting rcs0 for stopped heartbeat on rcs0
Mar 23 08:45:27 mish-arch kernel: i915 0000:00:02.0: [drm] *ERROR* GT0: rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Mar 23 08:45:27 mish-arch kernel: i915 0000:00:02.0: [drm] GT0: Resetting chip for stopped heartbeat on rcs0
Mar 23 08:45:27 mish-arch kernel: i915 0000:00:02.0: [drm] *ERROR* GT0: rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Mar 23 08:45:27 mish-arch kernel: i915 0000:00:02.0: [drm] *ERROR* GT0: rcs0 reset request timed out: {request: 00000001, RESET_CTL: 00000001}
Mar 23 08:45:27 mish-arch kernel: i915 0000:00:02.0: [drm] Main Thread[129278] context reset due to GPU hang
Mar 23 08:45:27 mish-arch kernel: i915 0000:00:02.0: [drm] GT0: GuC firmware i915/tgl_guc_70.bin version 70.20.0
Mar 23 08:45:27 mish-arch kernel: i915 0000:00:02.0: [drm] GT0: HuC firmware i915/tgl_huc.bin version 7.9.3
Mar 23 08:45:28 mish-arch kernel: i915 0000:00:02.0: [drm] GT0: HuC: authenticated for all workloads
Mar 23 08:45:28 mish-arch kernel: i915 0000:00:02.0: [drm] GT0: GUC: submission disabled
Mar 23 08:45:28 mish-arch kernel: i915 0000:00:02.0: [drm] GT0: GUC: SLPC disabled
Mar 23 08:45:29 mish-arch kernel: sched: RT throttling activated
I've uploaded the GPU crash dump to card1-error.txt
System info
-
uname
: Linux 6.8.1-arch1-1 #1 SMP PREEMPT_DYNAMIC Sat, 16 Mar 2024 17:15:35 +0000 x86_64 GNU/Linux - Motherboard: MSI PRO H610M-G DDR4 (MS-7D46)
-
dmesg
debug output: dmesg.log - Note that not all log files are from the same occurrence of the issue (hopefully that won't be a problem)
Additional info
- I can reproduce this issue, as it now happens whenever I play Terraria, and usually doesn't take long to be triggered. However, I have previously been able to play the game without issues,
so a recent kernel update may be the cause. - I can also reproduce this with kernel version
6.7.0-arch3-1
Edited by MMK21