[amdgpu/radeonsi,HAWAII] Hand of Fate 2 leads to GPU lock up (display powered off, SSH works, keyboard dead): "flip_done timed out"
@curan
Submitted by Kai Wasserbäch Assigned to Default DRI bug account
Link to original bug (#109461)
Description
Created attachment 143232
dmesg excerpt showing the backtraces and other DRM related entries
Playing Hand of Fate 2 leads to reproducible lock ups of my HAWAII Pro GPU. Sometimes directly on initial load, sometimes after playing for a while. The system can be reached over SSH, but the attached input devices are dead (not even num lock changes work). In addition to this the display gets powered off (display turns black and shows behaviour as if looking for input, ie. the connector identifier is show.
In dmesg I can see "flip_done timed out" errors and two backtraces (see attached dmesg excerpt for all the details):
[15465.441663] [drm:drm_atomic_helper_wait_for_flip_done [drm_kms_helper]] ERROR [CRTC:44:crtc-0] flip_done timed out
[15465.451746] [drm:amdgpu_job_timedout [amdgpu]] ERROR ring gfx timeout, signaled seq=1164561, emitted seq=1164563
[15465.451751] [drm] GPU recovery disabled.
[15467.233739] [drm:amdgpu_job_timedout [amdgpu]] ERROR ring sdma0 timeout, signaled seq=171220, emitted seq=171221
[15467.233746] [drm] GPU recovery disabled.
[15475.681643] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] ERROR [CRTC:44:crtc-0] flip_done timed out
[15485.921664] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] ERROR [PLANE:42:plane-5] flip_done timed out
[15485.921779] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] ERROR amdgpu_dm_commit_planes: acrtc 0, already busy
If you need data logged with umr, please provide me with the exact command I should run.
The bug is reproducible with the following stack (Debian testing as a base):
GPU: Hawaii PRO [Radeon R9 290] (ChipID = 0x67b1)
Mesa: Git:master/8e9ad592c3
libdrm: 2.4.97
LLVM: SVN:trunk/r351739 (9.0 devel)
X.Org: 2:1.20.3-1
Linux: 4.20.4
Firmware (firmware-amd-graphics): 20190114-1
libclc: Git:master/428e821c1e
DDX (xserver-xorg-video-amdgpu): 18.1.0-1
Let me know if you need anything else.
**Attachment 143232**, "dmesg excerpt showing the backtraces and other DRM related entries":
2019-01-26_dmesg_HoF2_lockup.log