ring_gfx hangs/freezes on Navi GPUs
Submitted by Marko Popovic
Assigned to Default DRI bug account
Link to original bug (#111763)
Description
I'm opening this as a separate report to track ring_gfx-related bugs. https://bugs.freedesktop.org/show_bug.cgi?id=111481 should stay focused on sdma0/1 freezes, since those seem to be the ones causing random "out of the blue" hangs on the desktop.
A different type of freeze/hang happens when playing StarCraft II via D9VK. It doesn't seem to be related to either NGG or DMA: I have both disabled with AMD_DEBUG=nodma and AMD_DEBUG=nongg, and the hangs still occur, at exactly the same place every time.
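For anyone reproducing this setup, here is a minimal sketch of combining both flags into one AMD_DEBUG value. It assumes Mesa reads AMD_DEBUG as a comma-separated list of flags. If the variable is instead assigned twice in the same environment, only the last assignment survives. The helper name with_amd_debug is hypothetical and only for illustration.

```python
import os


def with_amd_debug(flags, base_env=None):
    """Return a copy of the environment with `flags` merged into AMD_DEBUG.

    Assumption: AMD_DEBUG is parsed as a comma-separated list of flags.
    """
    env = dict(os.environ if base_env is None else base_env)
    # Keep any flags that are already set, dropping empty entries.
    current = [f for f in env.get("AMD_DEBUG", "").split(",") if f]
    for flag in flags:
        if flag not in current:
            current.append(flag)
    env["AMD_DEBUG"] = ",".join(current)
    return env


env = with_amd_debug(["nodma", "nongg"], base_env={})
print(env["AMD_DEBUG"])  # nodma,nongg
```

The resulting dictionary can be passed as the env argument of subprocess.run when launching the game, so both flags apply to the same process.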
Error logs:
sep 17 11:48:24 Marko-PC kernel: [drm:amdgpu_dm_commit_planes.constprop.0 [amdgpu]] ERROR Waiting for fences timed out or interrupted!
sep 17 11:48:24 Marko-PC kernel: [drm:amdgpu_job_timedout [amdgpu]] ERROR ring gfx_0.0.0 timeout, signaled seq=2361623, emitted seq=2361625
sep 17 11:48:24 Marko-PC kernel: [drm:amdgpu_job_timedout [amdgpu]] ERROR Process information: process SC2_x64.exe pid 20236 thread SC2_x64.exe pid 20236
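To compare hangs across reports, the ring timeout line above can be parsed mechanically. The sketch below pulls out the ring name and the two sequence numbers. The "pending" field (emitted minus signaled) is my own reading of the message as the number of fences submitted but not yet completed, not something the driver prints. The function name parse_ring_timeout is hypothetical.

```python
import re

# Matches the amdgpu_job_timedout message format shown in the log above.
TIMEOUT_RE = re.compile(
    r"ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)


def parse_ring_timeout(line):
    """Return ring name and fence sequence numbers, or None if no match."""
    m = TIMEOUT_RE.search(line)
    if m is None:
        return None
    signaled = int(m.group("signaled"))
    emitted = int(m.group("emitted"))
    return {
        "ring": m.group("ring"),
        "signaled": signaled,
        "emitted": emitted,
        # Assumed interpretation: fences emitted but not yet signaled.
        "pending": emitted - signaled,
    }


line = (
    "sep 17 11:48:24 Marko-PC kernel: [drm:amdgpu_job_timedout [amdgpu]] "
    "ERROR ring gfx_0.0.0 timeout, signaled seq=2361623, emitted seq=2361625"
)
print(parse_ring_timeout(line))
```

For the log line in this report, the parsed ring is gfx_0.0.0 and the difference between emitted and signaled is 2.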
I will try to provide RenderDoc trace files for the issues described. The hangs also happen in native Vulkan games such as Rise of the Tomb Raider. I will provide as much information as possible.
Using kernel 5.3, Mesa 19.2, and LLVM 9.