Crash with pathfinder (experimental vector graphics renderer) on AMD RX 5700 XT
System information
System:
Host: ishitatsuyuki-ryzen Kernel: 5.9.3-arch1-1 x86_64 bits: 64
compiler: N/A Desktop: GNOME 3.38.1 tk: GTK 3.24.23 wm: gnome-shell
dm: GDM Distro: Arch Linux
CPU:
Info: 8-Core model: AMD Ryzen 7 3700X bits: 64 type: MT MCP arch: Zen 2
L2 cache: 4096 KiB
flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
bogomips: 115050
Speed: 2753 MHz min/max: 2200/3600 MHz Core speeds (MHz): 1: 2262 2: 2748
3: 1927 4: 1997 5: 2191 6: 2189 7: 2193 8: 2193 9: 1862 10: 1865 11: 2197
12: 2196 13: 2196 14: 2195 15: 2059 16: 2401
Graphics:
Device-1: AMD Navi 10 [Radeon RX 5600 OEM/5600 XT / 5700/5700 XT]
vendor: Tul driver: amdgpu v: kernel bus ID: 0a:00.0 chip ID: 1002:731f
Display: wayland server: X.org 1.20.9 compositor: gnome-shell
driver: amdgpu resolution: <xdpyinfo missing>
OpenGL: renderer: AMD Radeon RX 5700 XT (NAVI10 DRM 3.39.0 5.9.3-arch1-1
LLVM 10.0.1)
v: 4.6 Mesa 20.2.1 direct render: Yes
Describe the issue
When running the demo of pathfinder, a driver hang occurs (nearly instantly after starting the program) and requires restarting the desktop environment to recover. An apitrace is attached that will reproduce the problem reliably (might require setting up the same distro with the same GPU).
The abovementioned repo has some bit of outdated code and requires some patching (related to window creation) to run under Arch. If necessary, I can provide the patches required for running the code.
Regression
Unknown.
Log files as attachment
dmesg around the hang:
[ 1131.517278] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
[ 1136.433460] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=24550, emitted seq=24552
[ 1136.433510] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process eglretrace pid 11312 thread eglretrace:cs0 pid 11313
[ 1136.433512] amdgpu 0000:0a:00.0: amdgpu: GPU reset begin!
[ 1136.786062] amdgpu 0000:0a:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
[ 1136.786109] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
[ 1137.028014] amdgpu 0000:0a:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
[ 1137.028059] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KCQ disable failed
[ 1137.270465] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
[ 1137.341137] amdgpu 0000:0a:00.0: amdgpu: GPU BACO reset
[ 1140.484307] amdgpu 0000:0a:00.0: amdgpu: GPU reset succeeded, trying to resume
[ 1140.819960] amdgpu 0000:0a:00.0: amdgpu: RAS: optional ras ta ucode is not available
[ 1140.839960] amdgpu 0000:0a:00.0: amdgpu: SMU is resuming...
[ 1140.842665] amdgpu 0000:0a:00.0: amdgpu: SMU is resumed successfully!
apitrace: demo.1.trace