[amdgpu]: Golf With Your Friends (431240): ERROR Waiting for fences timed out
System information
System: Host: Flash-linux Kernel: 5.8.0-48-generic x86_64 bits: 64 compiler: N/A Desktop: Cinnamon 4.8.6 wm: muffin
dm: LightDM Distro: Linux Mint 20.1 Ulyssa base: Ubuntu 20.04 focal
CPU: Topology: 8-Core model: AMD Ryzen 7 3700X bits: 64 type: MT MCP arch: Zen L2 cache: 4096 KiB
flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm bogomips: 114987
Speed: 2195 MHz min/max: 2200/3600 MHz Core speeds (MHz): 1: 2195 2: 2195 3: 2189 4: 2188 5: 2195 6: 2192 7: 2187
8: 2196 9: 2195 10: 2193 11: 2195 12: 2194 13: 2197 14: 2196 15: 2196 16: 2195
Graphics: Device-1: Advanced Micro Devices [AMD/ATI] Navi 10 [Radeon RX 5600 OEM/5600 XT / 5700/5700 XT] vendor: ASRock
driver: amdgpu v: kernel bus ID: 0d:00.0 chip ID: 1002:731f
Display: x11 server: X.Org 1.20.9 driver: amdgpu,ati unloaded: fbdev,modesetting,radeon,vesa
resolution: 2560x1440~60Hz
OpenGL: renderer: AMD Radeon RX 5700 (NAVI10 DRM 3.38.0 5.8.0-48-generic LLVM 11.0.1)
v: 4.6 Mesa 21.0.1 - kisak-mesa PPA direct render: Yes
Description
While playing Golf With Your Friends 431240 the amdgpu would crash and the desktop environment had to be restarted. This issue is fairly repeatable.
Start the game, select Offline mode and for Game Settings I selected Course as Forest and Mode as Classic. When playing a hole you can free the camera to look around the hole (controller Y button). The crash would tend to happen when you view the hole from above (controller RB button to move camera up) and you view the hole from various angles. It would not always crash but for me it would sometimes happen on the first or seventh hole; I was never able to finish an entire 18 holes. The crash would also happen to me while playing Valheim 892970, when you are sailing you can zoom the camera out and it would sometimes crash. However while playing Valheim it wasn't as repeatable. Both games appear to be Linux games that use the Unity game engine.
Syslog output
Mar 30 20:19:54 Flash-linux kernel: [10731.962165] [drm:amdgpu_dm_commit_planes.constprop.0 [amdgpu]] *ERROR* Waiting for fences timed out!
Mar 30 20:19:54 Flash-linux kernel: [10736.835892] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=1386139, emitted seq=1386141
Mar 30 20:19:54 Flash-linux kernel: [10736.835978] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Golf With Your pid 16557 thread Golf With :cs0 pid 16559
Mar 30 20:19:54 Flash-linux kernel: [10736.835985] amdgpu 0000:0d:00.0: amdgpu: GPU reset begin!
Mar 30 20:19:54 Flash-linux kernel: [10737.192835] amdgpu 0000:0d:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
Mar 30 20:19:54 Flash-linux kernel: [10737.192916] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
Mar 30 20:19:54 Flash-linux kernel: [10737.446425] amdgpu 0000:0d:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
Mar 30 20:19:54 Flash-linux kernel: [10737.446504] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KCQ disable failed
Mar 30 20:19:55 Flash-linux kernel: [10737.700028] [drm:gfx_v10_0_cp_gfx_enable [amdgpu]] *ERROR* failed to halt cp gfx
Mar 30 20:19:55 Flash-linux kernel: [10737.733731] [drm] free PSP TMR buffer
Mar 30 20:19:58 Flash-linux kernel: [10740.914751] amdgpu 0000:0d:00.0: amdgpu: GPU reset succeeded, trying to resume
Mar 30 20:19:58 Flash-linux kernel: [10740.914869] [drm] PCIE GART of 512M enabled (table at 0x0000008000E10000).
Mar 30 20:19:58 Flash-linux kernel: [10740.914906] [drm] VRAM is lost due to GPU reset!
Mar 30 20:19:58 Flash-linux kernel: [10740.917191] [drm] PSP is resuming...
Mar 30 20:19:58 Flash-linux kernel: [10740.989556] [drm] reserve 0xa00000 from 0x81fe400000 for PSP TMR
Mar 30 20:19:58 Flash-linux kernel: [10741.181541] amdgpu 0000:0d:00.0: amdgpu: RAS: optional ras ta ucode is not available
Mar 30 20:19:58 Flash-linux kernel: [10741.205538] amdgpu: SMU is resuming...
Mar 30 20:19:58 Flash-linux kernel: [10741.207390] amdgpu: SMU is resumed successfully!
Mar 30 20:19:58 Flash-linux kernel: [10741.379186] [drm] kiq ring mec 2 pipe 1 q 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390509] [drm] VCN decode and encode initialized successfully(under DPG Mode).
Mar 30 20:19:58 Flash-linux kernel: [10741.390566] [drm] JPEG decode initialized successfully.
Mar 30 20:19:58 Flash-linux kernel: [10741.390595] amdgpu 0000:0d:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390595] amdgpu 0000:0d:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390596] amdgpu 0000:0d:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390597] amdgpu 0000:0d:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390597] amdgpu 0000:0d:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390598] amdgpu 0000:0d:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390599] amdgpu 0000:0d:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390599] amdgpu 0000:0d:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390600] amdgpu 0000:0d:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390601] amdgpu 0000:0d:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390601] amdgpu 0000:0d:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390602] amdgpu 0000:0d:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
Mar 30 20:19:58 Flash-linux kernel: [10741.390602] amdgpu 0000:0d:00.0: amdgpu: ring vcn_dec uses VM inv eng 0 on hub 1
Mar 30 20:19:58 Flash-linux kernel: [10741.390603] amdgpu 0000:0d:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 1 on hub 1
Mar 30 20:19:58 Flash-linux kernel: [10741.390604] amdgpu 0000:0d:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 4 on hub 1
Mar 30 20:19:58 Flash-linux kernel: [10741.390604] amdgpu 0000:0d:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 1
Mar 30 20:19:58 Flash-linux kernel: [10741.393296] [drm] recover vram bo from shadow start
Mar 30 20:19:58 Flash-linux kernel: [10741.398576] [drm] recover vram bo from shadow done
Mar 30 20:19:58 Flash-linux kernel: [10741.398578] [drm] Skip scheduling IBs!
Mar 30 20:19:58 Flash-linux kernel: [10741.398579] [drm] Skip scheduling IBs!
Mar 30 20:19:58 Flash-linux kernel: [10741.398598] amdgpu 0000:0d:00.0: amdgpu: GPU reset(2) succeeded!
Mar 30 20:19:58 Flash-linux kernel: [10741.398637] [drm] Skip scheduling IBs!
Mar 30 20:19:58 Flash-linux kernel: [10741.398646] [drm] Skip scheduling IBs!
Mar 30 20:19:58 Flash-linux kernel: [10741.398650] [drm] Skip scheduling IBs!
Mar 30 20:19:58 Flash-linux kernel: [10741.398653] [drm] Skip scheduling IBs!
Mar 30 20:19:59 Flash-linux systemd[1]: Started Process Core Dump (PID 16674/UID 0).
Mar 30 20:20:01 Flash-linux systemd-coredump[16675]: Core file was truncated to 2147483648 bytes.
Mar 30 20:20:02 Flash-linux systemd-coredump[16675]: Process 16557 (Golf With Your ) of user 1000 dumped core.#012#012Stack trace of thread 16623:#012#0 0x00007f2ed5cb5184 n/a (n/a + 0x0)
Mar 30 20:20:02 Flash-linux systemd[1]: systemd-coredump@70-16674-0.service: Succeeded.