[Navi] Pathfinder: Kingmaker is causing a GPU hang: flip_done timed out error
@shmerl
Submitted by Shmerl Assigned to Default DRI bug account
Link to original bug (#112266)
Description
When running Pathfinder: Kingmaker (latest GOG release, which should be the same as latest Steam one) on Sapphire Pulse RX 5700 XT, it's causing a weird GPU hang with flip_done timed out error (see below for detailed log), that doesn't look like the common shader hangs with ring gfx_0.0.0 timeout or common sdma hangs.
The game is using OpenGL, and I run the game on Debian testing, using this configuration:
kernel: 5.4-rc7
radeonsi: Mesa-master / llvm10:
OpenGL renderer string: AMD NAVI10 (DRM 3.35.0, 5.4.0-rc7, LLVM 10.0.0)
OpenGL core profile version string: 4.5 (Core Profile) Mesa 20.0.0-devel (git-eb6352162d)
llvm: 10~+201911120943210600592dd459242
from this llvm10 snapshot: https://tracker.debian.org/news/1079513/accepted-llvm-toolchain-snapshot-110201911120943210600592dd459242-1exp1-source-into-experimental/
DE: KDE Plasma 5.14.5 (X session).
GPU: Sapphire Pulse RX 5700 XT
Monitor: LG 27GL85-B (2560x1440, 144 Hz, DisplayPort 1.4 connection, adaptive sync activated in Xorg configuration).
When launching, I'm using AMD_DEBUG=nodma,nongg
Recording apitrace doesn't help, since replaying it is not reproducing the hang. So it could be some amdgpu issue? Please let me know, what additional info can be useful to help you narrow it down. However the hang is quite reproducible, and you can try it yourself with Pathfinder: Kingmaker.
The hang produces this in dmesg:
[ 659.445501] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] ERROR [CRTC:62:crtc-0] flip_done timed out
[ 669.685601] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] ERROR [PLANE:55:plane-5] flip_done timed out
[ 669.685644] ------------[ cut here ]------------
[ 669.685729] WARNING: CPU: 6 PID: 1018 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:5851 amdgpu_dm_atomic_commit_tail+0x1c56/0x1d70 [amdgpu]
[ 669.685730] Modules linked in: rfcomm(E) nf_tables(E) nfnetlink(E) bnep(E) edac_mce_amd(E) kvm_amd(E) kvm(E) irqbypass(E) crct10dif_pclmul(E) btusb(E) btrtl(E) snd_hda_codec_realtek(E) btbcm(E) crc32_pclmul(E) btintel(E) iwlmvm(E) snd_hda_codec_generic(E) bluetooth(E) ghash_clmulni_intel(E) ledtrig_audio(E) mac80211(E) libarc4(E) snd_hda_codec_hdmi(E) uvcvideo(E) snd_hda_intel(E) videobuf2_vmalloc(E) snd_usb_audio(E) snd_intel_nhlt(E) videobuf2_memops(E) drbg(E) snd_hda_codec(E) videobuf2_v4l2(E) snd_usbmidi_lib(E) iwlwifi(E) nls_ascii(E) snd_hda_core(E) snd_rawmidi(E) videobuf2_common(E) snd_seq_device(E) snd_hwdep(E) efi_pstore(E) nls_cp437(E) ansi_cprng(E) snd_pcm(E) videodev(E) sp5100_tco(E) aesni_intel(E) cfg80211(E) vfat(E) ecdh_generic(E) crypto_simd(E) ecc(E) snd_timer(E) fat(E) ccp(E) snd(E) cryptd(E) mc(E) glue_helper(E) crc16(E) wmi_bmof(E) pcspkr(E) efivars(E) k10temp(E) watchdog(E) sg(E) rfkill(E) soundcore(E) rng_core(E) evdev(E) acpi_cpufreq(E) nct6775(E) hwmon_vid(E)
[ 669.685753] parport_pc(E) ppdev(E) lp(E) parport(E) efivarfs(E) ip_tables(E) x_tables(E) autofs4(E) xfs(E) btrfs(E) xor(E) zstd_decompress(E) zstd_compress(E) raid6_pq(E) libcrc32c(E) crc32c_generic(E) sd_mod(E) hid_generic(E) usbhid(E) hid(E) amdgpu(E) gpu_sched(E) mxm_wmi(E) ahci(E) ttm(E) libahci(E) drm_kms_helper(E) xhci_pci(E) crc32c_intel(E) xhci_hcd(E) i2c_piix4(E) libata(E) drm(E) igb(E) dca(E) mfd_core(E) ptp(E) scsi_mod(E) usbcore(E) pps_core(E) i2c_algo_bit(E) nvme(E) nvme_core(E) wmi(E) button(E)
[ 669.685770] CPU: 6 PID: 1018 Comm: Xorg Tainted: G E 5.4.0-rc7 #31 (closed)
[ 669.685771] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570 Taichi, BIOS P2.50 11/02/2019
[ 669.685846] RIP: 0010:amdgpu_dm_atomic_commit_tail+0x1c56/0x1d70 [amdgpu]
[ 669.685847] Code: 67 fb ff ff 41 8b 4c 24 60 48 c7 c2 60 d6 a2 c0 bf 02 00 00 00 48 c7 c6 80 f8 a9 c0 e8 e3 7d bb ff 49 8b 47 08 e9 31 e5 ff ff <0f>
0b e9 b4 ec ff ff 0f 0b 0f 0b e9 cb ec ff ff 48 8b 85 b0 fd ff
[ 669.685848] RSP: 0018:ffffb80fc1a978d0 EFLAGS: 00010002
[ 669.685849] RAX: 0000000000000002 RBX: ffff9454b5d54c00 RCX: ffff9455ec2c6170
[ 669.685850] RDX: 0000000000000001 RSI: 0000000000000206 RDI: ffff9455eaba6158
[ 669.685851] RBP: ffffb80fc1a97b80 R08: 0000000000000005 R09: 0000000000000000
[ 669.685851] R10: ffffb80fc1a97838 R11: ffffb80fc1a9783c R12: 0000000000000206
[ 669.685852] R13: ffff9455ec2c6000 R14: ffff94559d443800 R15: ffff9455eda20000
[ 669.685853] FS: 00007fc6a5a21f00(0000) GS:ffff9455fe980000(0000) knlGS:0000000000000000
[ 669.685854] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 669.685855] CR2: 00007fc6a5991678 CR3: 00000007f0390000 CR4: 0000000000340ee0
[ 669.685856] Call Trace:
[ 669.685864] ? __irq_work_queue_local+0x50/0x60
[ 669.685872] ? commit_tail+0x94/0x110 [drm_kms_helper]
[ 669.685878] commit_tail+0x94/0x110 [drm_kms_helper]
[ 669.685884] drm_atomic_helper_commit+0xb8/0x130 [drm_kms_helper]
[ 669.685889] drm_atomic_helper_set_config+0x79/0x90 [drm_kms_helper]
[ 669.685902] drm_mode_setcrtc+0x194/0x6a0 [drm]
[ 669.685956] ? amdgpu_cs_wait_ioctl+0xeb/0x160 [amdgpu]
[ 669.685966] ? drm_mode_getcrtc+0x180/0x180 [drm]
[ 669.685976] drm_ioctl_kernel+0xaa/0xf0 [drm]
[ 669.685986] drm_ioctl+0x208/0x390 [drm]
[ 669.685995] ? drm_mode_getcrtc+0x180/0x180 [drm]
[ 669.686044] amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
[ 669.686048] do_vfs_ioctl+0x40e/0x670
[ 669.686050] ksys_ioctl+0x5e/0x90
[ 669.686052] __x64_sys_ioctl+0x16/0x20
[ 669.686055] do_syscall_64+0x52/0x160
[ 669.686058] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 669.686060] RIP: 0033:0x7fc6a5f6a5b7
[ 669.686061] Code: 00 00 90 48 8b 05 d9 78 0c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48>
3d 01 f0 ff ff 73 01 c3 48 8b 0d a9 78 0c 00 f7 d8 64 89 01 48
[ 669.686062] RSP: 002b:00007ffd36fb37a8 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
[ 669.686063] RAX: ffffffffffffffda RBX: 00007ffd36fb37e0 RCX: 00007fc6a5f6a5b7
[ 669.686064] RDX: 00007ffd36fb37e0 RSI: 00000000c06864a2 RDI: 000000000000000d
[ 669.686064] RBP: 00000000c06864a2 R08: 0000000000000000 R09: 000055c668ad0740
[ 669.686065] R10: 0000000000000000 R11: 0000000000003246 R12: 0000000000000000
[ 669.686065] R13: 000000000000000d R14: 000055c668a607d0 R15: 0000000000000000
[ 669.686067] ---[ end trace 47feccd771299f6b ]---
[ 669.686082] ------------[ cut here ]------------
[ 669.686158] WARNING: CPU: 6 PID: 1018 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:5458 amdgpu_dm_atomic_commit_tail+0x1c5f/0x1d70 [amdgpu]
[ 669.686158] Modules linked in: rfcomm(E) nf_tables(E) nfnetlink(E) bnep(E) edac_mce_amd(E) kvm_amd(E) kvm(E) irqbypass(E) crct10dif_pclmul(E) btusb(E) btrtl(E) snd_hda_codec_realtek(E) btbcm(E) crc32_pclmul(E) btintel(E) iwlmvm(E) snd_hda_codec_generic(E) bluetooth(E) ghash_clmulni_intel(E) ledtrig_audio(E) mac80211(E) libarc4(E) snd_hda_codec_hdmi(E) uvcvideo(E) snd_hda_intel(E) videobuf2_vmalloc(E) snd_usb_audio(E) snd_intel_nhlt(E) videobuf2_memops(E) drbg(E) snd_hda_codec(E) videobuf2_v4l2(E) snd_usbmidi_lib(E) iwlwifi(E) nls_ascii(E) snd_hda_core(E) snd_rawmidi(E) videobuf2_common(E) snd_seq_device(E) snd_hwdep(E) efi_pstore(E) nls_cp437(E) ansi_cprng(E) snd_pcm(E) videodev(E) sp5100_tco(E) aesni_intel(E) cfg80211(E) vfat(E) ecdh_generic(E) crypto_simd(E) ecc(E) snd_timer(E) fat(E) ccp(E) snd(E) cryptd(E) mc(E) glue_helper(E) crc16(E) wmi_bmof(E) pcspkr(E) efivars(E) k10temp(E) watchdog(E) sg(E) rfkill(E) soundcore(E) rng_core(E) evdev(E) acpi_cpufreq(E) nct6775(E) hwmon_vid(E)
[ 669.686175] parport_pc(E) ppdev(E) lp(E) parport(E) efivarfs(E) ip_tables(E) x_tables(E) autofs4(E) xfs(E) btrfs(E) xor(E) zstd_decompress(E) zstd_compress(E) raid6_pq(E) libcrc32c(E) crc32c_generic(E) sd_mod(E) hid_generic(E) usbhid(E) hid(E) amdgpu(E) gpu_sched(E) mxm_wmi(E) ahci(E) ttm(E) libahci(E) drm_kms_helper(E) xhci_pci(E) crc32c_intel(E) xhci_hcd(E) i2c_piix4(E) libata(E) drm(E) igb(E) dca(E) mfd_core(E) ptp(E) scsi_mod(E) usbcore(E) pps_core(E) i2c_algo_bit(E) nvme(E) nvme_core(E) wmi(E) button(E)
[ 669.686187] CPU: 6 PID: 1018 Comm: Xorg Tainted: G W E 5.4.0-rc7 #31 (closed)
[ 669.686187] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570 Taichi, BIOS P2.50 11/02/2019
[ 669.686258] RIP: 0010:amdgpu_dm_atomic_commit_tail+0x1c5f/0x1d70 [amdgpu]
[ 669.686259] Code: 48 c7 c2 60 d6 a2 c0 bf 02 00 00 00 48 c7 c6 80 f8 a9 c0 e8 e3 7d bb ff 49 8b 47 08 e9 31 e5 ff ff 0f 0b e9 b4 ec ff ff 0f 0b <0f>
0b e9 cb ec ff ff 48 8b 85 b0 fd ff ff 48 8d 8d 18 fe ff ff 48
[ 669.686259] RSP: 0018:ffffb80fc1a978d0 EFLAGS: 00010082
[ 669.686260] RAX: 0000000000000002 RBX: ffff9454b5d54c00 RCX: ffff9455ec2c6170
[ 669.686261] RDX: 0000000000000001 RSI: 0000000000000206 RDI: ffff9455eaba6158
[ 669.686261] RBP: ffffb80fc1a97b80 R08: 0000000000000005 R09: 0000000000000000
[ 669.686262] R10: ffffb80fc1a97838 R11: ffffb80fc1a9783c R12: 0000000000000206
[ 669.686263] R13: ffff9455ec2c6000 R14: ffff94559d443800 R15: ffff9455eda20000
[ 669.686264] FS: 00007fc6a5a21f00(0000) GS:ffff9455fe980000(0000) knlGS:0000000000000000
[ 669.686264] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 669.686265] CR2: 00007fc6a5991678 CR3: 00000007f0390000 CR4: 0000000000340ee0
[ 669.686266] Call Trace:
[ 669.686270] ? __irq_work_queue_local+0x50/0x60
[ 669.686277] ? commit_tail+0x94/0x110 [drm_kms_helper]
[ 669.686282] commit_tail+0x94/0x110 [drm_kms_helper]
[ 669.686288] drm_atomic_helper_commit+0xb8/0x130 [drm_kms_helper]
[ 669.686293] drm_atomic_helper_set_config+0x79/0x90 [drm_kms_helper]
[ 669.686304] drm_mode_setcrtc+0x194/0x6a0 [drm]
[ 669.686357] ? amdgpu_cs_wait_ioctl+0xeb/0x160 [amdgpu]
[ 669.686367] ? drm_mode_getcrtc+0x180/0x180 [drm]
[ 669.686377] drm_ioctl_kernel+0xaa/0xf0 [drm]
[ 669.686386] drm_ioctl+0x208/0x390 [drm]
[ 669.686396] ? drm_mode_getcrtc+0x180/0x180 [drm]
[ 669.686445] amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
[ 669.686447] do_vfs_ioctl+0x40e/0x670
[ 669.686449] ksys_ioctl+0x5e/0x90
[ 669.686451] __x64_sys_ioctl+0x16/0x20
[ 669.686453] do_syscall_64+0x52/0x160
[ 669.686454] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 669.686455] RIP: 0033:0x7fc6a5f6a5b7
[ 669.686457] Code: 00 00 90 48 8b 05 d9 78 0c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48>
3d 01 f0 ff ff 73 01 c3 48 8b 0d a9 78 0c 00 f7 d8 64 89 01 48
[ 669.686457] RSP: 002b:00007ffd36fb37a8 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
[ 669.686458] RAX: ffffffffffffffda RBX: 00007ffd36fb37e0 RCX: 00007fc6a5f6a5b7
[ 669.686459] RDX: 00007ffd36fb37e0 RSI: 00000000c06864a2 RDI: 000000000000000d
[ 669.686459] RBP: 00000000c06864a2 R08: 0000000000000000 R09: 000055c668ad0740
[ 669.686460] R10: 0000000000000000 R11: 0000000000003246 R12: 0000000000000000
[ 669.686461] R13: 000000000000000d R14: 000055c668a607d0 R15: 0000000000000000
[ 669.686462] ---[ end trace 47feccd771299f6c ]---