5700XT crashes when playing games
Description
Last month Cyberpunk only crashed every 2 hours or so but now it crashes every time I start the game (usually around 5 seconds after I load into the world). I'm using mesa-git 21.0.0_devel 33a6c01e and 04c7fce7 (mesa-git AUR and mesa-git community) and the weird thing is that everything works perfectly with mesa 20.3.1.
The screen freezes and the GPU resets with checkerboarded artifacts on both of my monitors. After I logout and try to run the game again without rebooting, the game doesn't start. By running (not rebooted)
WINEPREFIX=/media/nvme960Evo/Steam\ Library/steamapps/compatdata/1091500/pfx/ .steam/steam/steamapps/common/Proton\ -\ Experimental/dist/bin/wine64 /media/nvme960Evo/Steam\ Library/steamapps/common/Cyberpunk\ 2077/REDprelauncher.exe
, I get the error floating point exception (core dumped)
Photo and video
Log files (for system lockups / game freezes / crashes)
journalctl
jan 03 14:39:13 Erik-Manjaro kernel: [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
jan 03 14:39:18 Erik-Manjaro kernel: [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
jan 03 14:39:18 Erik-Manjaro kernel: [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
jan 03 14:39:18 Erik-Manjaro kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=364512, emitted seq=364514
jan 03 14:39:18 Erik-Manjaro kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Cyberpunk2077.e pid 21007 thread Cyberpunk2:cs0 pid 21091
jan 03 14:39:18 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: GPU reset begin!
jan 03 14:39:22 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: failed to suspend display audio
jan 03 14:39:22 Erik-Manjaro kernel: ------------[ cut here ]------------
jan 03 14:39:22 Erik-Manjaro kernel: WARNING: CPU: 1 PID: 20688 at drivers/gpu/drm/amd/amdgpu/../display/dc/dcn20/dcn20_resource.c:3240 dcn20_validate_bandwidth_fp+0x91/0xe0 [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: Modules linked in: rfcomm cmac algif_hash algif_skcipher af_alg bnep btusb btrtl btbcm btintel bluetooth snd_usb_audio snd_usbmidi_lib snd_rawmidi snd_seq_device mc ecdh_gen>
jan 03 14:39:22 Erik-Manjaro kernel: xt_conntrack pcspkr gpu_sched soundcore sp5100_tco rng_core k10temp i2c_piix4 i2c_algo_bit dca rfkill wmi evdev mac_hid pinctrl_amd acpi_cpufreq gpio_amdpt ip6table_filter >
jan 03 14:39:22 Erik-Manjaro kernel: CPU: 1 PID: 20688 Comm: kworker/1:4 Not tainted 5.10.3-106-tkg-upds #1
jan 03 14:39:22 Erik-Manjaro kernel: Hardware name: Gigabyte Technology Co., Ltd. X470 AORUS GAMING 5 WIFI/X470 AORUS GAMING 5 WIFI-CF, BIOS F60e 12/09/2020
jan 03 14:39:22 Erik-Manjaro kernel: Workqueue: events drm_sched_job_timedout [gpu_sched]
jan 03 14:39:22 Erik-Manjaro kernel: RIP: 0010:dcn20_validate_bandwidth_fp+0x91/0xe0 [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: Code: 21 c2 89 d0 84 d2 75 32 31 d2 f2 0f 11 85 58 26 00 00 48 89 ee 4c 89 e7 e8 2c f6 ff ff 0f b6 95 14 1f 00 00 21 c2 84 d2 75 30 <0f> 0b 48 89 9d 58 26 00 00 5b 5d 41 5c >
jan 03 14:39:22 Erik-Manjaro kernel: RSP: 0018:ffffa3bb80943bf8 EFLAGS: 00010246
jan 03 14:39:22 Erik-Manjaro kernel: RAX: 0000000000000001 RBX: 4079400000000000 RCX: 0000000003a9a001
jan 03 14:39:22 Erik-Manjaro kernel: RDX: 0000000000000000 RSI: cc4ce5e37c06ac9e RDI: 00000000000301a0
jan 03 14:39:22 Erik-Manjaro kernel: RBP: ffff9a3ec0e80000 R08: 0000000000000006 R09: ffff9a3fa7d00000
jan 03 14:39:22 Erik-Manjaro kernel: R10: ffff9a40f7b79000 R11: 0000000100000001 R12: ffff9a3fa7d00000
jan 03 14:39:22 Erik-Manjaro kernel: R13: ffff9a3fa7d00000 R14: ffff9a3f8db40800 R15: ffff9a3ec0e80000
jan 03 14:39:22 Erik-Manjaro kernel: FS: 0000000000000000(0000) GS:ffff9a42aea40000(0000) knlGS:0000000000000000
jan 03 14:39:22 Erik-Manjaro kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
jan 03 14:39:22 Erik-Manjaro kernel: CR2: 00007f837c19f000 CR3: 000000028d3ac000 CR4: 0000000000750ee0
jan 03 14:39:22 Erik-Manjaro kernel: PKRU: 55555554
jan 03 14:39:22 Erik-Manjaro kernel: Call Trace:
jan 03 14:39:22 Erik-Manjaro kernel: dcn20_validate_bandwidth+0x24/0x40 [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: dc_validate_global_state+0x2e3/0x380 [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: ? dc_rem_all_planes_for_stream+0xca/0x110 [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: dm_suspend+0x17c/0x1c0 [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: amdgpu_device_ip_suspend_phase1+0x72/0xd0 [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: ? _raw_spin_lock+0x13/0x30
jan 03 14:39:22 Erik-Manjaro kernel: amdgpu_device_ip_suspend+0x1c/0x60 [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: amdgpu_device_pre_asic_reset+0x185/0x19c [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: amdgpu_device_gpu_recover.cold+0x5f0/0x98d [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: amdgpu_job_timedout+0x121/0x140 [amdgpu]
jan 03 14:39:22 Erik-Manjaro kernel: drm_sched_job_timedout+0x64/0xe0 [gpu_sched]
jan 03 14:39:22 Erik-Manjaro kernel: process_one_work+0x1d9/0x3a0
jan 03 14:39:22 Erik-Manjaro kernel: worker_thread+0x4d/0x3d0
jan 03 14:39:22 Erik-Manjaro kernel: ? rescuer_thread+0x410/0x410
jan 03 14:39:22 Erik-Manjaro kernel: kthread+0x14c/0x170
jan 03 14:39:22 Erik-Manjaro kernel: ? __kthread_bind_mask+0x60/0x60
jan 03 14:39:22 Erik-Manjaro kernel: ret_from_fork+0x22/0x30
jan 03 14:39:22 Erik-Manjaro kernel: ---[ end trace be929cc27fc75364 ]---
jan 03 14:39:22 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
jan 03 14:39:22 Erik-Manjaro kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
jan 03 14:39:23 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
jan 03 14:39:23 Erik-Manjaro kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KCQ disable failed
jan 03 14:39:23 Erik-Manjaro kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
jan 03 14:39:23 Erik-Manjaro kernel: [drm] free PSP TMR buffer
jan 03 14:39:23 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: BACO reset
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: GPU reset succeeded, trying to resume
jan 03 14:39:26 Erik-Manjaro kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000300000).
jan 03 14:39:26 Erik-Manjaro kernel: [drm] VRAM is lost due to GPU reset!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] PSP is resuming...
jan 03 14:39:26 Erik-Manjaro kernel: [drm] reserve 0x900000 from 0x81fe400000 for PSP TMR
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: RAS: optional ras ta ucode is not available
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: RAP: optional rap ta ucode is not available
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: SMU is resuming...
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: smu driver if version = 0x00000036, smu fw if version = 0x00000037, smu fw version = 0x002a3d00 (42.61.0)
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: SMU driver if version not matched
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: SMU is resumed successfully!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] kiq ring mec 2 pipe 1 q 0
jan 03 14:39:26 Erik-Manjaro kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
jan 03 14:39:26 Erik-Manjaro kernel: [drm] JPEG decode initialized successfully.
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_dec uses VM inv eng 0 on hub 1
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 1 on hub 1
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 4 on hub 1
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 1
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: recover vram bo from shadow start
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: recover vram bo from shadow done
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: amdgpu 0000:0c:00.0: amdgpu: GPU reset(2) succeeded!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro kernel: [drm] Skip scheduling IBs!
jan 03 14:39:26 Erik-Manjaro /usr/lib/gdm-x-session[2143]: amdgpu: amdgpu_cs_query_fence_status failed.
jan 03 14:39:26 Erik-Manjaro /usr/lib/gdm-x-session[2143]: amdgpu: amdgpu_cs_query_fence_status failed.
dmesg
[ +19,889205] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
[ +0,000058] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
[ +0,130288] [UFW BLOCK] IN=enp7s0 OUT= MAC=01:00:5e:00:00:01:b0:6e:bf:62:3b:a8:08:00 SRC=192.168.1.1 DST=224.0.0.1 LEN=36 TOS=0x00 PREC=0x00 TTL=1 ID=32390 DF PROTO=2
[ +4,989567] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
[ +0,649979] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=1512204, emitted seq=1512206
[ +0,000031] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Cyberpunk2077.e pid 105440 thread Cyberpunk2:cs0 pid 105489
[ +0,000003] amdgpu 0000:0c:00.0: amdgpu: GPU reset begin!
[ +3,999948] amdgpu 0000:0c:00.0: amdgpu: failed to suspend display audio
[ +0,000417] ------------[ cut here ]------------
[ +0,000051] WARNING: CPU: 5 PID: 610 at drivers/gpu/drm/amd/amdgpu/../display/dc/dcn20/dcn20_resource.c:3240 dcn20_validate_bandwidth_fp+0x91/0xe0 [amdgpu]
[ +0,000001] Modules linked in: rfcomm cmac algif_hash algif_skcipher af_alg bnep btusb btrtl btbcm btintel bluetooth snd_usb_audio snd_usbmidi_lib snd_rawmidi snd_seq_device mc ecdh_generic cdc_acm ecc hid_log>
[ +0,000026] xt_conntrack pcspkr soundcore dca i2c_algo_bit sp5100_tco k10temp i2c_piix4 rfkill rng_core pinctrl_amd gpio_amdpt evdev wmi mac_hid acpi_cpufreq ip6table_filter ip6_tables nf_conntrack_netbios_ns>
[ +0,000020] CPU: 5 PID: 610 Comm: kworker/5:3 Not tainted 5.10.3-106-tkg-upds #1
[ +0,000001] Hardware name: Gigabyte Technology Co., Ltd. X470 AORUS GAMING 5 WIFI/X470 AORUS GAMING 5 WIFI-CF, BIOS F60e 12/09/2020
[ +0,000002] Workqueue: events drm_sched_job_timedout [gpu_sched]
[ +0,000037] RIP: 0010:dcn20_validate_bandwidth_fp+0x91/0xe0 [amdgpu]
[ +0,000001] Code: 21 c2 89 d0 84 d2 75 32 31 d2 f2 0f 11 85 58 26 00 00 48 89 ee 4c 89 e7 e8 2c f6 ff ff 0f b6 95 14 1f 00 00 21 c2 84 d2 75 30 <0f> 0b 48 89 9d 58 26 00 00 5b 5d 41 5c c3 75 bf 48 89 9d 58 26 >
[ +0,000001] RSP: 0018:ffff98fe815d3bf8 EFLAGS: 00010246
[ +0,000001] RAX: 0000000000000001 RBX: 4079400000000000 RCX: 00000000025ab005
[ +0,000001] RDX: 0000000000000000 RSI: 3e26bfc66e0d3dd3 RDI: 00000000000301a0
[ +0,000000] RBP: ffff88d64b3c0000 R08: 0000000000000006 R09: ffff88d3e5170000
[ +0,000000] R10: ffff88d4b7892000 R11: 0000000100000001 R12: ffff88d3e5170000
[ +0,000001] R13: ffff88d3e5170000 R14: ffff88d3e4c07000 R15: ffff88d64b3c0000
[ +0,000000] FS: 0000000000000000(0000) GS:ffff88d6eeb40000(0000) knlGS:0000000000000000
[ +0,000001] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ +0,000000] CR2: 00007f8e23cebfd8 CR3: 000000015c51a000 CR4: 0000000000750ee0
[ +0,000001] PKRU: 55555554
[ +0,000000] Call Trace:
[ +0,000037] dcn20_validate_bandwidth+0x24/0x40 [amdgpu]
[ +0,000034] dc_validate_global_state+0x2e3/0x380 [amdgpu]
[ +0,000033] ? dc_rem_all_planes_for_stream+0xca/0x110 [amdgpu]
[ +0,000034] dm_suspend+0x17c/0x1c0 [amdgpu]
[ +0,000024] amdgpu_device_ip_suspend_phase1+0x72/0xd0 [amdgpu]
[ +0,000003] ? _raw_spin_lock+0x13/0x30
[ +0,000021] amdgpu_device_ip_suspend+0x1c/0x60 [amdgpu]
[ +0,000039] amdgpu_device_pre_asic_reset+0x185/0x19c [amdgpu]
[ +0,000035] amdgpu_device_gpu_recover.cold+0x5f0/0x98d [amdgpu]
[ +0,000031] amdgpu_job_timedout+0x121/0x140 [amdgpu]
[ +0,000003] drm_sched_job_timedout+0x64/0xe0 [gpu_sched]
[ +0,000003] process_one_work+0x1d9/0x3a0
[ +0,000001] ? rescuer_thread+0x410/0x410
[ +0,000001] worker_thread+0x4d/0x3d0
[ +0,000001] ? rescuer_thread+0x410/0x410
[ +0,000001] kthread+0x14c/0x170
[ +0,000001] ? __kthread_bind_mask+0x60/0x60
[ +0,000001] ret_from_fork+0x22/0x30
[ +0,000002] ---[ end trace 26a7f5b99d116c12 ]---
[ +0,469094] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
[ +0,005707] amdgpu 0000:0c:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
[ +0,000029] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
[ +0,293821] amdgpu 0000:0c:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
[ +0,000026] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KCQ disable failed
[ +0,294143] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
[ +0,018205] [drm] free PSP TMR buffer
[ +0,045128] amdgpu 0000:0c:00.0: amdgpu: BACO reset
[ +3,142917] amdgpu 0000:0c:00.0: amdgpu: GPU reset succeeded, trying to resume
[ +0,000219] [drm] PCIE GART of 512M enabled (table at 0x0000008000300000).
[ +0,000129] [drm] VRAM is lost due to GPU reset!
[ +0,011269] [drm] PSP is resuming...
[ +0,068286] [drm] reserve 0x900000 from 0x81fe400000 for PSP TMR
[ +0,095991] amdgpu 0000:0c:00.0: amdgpu: RAS: optional ras ta ucode is not available
[ +0,011996] amdgpu 0000:0c:00.0: amdgpu: RAP: optional rap ta ucode is not available
[ +0,000002] amdgpu 0000:0c:00.0: amdgpu: SMU is resuming...
[ +0,000004] amdgpu 0000:0c:00.0: amdgpu: smu driver if version = 0x00000036, smu fw if version = 0x00000037, smu fw version = 0x002a3d00 (42.61.0)
[ +0,000001] amdgpu 0000:0c:00.0: amdgpu: SMU driver if version not matched
[ +0,002461] amdgpu 0000:0c:00.0: amdgpu: SMU is resumed successfully!
[ +0,137037] [drm] kiq ring mec 2 pipe 1 q 0
[ +0,007150] [drm] VCN decode and encode initialized successfully(under DPG Mode).
[ +0,000130] [drm] JPEG decode initialized successfully.
[ +0,000034] amdgpu 0000:0c:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[ +0,000001] amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[ +0,000000] amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[ +0,000001] amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
[ +0,000000] amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
[ +0,000001] amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
[ +0,000000] amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
[ +0,000001] amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
[ +0,000000] amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
[ +0,000001] amdgpu 0000:0c:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
[ +0,000000] amdgpu 0000:0c:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[ +0,000001] amdgpu 0000:0c:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
[ +0,000000] amdgpu 0000:0c:00.0: amdgpu: ring vcn_dec uses VM inv eng 0 on hub 1
[ +0,000001] amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 1 on hub 1
[ +0,000001] amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 4 on hub 1
[ +0,000000] amdgpu 0000:0c:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 1
[ +0,002753] amdgpu 0000:0c:00.0: amdgpu: recover vram bo from shadow start
[ +0,017047] amdgpu 0000:0c:00.0: amdgpu: recover vram bo from shadow done
[ +0,000002] [drm] Skip scheduling IBs!
[ +0,000001] [drm] Skip scheduling IBs!
[ +0,000016] amdgpu 0000:0c:00.0: amdgpu: GPU reset(2) succeeded!
radv_dumps_105440_2021.01.03_17.36.40.tar
System information
- OS: Manjaro Linux
- GPU: Radeon RX 5700 XT
- Kernel version: Linux 5.10.3-106-tkg-upds
- Mesa version: Mesa 21.0.0-devel (git-33a6c01e)
- Desktop environment: i3 with picom-jonaburg-git
- Xserver version: 1.20.10
- VKD3D version: vkd3d-1.1-1740-gd003424b
- Wine/Proton version: Proton Experimental-5.13-20201218c
### Regression
Everything worked with Mesa 20.3.1 (laggier than master)
API captures
I'll add a comment with a recorded GFXReconstruct
Further information
I tried running the game with RADV LLVM and with AMDVLK 2.0.170 and it still crashed
My environment doesn't set ACO_DEBUG
, RADV_DEBUG
, and RADV_PERFTEST