[gfxhub] page fault in game TW2
Hi,
Is this the fault of my setup, the game, or the driver? Can I do anything about it?
System: Arch Linux
xf86-video-amdgpu: 19.1.0-2
linux-zen: 5.8.12.zen1-1
Card: PowerColor Red Devil Radeon™ RX 5700 XT 8GB GDDR6
Game: Total War Warhammer 2 - https://www.feralinteractive.com/en/games/warhammer2tw/
The GPU reset is semi-successful: I can switch to a tty and restart the login manager (sometimes the login manager, lightdm, logs out by itself). The game, however, either closes or remains unresponsive, and red/green pixel artifacts cover the last 'hung' frame.
[ 8091.669629] amdgpu 0000:0b:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:128 vmid:1 pasid:32783, for process TotalWarhammer2 pid 23045 thread TotalWarha:cs0 pid 23049)
[ 8091.669632] amdgpu 0000:0b:00.0: amdgpu: in page starting at address 0x000080013289d000 from client 27
[ 8091.669633] amdgpu 0000:0b:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00100900
[ 8091.669634] amdgpu 0000:0b:00.0: amdgpu: Faulty UTCL2 client ID: 0x4
[ 8091.669635] amdgpu 0000:0b:00.0: amdgpu: MORE_FAULTS: 0x0
[ 8091.669636] amdgpu 0000:0b:00.0: amdgpu: WALKER_ERROR: 0x0
[ 8091.669637] amdgpu 0000:0b:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 8091.669637] amdgpu 0000:0b:00.0: amdgpu: MAPPING_ERROR: 0x1
[ 8091.669638] amdgpu 0000:0b:00.0: amdgpu: RW: 0x0
[ 8096.680516] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
[ 8097.192530] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
[ 8101.810552] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=480950, emitted seq=480952
[ 8101.810604] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process TotalWarhammer2 pid 23045 thread TotalWarha:cs0 pid 23049
[ 8101.810609] amdgpu 0000:0b:00.0: amdgpu: GPU reset begin!
[ 8101.810805] ------------[ cut here ]------------
[ 8101.810863] WARNING: CPU: 6 PID: 127 at drivers/gpu/drm/amd/amdgpu/../display/dc/dcn20/dcn20_resource.c:3194 dcn20_validate_bandwidth_fp+0x7a/0xb0 [amdgpu]
[ 8101.810863] Modules linked in: f2fs tun snd_usb_audio snd_usbmidi_lib snd_rawmidi snd_seq_device lz4hc lz4hc_compress cmac algif_hash algif_skcipher af_alg bnep mousedev joydev input_leds gspca_ov534 btusb gspca_main videobuf2_vmalloc btrtl videobuf2_memops btbcm videobuf2_v4l2 btintel videobuf2_common bluetooth videodev nls_iso8859_1 nls_cp437 uas vfat ecdh_generic usb_storage ecc mc fat hid_generic snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi edac_mce_amd kvm_amd snd_hda_intel snd_intel_dspcfg kvm snd_hda_codec eeepc_wmi asus_wmi snd_hda_core battery irqbypass r8169 sparse_keymap snd_hwdep crct10dif_pclmul crc32_pclmul snd_pcm ghash_clmulni_intel rfkill wmi_bmof ccp realtek aesni_intel snd_timer rng_core snd crypto_simd cryptd sp5100_tco glue_helper libphy soundcore i2c_piix4 k10temp rapl usbhid hid wmi evdev gpio_amdpt pinctrl_amd mac_hid acpi_cpufreq vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) crypto_user ip_tables x_tables ext4 crc32c_generic crc16
[ 8101.810881] mbcache jbd2 crc32c_intel xhci_pci xhci_pci_renesas xhci_hcd radeon amdgpu gpu_sched i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops cec rc_core ttm drm agpgart
[ 8101.810887] CPU: 6 PID: 127 Comm: kworker/6:1 Tainted: G OE 5.8.12-zen1-1-zen #1
[ 8101.810888] Hardware name: System manufacturer System Product Name/PRIME B450M-A, BIOS 2006 11/13/2019
[ 8101.810890] Workqueue: events drm_sched_job_timedout [gpu_sched]
[ 8101.810946] RIP: 0010:dcn20_validate_bandwidth_fp+0x7a/0xb0 [amdgpu]
[ 8101.810947] Code: 00 7b 35 22 85 a8 1e 00 00 75 2f 31 d2 f2 0f 11 85 d8 25 00 00 48 89 ee 4c 89 e7 e8 30 f6 ff ff 89 c2 22 95 a8 1e 00 00 75 2a <0f> 0b 48 89 9d d8 25 00 00 5b 5d 41 5c c3 75 c9 48 89 9d d8 25 00
[ 8101.810948] RSP: 0018:ffffa11bc05efc50 EFLAGS: 00010246
[ 8101.810949] RAX: 0000000000000001 RBX: 4079400000000000 RCX: 0000000000c58e06
[ 8101.810949] RDX: 0000000000000000 RSI: ff80cf65b775f52e RDI: 00000000000311a0
[ 8101.810949] RBP: ffff9307f1e40000 R08: 0000000000000000 R09: 0000000000000000
[ 8101.810950] R10: ffff9307f1e401b8 R11: ffff9307eede7800 R12: ffff930a82be0000
[ 8101.810950] R13: ffff9307f1e40000 R14: 0000000000000002 R15: ffff930a82be0000
[ 8101.810951] FS: 0000000000000000(0000) GS:ffff930a8e980000(0000) knlGS:0000000000000000
[ 8101.810951] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 8101.810952] CR2: 0000555bfa11ae08 CR3: 00000003d498a000 CR4: 0000000000340ee0
[ 8101.810952] Call Trace:
[ 8101.811009] dcn20_validate_bandwidth+0x24/0x40 [amdgpu]
[ 8101.811061] dc_validate_global_state+0x3b7/0x4b0 [amdgpu]
[ 8101.811116] amdgpu_dm_commit_zero_streams+0xfe/0x140 [amdgpu]
[ 8101.811171] dm_suspend+0x9a/0xb0 [amdgpu]
[ 8101.811211] amdgpu_device_ip_suspend_phase1+0x79/0xd0 [amdgpu]
[ 8101.811253] ? amdgpu_fence_process+0x44/0x150 [amdgpu]
[ 8101.811293] amdgpu_device_ip_suspend+0x1c/0x60 [amdgpu]
[ 8101.811350] amdgpu_device_gpu_recover.cold+0x65e/0xf6c [amdgpu]
[ 8101.811401] amdgpu_job_timedout+0x121/0x140 [amdgpu]
[ 8101.811403] drm_sched_job_timedout+0x64/0xe0 [gpu_sched]
[ 8101.811406] process_one_work+0x1da/0x3d0
[ 8101.811407] worker_thread+0x4d/0x460
[ 8101.811408] ? process_one_work+0x3d0/0x3d0
[ 8101.811409] kthread+0x19f/0x1d0
[ 8101.811410] ? __kthread_init_worker+0x50/0x50
[ 8101.811412] ret_from_fork+0x22/0x30
[ 8101.811413] ---[ end trace b7eb68a4c7ef9a32 ]---
[ 8102.202650] amdgpu 0000:0b:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
[ 8102.202699] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
[ 8102.312549] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
[ 8102.451782] amdgpu 0000:0b:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
[ 8102.451830] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KCQ disable failed
[ 8102.700976] [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
[ 8102.715486] [drm] free PSP TMR buffer
[ 8105.897156] amdgpu 0000:0b:00.0: amdgpu: GPU reset succeeded, trying to resume
[ 8105.897297] [drm] PCIE GART of 512M enabled (table at 0x0000008000E10000).
[ 8105.897324] [drm] VRAM is lost due to GPU reset!
[ 8105.909066] [drm] PSP is resuming...
[ 8106.078520] [drm] reserve 0x900000 from 0x81fe400000 for PSP TMR
[ 8106.147511] amdgpu 0000:0b:00.0: amdgpu: RAS: optional ras ta ucode is not available
[ 8106.153511] amdgpu: SMU is resuming...
[ 8106.155438] amdgpu: SMU is resumed successfully!
[ 8106.330103] [drm] kiq ring mec 2 pipe 1 q 0
[ 8106.334291] [drm] VCN decode and encode initialized successfully(under DPG Mode).
[ 8106.334421] [drm] JPEG decode initialized successfully.
[ 8106.334439] amdgpu 0000:0b:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[ 8106.334440] amdgpu 0000:0b:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[ 8106.334441] amdgpu 0000:0b:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[ 8106.334442] amdgpu 0000:0b:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
[ 8106.334442] amdgpu 0000:0b:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
[ 8106.334443] amdgpu 0000:0b:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
[ 8106.334444] amdgpu 0000:0b:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
[ 8106.334445] amdgpu 0000:0b:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
[ 8106.334445] amdgpu 0000:0b:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
[ 8106.334446] amdgpu 0000:0b:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
[ 8106.334447] amdgpu 0000:0b:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[ 8106.334447] amdgpu 0000:0b:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
[ 8106.334448] amdgpu 0000:0b:00.0: amdgpu: ring vcn_dec uses VM inv eng 0 on hub 1
[ 8106.334449] amdgpu 0000:0b:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 1 on hub 1
[ 8106.334449] amdgpu 0000:0b:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 4 on hub 1
[ 8106.334450] amdgpu 0000:0b:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 1
[ 8106.336121] [drm] recover vram bo from shadow start
[ 8106.363398] [drm] recover vram bo from shadow done
[ 8106.363400] [drm] Skip scheduling IBs!
[ 8106.363400] [drm] Skip scheduling IBs!
[ 8106.363410] [drm] Skip scheduling IBs!
[ 8106.363413] [drm] Skip scheduling IBs!
[ 8106.363421] [drm] Skip scheduling IBs!
[ 8106.363423] [drm] Skip scheduling IBs!
[ 8106.363426] [drm] Skip scheduling IBs!
[ 8106.363428] [drm] Skip scheduling IBs!
[ 8106.363430] [drm] Skip scheduling IBs!
[ 8106.363433] [drm] Skip scheduling IBs!
[ 8106.363434] [drm] Skip scheduling IBs!
[ 8106.363435] [drm] Skip scheduling IBs!
[ 8106.363437] [drm] Skip scheduling IBs!
[ 8106.363438] [drm] Skip scheduling IBs!
[ 8106.363439] [drm] Skip scheduling IBs!
[ 8106.363441] [drm] Skip scheduling IBs!
[ 8106.363443] [drm] Skip scheduling IBs!
[ 8106.363444] [drm] Skip scheduling IBs!
[ 8106.363446] [drm] Skip scheduling IBs!
[ 8106.363448] [drm] Skip scheduling IBs!
[ 8106.363449] [drm] Skip scheduling IBs!
[ 8106.363450] [drm] Skip scheduling IBs!
[ 8106.363451] [drm] Skip scheduling IBs!
[ 8106.363483] [drm] Skip scheduling IBs!
[ 8106.363485] amdgpu 0000:0b:00.0: amdgpu: GPU reset(2) succeeded!
[ 8106.363487] [drm] Skip scheduling IBs!
[ 8106.363489] [drm] Skip scheduling IBs!
[ 8106.363491] [drm] Skip scheduling IBs!
[ 8106.363492] [drm] Skip scheduling IBs!
[ 8106.363493] [drm] Skip scheduling IBs!
[ 8106.363495] [drm] Skip scheduling IBs!
[ 8106.363498] [drm] Skip scheduling IBs!
[ 8106.363500] [drm] Skip scheduling IBs!
[ 8106.363500] [drm] Skip scheduling IBs!
[ 8106.363501] [drm] Skip scheduling IBs!
[ 8106.363501] [drm] Skip scheduling IBs!
[ 8106.363503] [drm] Skip scheduling IBs!
[ 8106.363509] [drm] Skip scheduling IBs!
[ 8106.363511] [drm] Skip scheduling IBs!
[ 8106.363514] [drm] Skip scheduling IBs!
[ 8106.363530] [drm] Skip scheduling IBs!
[ 8106.363532] [drm] Skip scheduling IBs!
[ 8106.363534] [drm] Skip scheduling IBs!
[ 8106.363535] [drm] Skip scheduling IBs!
[ 8106.363536] [drm] Skip scheduling IBs!
[ 8106.363537] [drm] Skip scheduling IBs!
[ 8106.363539] [drm] Skip scheduling IBs!
[ 8106.363541] [drm] Skip scheduling IBs!
[ 8106.363542] [drm] Skip scheduling IBs!
[ 8106.363543] [drm] Skip scheduling IBs!
[ 8106.363545] [drm] Skip scheduling IBs!
[ 8106.363547] [drm] Skip scheduling IBs!
[ 8106.363549] [drm] Skip scheduling IBs!
[ 8106.363550] [drm] Skip scheduling IBs!
[ 8106.363551] [drm] Skip scheduling IBs!
[ 8106.363553] [drm] Skip scheduling IBs!
[ 8106.363554] [drm] Skip scheduling IBs!
[ 8106.363555] [drm] Skip scheduling IBs!
[ 8106.363556] [drm] Skip scheduling IBs!
[ 8106.363557] [drm] Skip scheduling IBs!
[ 8106.363558] [drm] Skip scheduling IBs!
[ 8106.363560] [drm] Skip scheduling IBs!
[ 8106.363561] [drm] Skip scheduling IBs!
[ 8106.363562] [drm] Skip scheduling IBs!
[ 8106.363564] [drm] Skip scheduling IBs!
[ 8106.363567] [drm] Skip scheduling IBs!
[ 8106.363569] [drm] Skip scheduling IBs!
[ 8106.363571] [drm] Skip scheduling IBs!
[ 8106.363572] [drm] Skip scheduling IBs!
[ 8106.363574] [drm] Skip scheduling IBs!
[ 8106.363575] [drm] Skip scheduling IBs!
[ 8106.363577] [drm] Skip scheduling IBs!
[ 8106.363579] [drm] Skip scheduling IBs!
[ 8106.363581] [drm] Skip scheduling IBs!
[ 8106.363582] [drm] Skip scheduling IBs!
[ 8106.363584] [drm] Skip scheduling IBs!
[ 8106.363585] [drm] Skip scheduling IBs!
[ 8106.363587] [drm] Skip scheduling IBs!
[ 8106.363588] [drm] Skip scheduling IBs!
[ 8106.363589] [drm] Skip scheduling IBs!
[ 8106.363591] [drm] Skip scheduling IBs!
[ 8106.363593] [drm] Skip scheduling IBs!
[ 8106.364504] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
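
For anyone reading the fault lines at the top of the log, here is a minimal sketch of how the raw GCVM_L2_PROTECTION_FAULT_STATUS value (0x00100900) splits into the fields the driver prints. The bit positions in FIELDS are an assumption based on the gfx10 register layout, not something stated in this report; they do reproduce the values logged above (client ID 0x4, MAPPING_ERROR 0x1, RW 0x0, vmid 1). The function name decode_fault_status is made up for this example.

```python
# Assumed bit layout of GCVM_L2_PROTECTION_FAULT_STATUS on gfx10,
# given as field name -> (shift, width in bits). Hypothetical table,
# checked only against the values printed in the log above.
FIELDS = {
    "MORE_FAULTS": (0, 1),
    "WALKER_ERROR": (1, 3),
    "PERMISSION_FAULTS": (4, 4),
    "MAPPING_ERROR": (8, 1),
    "CID": (9, 9),    # logged as "Faulty UTCL2 client ID"
    "RW": (18, 1),    # 0 = read, 1 = write
    "VMID": (20, 4),
}


def decode_fault_status(status: int) -> dict[str, int]:
    """Extract each bit field from the raw status register value."""
    return {
        name: (status >> shift) & ((1 << width) - 1)
        for name, (shift, width) in FIELDS.items()
    }


if __name__ == "__main__":
    # Value taken from the log line at timestamp 8091.669633.
    for name, value in decode_fault_status(0x00100900).items():
        print(f"{name}: {value:#x}")
```

Read this way, the status describes a read access (RW 0x0) by vmid 1 that hit an unmapped page (MAPPING_ERROR 0x1) with no permission or walker errors. That alone does not say whether the game or the driver is at fault.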