Navi 21 GFX hangs since "drm/amd/amdgpu: add pipe1 hardware support"
Brief summary of the problem:
GFX hangs occur shortly after logging into a GNOME Wayland session (which makes use of the EGL_IMG_context_priority
extension to get a high priority context for gnome-shell).
Output from journalctl --dmesg:
Aug 09 15:32:21 simula kernel: Command line: BOOT_IMAGE=/vmlinuz-5.19.0+ root=/dev/mapper/simula-root ro rd.neednet=1 amdgpu.lockup_timeout=20000 quiet splash
Aug 09 15:32:21 simula kernel: Kernel command line: BOOT_IMAGE=/vmlinuz-5.19.0+ root=/dev/mapper/simula-root ro rd.neednet=1 amdgpu.lockup_timeout=20000 quiet splash
Aug 09 15:32:21 simula kernel: ACPI: bus type drm_connector registered
Aug 09 15:32:24 simula kernel: [drm] amdgpu kernel modesetting enabled.
Aug 09 15:32:24 simula kernel: amdgpu: Ignoring ACPI CRAT on non-APU system
Aug 09 15:32:24 simula kernel: amdgpu: Virtual CRAT table created for CPU
Aug 09 15:32:24 simula kernel: amdgpu: Topology: Add CPU node
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: vgaarb: deactivate vga console
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: enabling device (0006 -> 0007)
Aug 09 15:32:24 simula kernel: [drm] initializing kernel modesetting (SIENNA_CICHLID 0x1002:0x73A3 0x1002:0x0E1E 0x00).
Aug 09 15:32:24 simula kernel: [drm] register mmio base: 0xFCA00000
Aug 09 15:32:24 simula kernel: [drm] register mmio size: 1048576
Aug 09 15:32:24 simula kernel: [drm] add ip block number 0 <nv_common>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 1 <gmc_v10_0>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 2 <navi10_ih>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 3 <psp>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 4 <smu>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 5 <dm>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 6 <gfx_v10_0>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 7 <sdma_v5_2>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 8 <vcn_v3_0>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 9 <jpeg_v3_0>
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: Fetched VBIOS from VFCT
Aug 09 15:32:24 simula kernel: amdgpu: ATOM BIOS: 113-D4300100-051
Aug 09 15:32:24 simula kernel: [drm] VCN(0) decode is enabled in VM mode
Aug 09 15:32:24 simula kernel: [drm] VCN(1) decode is enabled in VM mode
Aug 09 15:32:24 simula kernel: [drm] VCN(0) encode is enabled in VM mode
Aug 09 15:32:24 simula kernel: [drm] VCN(1) encode is enabled in VM mode
Aug 09 15:32:24 simula kernel: [drm] JPEG decode is enabled in VM mode
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: Trusted Memory Zone (TMZ) feature disabled as experimental (default)
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: MEM ECC is active.
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SRAM ECC is not presented.
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: RAS INFO: ras initialized successfully, hardware ability[101] ras_mask[101]
Aug 09 15:32:24 simula kernel: [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: BAR 2: releasing [mem 0x7ff0000000-0x7ff01fffff 64bit pref]
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: BAR 0: releasing [mem 0x7fe0000000-0x7fefffffff 64bit pref]
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: BAR 0: assigned [mem 0x1000000000-0x17ffffffff 64bit pref]
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: BAR 2: assigned [mem 0xc00000000-0xc001fffff 64bit pref]
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: VRAM: 30704M 0x0000008000000000 - 0x000000877EFFFFFF (30704M used)
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: AGP: 267878400M 0x0000008800000000 - 0x0000FFFFFFFFFFFF
Aug 09 15:32:24 simula kernel: [drm] Detected VRAM RAM=30704M, BAR=32768M
Aug 09 15:32:24 simula kernel: [drm] RAM width 256bits GDDR6
Aug 09 15:32:24 simula kernel: [drm] amdgpu: 30704M of VRAM memory ready
Aug 09 15:32:24 simula kernel: [drm] amdgpu: 16023M of GTT memory ready.
Aug 09 15:32:24 simula kernel: [drm] GART: num cpu pages 131072, num gpu pages 131072
Aug 09 15:32:24 simula kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000300000).
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: STB initialized to 2048 entries
Aug 09 15:32:24 simula kernel: [drm] Loading DMUB firmware via PSP: version=0x02020003
Aug 09 15:32:24 simula kernel: [drm] use_doorbell being set to: [true]
Aug 09 15:32:24 simula kernel: [drm] use_doorbell being set to: [true]
Aug 09 15:32:24 simula kernel: [drm] use_doorbell being set to: [true]
Aug 09 15:32:24 simula kernel: [drm] use_doorbell being set to: [true]
Aug 09 15:32:24 simula kernel: [drm] Found VCN firmware Version ENC: 1.13 DEC: 2 VEP: 0 Revision: 42
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: Will use PSP to load VCN firmware
Aug 09 15:32:24 simula kernel: [drm] reserve 0xa00000 from 0x8001000000 for PSP TMR
Aug 09 15:32:24 simula kernel: [drm] failed to load ucode GLOBAL_TAP_DELAYS(0x23)
Aug 09 15:32:24 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:32:24 simula kernel: [drm] failed to load ucode SE0_TAP_DELAYS(0x24)
Aug 09 15:32:24 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:32:24 simula kernel: [drm] failed to load ucode SE1_TAP_DELAYS(0x25)
Aug 09 15:32:24 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:32:24 simula kernel: [drm] failed to load ucode SE2_TAP_DELAYS(0x26)
Aug 09 15:32:24 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:32:24 simula kernel: [drm] failed to load ucode SE3_TAP_DELAYS(0x27)
Aug 09 15:32:24 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GECC is enabled
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: use vbios provided pptable
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SMU is initialized successfully!
Aug 09 15:32:25 simula kernel: [drm] Display Core initialized with v3.2.196!
Aug 09 15:32:25 simula kernel: [drm] DMUB hardware initialized: version=0x02020003
Aug 09 15:32:25 simula kernel: [drm] kiq ring mec 2 pipe 1 q 0
Aug 09 15:32:25 simula kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
Aug 09 15:32:25 simula kernel: [drm] JPEG decode initialized successfully.
Aug 09 15:32:25 simula kernel: kfd kfd: amdgpu: Allocated 3969056 bytes on gart
Aug 09 15:32:25 simula kernel: amdgpu: sdma_bitmap: ffff
Aug 09 15:32:25 simula kernel: amdgpu: HMM registered 30704MB device memory
Aug 09 15:32:25 simula kernel: amdgpu: SRAT table not found
Aug 09 15:32:25 simula kernel: amdgpu: Virtual CRAT table created for GPU
Aug 09 15:32:25 simula kernel: amdgpu: Topology: Add dGPU node [0x73a3:0x1002]
Aug 09 15:32:25 simula kernel: kfd kfd: amdgpu: added device 1002:73a3
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SE 4, SH per SE 2, CU per SH 10, active_cu_number 60
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 12 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma1 uses VM inv eng 14 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma2 uses VM inv eng 15 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma3 uses VM inv eng 16 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_dec_1 uses VM inv eng 5 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_1.0 uses VM inv eng 6 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_1.1 uses VM inv eng 7 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring jpeg_dec uses VM inv eng 8 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: Using BACO for runtime pm
Aug 09 15:32:25 simula kernel: [drm] Initialized amdgpu 3.48.0 20150101 for 0000:0c:00.0 on minor 0
Aug 09 15:32:25 simula kernel: fbcon: amdgpudrmfb (fb0) is primary device
Aug 09 15:32:25 simula kernel: [drm] DSC precompute is not needed.
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: [drm] fb0: amdgpudrmfb frame buffer device
Aug 09 15:34:03 simula systemd[1]: Starting Load Kernel Module drm...
Aug 09 15:34:03 simula kernel: snd_hda_intel 0000:0c:00.1: bound 0000:0c:00.0 (ops amdgpu_dm_audio_component_bind_ops [amdgpu])
Aug 09 15:34:39 simula kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=1376, emitted seq=1378
Aug 09 15:34:39 simula kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process gnome-shell pid 3138 thread gnome-shel:cs0 pid 3183
Aug 09 15:34:39 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GPU reset begin!
Aug 09 15:34:39 simula kernel: amdgpu 0000:0c:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
Aug 09 15:34:39 simula kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
Aug 09 15:34:39 simula kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
Aug 09 15:34:39 simula kernel: [drm] free PSP TMR buffer
Aug 09 15:34:40 simula kernel: amdgpu 0000:0c:00.0: amdgpu: MODE1 reset
Aug 09 15:34:40 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GPU mode1 reset
Aug 09 15:34:40 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GPU smu mode1 reset
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GPU reset succeeded, trying to resume
Aug 09 15:34:42 simula kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000300000).
Aug 09 15:34:42 simula kernel: [drm] VRAM is lost due to GPU reset!
Aug 09 15:34:42 simula kernel: [drm] PSP is resuming...
Aug 09 15:34:42 simula kernel: [drm] reserve 0xa00000 from 0x877c000000 for PSP TMR
Aug 09 15:34:42 simula kernel: [drm] failed to load ucode GLOBAL_TAP_DELAYS(0x23)
Aug 09 15:34:42 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:34:42 simula kernel: [drm] failed to load ucode SE0_TAP_DELAYS(0x24)
Aug 09 15:34:42 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:34:42 simula kernel: [drm] failed to load ucode SE1_TAP_DELAYS(0x25)
Aug 09 15:34:42 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:34:42 simula kernel: [drm] failed to load ucode SE2_TAP_DELAYS(0x26)
Aug 09 15:34:42 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:34:42 simula kernel: [drm] failed to load ucode SE3_TAP_DELAYS(0x27)
Aug 09 15:34:42 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GECC is enabled
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SMU is resuming...
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: use vbios provided pptable
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SMU is resumed successfully!
Aug 09 15:34:42 simula kernel: [drm] DMUB hardware initialized: version=0x02020003
Aug 09 15:34:43 simula kernel: [drm] kiq ring mec 2 pipe 1 q 0
Aug 09 15:34:43 simula kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
Aug 09 15:34:43 simula kernel: [drm] JPEG decode initialized successfully.
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 12 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma1 uses VM inv eng 14 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma2 uses VM inv eng 15 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma3 uses VM inv eng 16 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_dec_1 uses VM inv eng 5 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_1.0 uses VM inv eng 6 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_1.1 uses VM inv eng 7 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring jpeg_dec uses VM inv eng 8 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: recover vram bo from shadow start
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: recover vram bo from shadow done
Aug 09 15:34:43 simula kernel: [drm] Skip scheduling IBs!
Aug 09 15:34:43 simula kernel: [drm] Skip scheduling IBs!
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GPU reset(2) succeeded!
Aug 09 15:34:43 simula kernel: [drm] Skip scheduling IBs!
[...]
Aug 09 15:34:43 simula kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Aug 09 15:34:43 simula kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Aug 09 15:34:43 simula kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Aug 09 15:34:53 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CRTC:77:crtc-0] flip_done timed out
Aug 09 15:36:00 simula kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
[...]
Aug 09 15:44:19 simula kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Aug 09 15:44:29 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:44:29 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CRTC:77:crtc-0] commit wait timed out
Aug 09 15:44:39 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:44:39 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CONNECTOR:94:DP-1] commit wait timed out
Aug 09 15:44:50 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:44:50 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [PLANE:65:plane-5] commit wait timed out
Aug 09 15:45:00 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CRTC:77:crtc-0] flip_done timed out
Aug 09 15:45:10 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:45:10 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CRTC:77:crtc-0] commit wait timed out
Aug 09 15:45:20 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:45:20 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CONNECTOR:94:DP-1] commit wait timed out
Aug 09 15:45:31 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:45:31 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [PLANE:65:plane-5] commit wait timed out
Aug 09 15:45:41 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CRTC:77:crtc-0] flip_done timed out
Bisected to agd5f/linux@4c763180.
I tried amdgpu.async_gfx_ring=0 on the kernel command line as a workaround, but with that setting the amdgpu driver failed to initialize at all.
Hardware description:
- GPU: AMD Navi 21 GL-XL [Radeon PRO W6800]
System information:
- Kernel version: 5.19.0+
- Custom kernel: drm-next-2022-08-03 tag merged on top of linux-5.19.y branch
- Mesa 22.2.0-rc1 or newer
How to reproduce the issue:
- Boot kernel which has the bisected commit
- Log into GNOME Wayland session
- Run Steam in Big Picture mode
The GFX hang occurs within a few seconds of the Steam Big Picture mode starting up.
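To check whether a given boot reproduced the hang, the kernel log can be scanned for the ring timeout message shown in the journalctl output above. The following is only a minimal sketch: the function name find_ring_timeouts is hypothetical, and the pattern assumes the message keeps the exact wording seen in this log ("ring <name> timeout, signaled seq=N, emitted seq=M"), which may differ on other kernel versions.

```python
import re

# Assumption: the timeout message has the same wording as in the log above.
TIMEOUT_RE = re.compile(
    r"ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)


def find_ring_timeouts(lines):
    """Return one dict per ring timeout found in an iterable of log lines."""
    hits = []
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            hits.append(
                {
                    "ring": match.group("ring"),
                    "signaled": int(match.group("signaled")),
                    "emitted": int(match.group("emitted")),
                }
            )
    return hits
```

One way to use it would be to feed it the lines of saved journalctl --dmesg output; an empty result suggests that no ring timeout was logged for that boot.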
Edited by Michel Dänzer