Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Register
  • Sign in
  • A amd
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 1,517
    • Issues 1,517
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 1
    • Merge requests 1
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Packages and registries
    • Packages and registries
    • Container Registry
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar

Due to an influx of spam, we have had to impose restrictions on new accounts. Please see this wiki page for instructions on how to get full permissions. Sorry for the inconvenience.

  • drm
  • amd
  • Issues
  • #2117
Closed
Open
Issue created Aug 09, 2022 by Michel Dänzer@daenzerDeveloper

Navi 21 GFX hangs since "drm/amd/amdgpu: add pipe1 hardware support"

Brief summary of the problem:

GFX hangs occur shortly after logging into a GNOME Wayland session (which makes use of the EGL_IMG_context_priority extension to get a high priority context for gnome-shell).

Output from journalctl --dmesg:

Aug 09 15:32:21 simula kernel: Command line: BOOT_IMAGE=/vmlinuz-5.19.0+ root=/dev/mapper/simula-root ro rd.neednet=1 amdgpu.lockup_timeout=20000 quiet splash
Aug 09 15:32:21 simula kernel: Kernel command line: BOOT_IMAGE=/vmlinuz-5.19.0+ root=/dev/mapper/simula-root ro rd.neednet=1 amdgpu.lockup_timeout=20000 quiet splash
Aug 09 15:32:21 simula kernel: ACPI: bus type drm_connector registered
Aug 09 15:32:24 simula kernel: [drm] amdgpu kernel modesetting enabled.
Aug 09 15:32:24 simula kernel: amdgpu: Ignoring ACPI CRAT on non-APU system
Aug 09 15:32:24 simula kernel: amdgpu: Virtual CRAT table created for CPU
Aug 09 15:32:24 simula kernel: amdgpu: Topology: Add CPU node
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: vgaarb: deactivate vga console
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: enabling device (0006 -> 0007)
Aug 09 15:32:24 simula kernel: [drm] initializing kernel modesetting (SIENNA_CICHLID 0x1002:0x73A3 0x1002:0x0E1E 0x00).
Aug 09 15:32:24 simula kernel: [drm] register mmio base: 0xFCA00000
Aug 09 15:32:24 simula kernel: [drm] register mmio size: 1048576
Aug 09 15:32:24 simula kernel: [drm] add ip block number 0 <nv_common>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 1 <gmc_v10_0>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 2 <navi10_ih>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 3 <psp>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 4 <smu>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 5 <dm>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 6 <gfx_v10_0>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 7 <sdma_v5_2>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 8 <vcn_v3_0>
Aug 09 15:32:24 simula kernel: [drm] add ip block number 9 <jpeg_v3_0>
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: Fetched VBIOS from VFCT
Aug 09 15:32:24 simula kernel: amdgpu: ATOM BIOS: 113-D4300100-051
Aug 09 15:32:24 simula kernel: [drm] VCN(0) decode is enabled in VM mode
Aug 09 15:32:24 simula kernel: [drm] VCN(1) decode is enabled in VM mode
Aug 09 15:32:24 simula kernel: [drm] VCN(0) encode is enabled in VM mode
Aug 09 15:32:24 simula kernel: [drm] VCN(1) encode is enabled in VM mode
Aug 09 15:32:24 simula kernel: [drm] JPEG decode is enabled in VM mode
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: Trusted Memory Zone (TMZ) feature disabled as experimental (default)
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: MEM ECC is active.
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SRAM ECC is not presented.
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: RAS INFO: ras initialized successfully, hardware ability[101] ras_mask[101]
Aug 09 15:32:24 simula kernel: [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: BAR 2: releasing [mem 0x7ff0000000-0x7ff01fffff 64bit pref]
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: BAR 0: releasing [mem 0x7fe0000000-0x7fefffffff 64bit pref]
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: BAR 0: assigned [mem 0x1000000000-0x17ffffffff 64bit pref]
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: BAR 2: assigned [mem 0xc00000000-0xc001fffff 64bit pref]
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: VRAM: 30704M 0x0000008000000000 - 0x000000877EFFFFFF (30704M used)
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: AGP: 267878400M 0x0000008800000000 - 0x0000FFFFFFFFFFFF
Aug 09 15:32:24 simula kernel: [drm] Detected VRAM RAM=30704M, BAR=32768M
Aug 09 15:32:24 simula kernel: [drm] RAM width 256bits GDDR6
Aug 09 15:32:24 simula kernel: [drm] amdgpu: 30704M of VRAM memory ready
Aug 09 15:32:24 simula kernel: [drm] amdgpu: 16023M of GTT memory ready.
Aug 09 15:32:24 simula kernel: [drm] GART: num cpu pages 131072, num gpu pages 131072
Aug 09 15:32:24 simula kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000300000).
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: STB initialized to 2048 entries
Aug 09 15:32:24 simula kernel: [drm] Loading DMUB firmware via PSP: version=0x02020003
Aug 09 15:32:24 simula kernel: [drm] use_doorbell being set to: [true]
Aug 09 15:32:24 simula kernel: [drm] use_doorbell being set to: [true]
Aug 09 15:32:24 simula kernel: [drm] use_doorbell being set to: [true]
Aug 09 15:32:24 simula kernel: [drm] use_doorbell being set to: [true]
Aug 09 15:32:24 simula kernel: [drm] Found VCN firmware Version ENC: 1.13 DEC: 2 VEP: 0 Revision: 42
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: Will use PSP to load VCN firmware
Aug 09 15:32:24 simula kernel: [drm] reserve 0xa00000 from 0x8001000000 for PSP TMR
Aug 09 15:32:24 simula kernel: [drm] failed to load ucode GLOBAL_TAP_DELAYS(0x23) 
Aug 09 15:32:24 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:32:24 simula kernel: [drm] failed to load ucode SE0_TAP_DELAYS(0x24) 
Aug 09 15:32:24 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:32:24 simula kernel: [drm] failed to load ucode SE1_TAP_DELAYS(0x25) 
Aug 09 15:32:24 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:32:24 simula kernel: [drm] failed to load ucode SE2_TAP_DELAYS(0x26) 
Aug 09 15:32:24 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:32:24 simula kernel: [drm] failed to load ucode SE3_TAP_DELAYS(0x27) 
Aug 09 15:32:24 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GECC is enabled
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Aug 09 15:32:24 simula kernel: amdgpu 0000:0c:00.0: amdgpu: use vbios provided pptable
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SMU is initialized successfully!
Aug 09 15:32:25 simula kernel: [drm] Display Core initialized with v3.2.196!
Aug 09 15:32:25 simula kernel: [drm] DMUB hardware initialized: version=0x02020003
Aug 09 15:32:25 simula kernel: [drm] kiq ring mec 2 pipe 1 q 0
Aug 09 15:32:25 simula kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
Aug 09 15:32:25 simula kernel: [drm] JPEG decode initialized successfully.
Aug 09 15:32:25 simula kernel: kfd kfd: amdgpu: Allocated 3969056 bytes on gart
Aug 09 15:32:25 simula kernel: amdgpu: sdma_bitmap: ffff
Aug 09 15:32:25 simula kernel: amdgpu: HMM registered 30704MB device memory
Aug 09 15:32:25 simula kernel: amdgpu: SRAT table not found
Aug 09 15:32:25 simula kernel: amdgpu: Virtual CRAT table created for GPU
Aug 09 15:32:25 simula kernel: amdgpu: Topology: Add dGPU node [0x73a3:0x1002]
Aug 09 15:32:25 simula kernel: kfd kfd: amdgpu: added device 1002:73a3
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SE 4, SH per SE 2, CU per SH 10, active_cu_number 60
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 12 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma1 uses VM inv eng 14 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma2 uses VM inv eng 15 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma3 uses VM inv eng 16 on hub 0
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_dec_1 uses VM inv eng 5 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_1.0 uses VM inv eng 6 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_1.1 uses VM inv eng 7 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring jpeg_dec uses VM inv eng 8 on hub 1
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: amdgpu: Using BACO for runtime pm
Aug 09 15:32:25 simula kernel: [drm] Initialized amdgpu 3.48.0 20150101 for 0000:0c:00.0 on minor 0
Aug 09 15:32:25 simula kernel: fbcon: amdgpudrmfb (fb0) is primary device
Aug 09 15:32:25 simula kernel: [drm] DSC precompute is not needed.
Aug 09 15:32:25 simula kernel: amdgpu 0000:0c:00.0: [drm] fb0: amdgpudrmfb frame buffer device
Aug 09 15:34:03 simula systemd[1]: Starting Load Kernel Module drm...
Aug 09 15:34:03 simula kernel: snd_hda_intel 0000:0c:00.1: bound 0000:0c:00.0 (ops amdgpu_dm_audio_component_bind_ops [amdgpu])
Aug 09 15:34:39 simula kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=1376, emitted seq=1378
Aug 09 15:34:39 simula kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process gnome-shell pid 3138 thread gnome-shel:cs0 pid 3183
Aug 09 15:34:39 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GPU reset begin!
Aug 09 15:34:39 simula kernel: amdgpu 0000:0c:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
Aug 09 15:34:39 simula kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
Aug 09 15:34:39 simula kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
Aug 09 15:34:39 simula kernel: [drm] free PSP TMR buffer
Aug 09 15:34:40 simula kernel: amdgpu 0000:0c:00.0: amdgpu: MODE1 reset
Aug 09 15:34:40 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GPU mode1 reset
Aug 09 15:34:40 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GPU smu mode1 reset
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GPU reset succeeded, trying to resume
Aug 09 15:34:42 simula kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000300000).
Aug 09 15:34:42 simula kernel: [drm] VRAM is lost due to GPU reset!
Aug 09 15:34:42 simula kernel: [drm] PSP is resuming...
Aug 09 15:34:42 simula kernel: [drm] reserve 0xa00000 from 0x877c000000 for PSP TMR
Aug 09 15:34:42 simula kernel: [drm] failed to load ucode GLOBAL_TAP_DELAYS(0x23) 
Aug 09 15:34:42 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:34:42 simula kernel: [drm] failed to load ucode SE0_TAP_DELAYS(0x24) 
Aug 09 15:34:42 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:34:42 simula kernel: [drm] failed to load ucode SE1_TAP_DELAYS(0x25) 
Aug 09 15:34:42 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:34:42 simula kernel: [drm] failed to load ucode SE2_TAP_DELAYS(0x26) 
Aug 09 15:34:42 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:34:42 simula kernel: [drm] failed to load ucode SE3_TAP_DELAYS(0x27) 
Aug 09 15:34:42 simula kernel: [drm] psp gfx command LOAD_IP_FW(0x6) failed and response status is (0xFFFF0010)
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GECC is enabled
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SMU is resuming...
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: use vbios provided pptable
Aug 09 15:34:42 simula kernel: amdgpu 0000:0c:00.0: amdgpu: SMU is resumed successfully!
Aug 09 15:34:42 simula kernel: [drm] DMUB hardware initialized: version=0x02020003
Aug 09 15:34:43 simula kernel: [drm] kiq ring mec 2 pipe 1 q 0
Aug 09 15:34:43 simula kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
Aug 09 15:34:43 simula kernel: [drm] JPEG decode initialized successfully.
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 12 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma1 uses VM inv eng 14 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma2 uses VM inv eng 15 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring sdma3 uses VM inv eng 16 on hub 0
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_dec_1 uses VM inv eng 5 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_1.0 uses VM inv eng 6 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring vcn_enc_1.1 uses VM inv eng 7 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: ring jpeg_dec uses VM inv eng 8 on hub 1
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: recover vram bo from shadow start
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: recover vram bo from shadow done
Aug 09 15:34:43 simula kernel: [drm] Skip scheduling IBs!
Aug 09 15:34:43 simula kernel: [drm] Skip scheduling IBs!
Aug 09 15:34:43 simula kernel: amdgpu 0000:0c:00.0: amdgpu: GPU reset(2) succeeded!
Aug 09 15:34:43 simula kernel: [drm] Skip scheduling IBs!
[...]
Aug 09 15:34:43 simula kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Aug 09 15:34:43 simula kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Aug 09 15:34:43 simula kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Aug 09 15:34:53 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CRTC:77:crtc-0] flip_done timed out
Aug 09 15:36:00 simula kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
[...]
Aug 09 15:44:19 simula kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Aug 09 15:44:29 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:44:29 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CRTC:77:crtc-0] commit wait timed out
Aug 09 15:44:39 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:44:39 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CONNECTOR:94:DP-1] commit wait timed out
Aug 09 15:44:50 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:44:50 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [PLANE:65:plane-5] commit wait timed out
Aug 09 15:45:00 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CRTC:77:crtc-0] flip_done timed out
Aug 09 15:45:10 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:45:10 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CRTC:77:crtc-0] commit wait timed out
Aug 09 15:45:20 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:45:20 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CONNECTOR:94:DP-1] commit wait timed out
Aug 09 15:45:31 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* flip_done timed out
Aug 09 15:45:31 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [PLANE:65:plane-5] commit wait timed out
Aug 09 15:45:41 simula kernel: amdgpu 0000:0c:00.0: [drm] *ERROR* [CRTC:77:crtc-0] flip_done timed out

Bisected to agd5f/linux@4c763180.

I tried amdgpu.async_gfx_ring=0 on the kernel command line as a workaround, but that resulted in the amdgpu driver failing to initialize successfully at all.

Hardware description:

  • GPU: AMD Navi 21 GL-XL [Radeon PRO W6800]

System information:

  • Kernel version: 5.19.0+
  • Custom kernel: drm-next-2022-08-03 tag merged on top of linux-5.19.y branch
  • Mesa 22.2.0-rc1 or newer

How to reproduce the issue:

  1. Boot kernel which has the bisected commit
  2. Log into GNOME Wayland session
  3. Run Steam in Big Picture mode

The GFX hang occurs within a few seconds of the Steam Big Picture mode starting up.

Edited Aug 10, 2022 by Michel Dänzer
Assignee
Assign to
Time tracking