Ryzen 7 6800H / Radeon RX680M: X crash during display off.
Hardware: Lenovo Legion 5 Pro 16ARH7H, Ryzen 7 6800H / Radeon RX680M using the amdgpu driver, RTX3060M bound to vfio-pci; no VM was running at the time of the failure.
Brief summary of the problem:
The X session exited, resulting in a complete logout. After logging back in, audio was dead, which is probably unrelated.
Power settings keep the laptop from changing power state, but the displays are turned off after three minutes. The crash happened while the displays were off. I saw the laptop displays wake up and reinitialize.
This did not happen before, so it's either a change in the driver, a bug in Plasma, or a hardware problem. I did recently update both the kernel and Gentoo.
Versions:
- kde-frameworks/plasma-5.106.0
- x11-drivers/xf86-video-amdgpu-22.0.0
- x11-base/xorg-drivers-21.1-r2
- x11-base/xorg-server-21.1.8
Here is the dmesg output for the amdgpu driver during a normal bring-up, captured after rebooting following the failure:
[ 4.092635] [drm] amdgpu kernel modesetting enabled.
[ 4.093954] amdgpu: Virtual CRAT table created for CPU
[ 4.093960] amdgpu: Topology: Add CPU node
[ 4.094047] amdgpu 0000:35:00.0: enabling device (0006 -> 0007)
[ 4.094090] [drm] initializing kernel modesetting (YELLOW_CARP 0x1002:0x1681 0x17AA:0x3B07 0xC8).
[ 4.094138] [drm] register mmio base: 0xB9800000
[ 4.094138] [drm] register mmio size: 524288
[ 4.095416] [drm] add ip block number 0 <nv_common>
[ 4.095417] [drm] add ip block number 1 <gmc_v10_0>
[ 4.095418] [drm] add ip block number 2 <navi10_ih>
[ 4.095418] [drm] add ip block number 3 <psp>
[ 4.095419] [drm] add ip block number 4 <smu>
[ 4.095420] [drm] add ip block number 5 <dm>
[ 4.095420] [drm] add ip block number 6 <gfx_v10_0>
[ 4.095421] [drm] add ip block number 7 <sdma_v5_2>
[ 4.095422] [drm] add ip block number 8 <vcn_v3_0>
[ 4.095422] [drm] add ip block number 9 <jpeg_v3_0>
[ 4.095444] amdgpu 0000:35:00.0: amdgpu: Fetched VBIOS from VFCT
[ 4.095445] amdgpu: ATOM BIOS: 113-REMBRANDT-X37
[ 4.095452] Loading firmware: amdgpu/yellow_carp_toc.bin
[ 4.098867] Loading firmware: amdgpu/yellow_carp_ta.bin
[ 4.099139] Loading firmware: amdgpu/yellow_carp_dmcub.bin
[ 4.099412] Loading firmware: amdgpu/yellow_carp_pfp.bin
[ 4.099619] Loading firmware: amdgpu/yellow_carp_me.bin
[ 4.099890] Loading firmware: amdgpu/yellow_carp_ce.bin
[ 4.100165] Loading firmware: amdgpu/yellow_carp_rlc.bin
[ 4.100325] Loading firmware: amdgpu/yellow_carp_mec.bin
[ 4.100601] Loading firmware: amdgpu/yellow_carp_mec2.bin
[ 4.100788] [drm] VCN(0) decode is enabled in VM mode
[ 4.100789] [drm] VCN(0) encode is enabled in VM mode
[ 4.100790] Loading firmware: amdgpu/yellow_carp_vcn.bin
[ 4.101338] [drm] JPEG decode is enabled in VM mode
[ 4.101339] amdgpu 0000:35:00.0: vgaarb: deactivate vga console
[ 4.101340] amdgpu 0000:35:00.0: amdgpu: Trusted Memory Zone (TMZ) feature disabled as experimental (default)
[ 4.101342] amdgpu 0000:35:00.0: amdgpu: PCIE atomic ops is not supported
[ 4.101370] [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
[ 4.101374] amdgpu 0000:35:00.0: amdgpu: VRAM: 512M 0x000000F400000000 - 0x000000F41FFFFFFF (512M used)
[ 4.101375] amdgpu 0000:35:00.0: amdgpu: GART: 1024M 0x0000000000000000 - 0x000000003FFFFFFF
[ 4.101376] amdgpu 0000:35:00.0: amdgpu: AGP: 267419648M 0x000000F800000000 - 0x0000FFFFFFFFFFFF
[ 4.101380] [drm] Detected VRAM RAM=512M, BAR=512M
[ 4.101381] [drm] RAM width 128bits DDR5
[ 4.101555] [drm] amdgpu: 512M of VRAM memory ready
[ 4.101556] [drm] amdgpu: 15640M of GTT memory ready.
[ 4.101562] [drm] GART: num cpu pages 262144, num gpu pages 262144
[ 4.101897] [drm] PCIE GART of 1024M enabled (table at 0x000000F41FC00000).
[ 4.102082] [drm] Loading DMUB firmware via PSP: version=0x0400002E
[ 4.102680] Loading firmware: amdgpu/yellow_carp_sdma.bin
[ 4.102791] [drm] use_doorbell being set to: [true]
[ 4.102822] [drm] Found VCN firmware Version ENC: 1.23 DEC: 2 VEP: 0 Revision: 5
[ 4.102826] amdgpu 0000:35:00.0: amdgpu: Will use PSP to load VCN firmware
Hardware description:
- CPU: Ryzen 7 6800H
- OTHER GPU: 01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GA106M [GeForce RTX 3060 Mobile / Max-Q] [10de:2560] (rev a1)
- AMD GPU: 35:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Rembrandt [Radeon 680M] [1002:1681] (rev c8)
- System Memory: 32GB DDR5
- Display(s): Laptop display plus external DP LG 27MU58P-B 4K/60Hz Monitor
- Type of Display Connection: DP
System information:
- Distro name and Version: Gentoo 2.13
- Kernel version: Linux lenny 6.3.4-gentoo-r1 #1 SMP Mon May 29 07:59:08 PDT 2023 x86_64 AMD Ryzen 7 6800H with Radeon Graphics AuthenticAMD GNU/Linux
- Custom kernel: It is a full custom kernel.
- AMD official driver version: x11-drivers/xf86-video-amdgpu-22.0.0
How to reproduce the issue:
It is difficult to reproduce this. I just left the laptop sitting for a few hours and it happened. It hasn't happened since.
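Since the failure cannot be triggered on demand, one way to catch a recurrence is to scan saved syslog for the two markers that appear in the crash log: a "ring ... timeout" error and a "GPU reset(N) succeeded" message. The following is only a rough sketch; the function name `summarize` and the regular expressions are my own and are based solely on the message wording seen in this report, so other kernel versions may phrase these lines differently.

```python
import re

# Patterns are assumptions derived from the messages in this report.
TIMEOUT_RE = re.compile(r"ring (\S+) timeout")
RESET_RE = re.compile(r"GPU reset\((\d+)\) succeeded")


def summarize(lines):
    """Return a list of ("timeout", ring_name) and ("reset", count) events."""
    events = []
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            events.append(("timeout", match.group(1)))
            continue
        match = RESET_RE.search(line)
        if match:
            events.append(("reset", int(match.group(1))))
    return events


if __name__ == "__main__":
    import sys

    # Usage: python scan.py < saved_syslog.txt
    for kind, value in summarize(sys.stdin):
        print(kind, value)
```

Feeding it the syslog excerpt attached to this report should list the sdma0 and gfx_0.0.0 timeouts together with the two completed resets.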
Attached files:
syslog output of actual crash:
Jun 8 07:09:40 lenny kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=444930, emitted seq=444932
Jun 8 07:09:40 lenny kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0
Jun 8 07:09:40 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GPU reset begin!
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: MODE2 reset
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GPU reset succeeded, trying to resume
Jun 8 07:09:41 lenny kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000F41FC00000).
Jun 8 07:09:41 lenny kernel: [drm] PSP is resuming...
Jun 8 07:09:41 lenny kernel: [drm] reserve 0xa00000 from 0xf41e000000 for PSP TMR
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: RAS: optional ras ta ucode is not available
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: RAP: optional rap ta ucode is not available
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: SMU is resuming...
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: SMU is resumed successfully!
Jun 8 07:09:41 lenny kernel: [drm] DMUB hardware initialized: version=0x0400002E
Jun 8 07:09:41 lenny kernel: [drm] kiq ring mec 2 pipe 1 q 0
Jun 8 07:09:41 lenny kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
Jun 8 07:09:41 lenny kernel: [drm] JPEG decode initialized successfully.
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: recover vram bo from shadow start
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: recover vram bo from shadow done
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GPU reset(1) succeeded!
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103002000 from client 0x1b (UTCL2)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00640051
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x5
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103003000 from client 0x1b (UTCL2)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00640051
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x5
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103033000 from client 0x1b (UTCL2)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00640051
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x5
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103034000 from client 0x1b (UTCL2)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00640051
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x5
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103031000 from client 0x1b (UTCL2)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00640051
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x5
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103032000 from client 0x1b (UTCL2)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103065000 from client 0x1b (UTCL2)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00640051
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x5
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103001000 from client 0x1b (UTCL2)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00640051
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x5
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103000000 from client 0x1b (UTCL2)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00640051
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x5
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103062000 from client 0x1b (UTCL2)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00640051
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x1
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x5
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:41 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x1
Jun 8 07:09:53 lenny kernel: gmc_v10_0_process_interrupt: 1930 callbacks suppressed
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103009000 from client 0x1b (UTCL2)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00641051
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: TCP (0x8)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x1
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x5
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x1
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103008000 from client 0x1b (UTCL2)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103009000 from client 0x1b (UTCL2)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:6 pasid:32769, for process Xorg pid 2131 thread Xorg:cs0 pid 2488)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: in page starting at address 0x0000800103008000 from client 0x1b (UTCL2)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 Faulty UTCL2 client ID: CB/DB (0x0)
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MORE_FAULTS: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 WALKER_ERROR: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 PERMISSION_FAULTS: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 MAPPING_ERROR: 0x0
Jun 8 07:09:53 lenny kernel: amdgpu 0000:35:00.0: amdgpu: \x09 RW: 0x0
Jun 8 07:09:53 lenny kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
Jun 8 07:10:03 lenny kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=23929877, emitted seq=23929880
Jun 8 07:10:03 lenny kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 2131 thread Xorg:cs0 pid 2488
Jun 8 07:10:03 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GPU reset begin!
Jun 8 07:10:03 lenny kernel: amdgpu 0000:35:00.0: amdgpu: MODE2 reset
Jun 8 07:10:03 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GPU reset succeeded, trying to resume
Jun 8 07:10:03 lenny kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000F41FC00000).
Jun 8 07:10:03 lenny kernel: [drm] PSP is resuming...
Jun 8 07:10:03 lenny kernel: [drm] reserve 0xa00000 from 0xf41e000000 for PSP TMR
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: RAS: optional ras ta ucode is not available
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: RAP: optional rap ta ucode is not available
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: SMU is resuming...
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: SMU is resumed successfully!
Jun 8 07:10:04 lenny kernel: [drm] DMUB hardware initialized: version=0x0400002E
Jun 8 07:10:04 lenny kernel: [drm] kiq ring mec 2 pipe 1 q 0
Jun 8 07:10:04 lenny kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
Jun 8 07:10:04 lenny kernel: [drm] JPEG decode initialized successfully.
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 1
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: recover vram bo from shadow start
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: recover vram bo from shadow done
Jun 8 07:10:04 lenny kernel: amdgpu 0000:35:00.0: amdgpu: GPU reset(4) succeeded!
Jun 8 07:10:04 lenny kernel: [drm] Skip scheduling IBs!
Jun 8 07:10:04 lenny kernel: [drm] Skip scheduling IBs!
Jun 8 07:10:04 lenny kernel: [drm] Skip scheduling IBs!
Jun 8 07:10:04 lenny kernel: [drm] Skip scheduling IBs!
Jun 8 07:10:04 lenny kernel: [drm] Skip scheduling IBs!
Jun 8 07:10:04 lenny kernel: [drm] Skip scheduling IBs!
Jun 8 07:10:04 lenny kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
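For anyone comparing fault entries, the raw GCVM_L2_PROTECTION_FAULT_STATUS values can be split into the same fields the driver prints beneath them. This is a minimal sketch: the bit positions are an assumption inferred from the values in this log (0x00640051 and 0x00641051 decode to exactly the fields logged next to them), not taken from register documentation, and the name `decode_fault_status` is mine.

```python
# Assumed layout: field name -> (bit offset, bit width).
# Inferred from the logged values in this report; treat as unverified.
FIELDS = {
    "MORE_FAULTS": (0, 1),
    "WALKER_ERROR": (1, 3),
    "PERMISSION_FAULTS": (4, 4),
    "MAPPING_ERROR": (8, 1),
    "CID": (9, 9),  # printed by the driver as "Faulty UTCL2 client ID"
    "RW": (18, 1),
}


def decode_fault_status(status):
    """Split a raw status value into the fields shown in the log."""
    return {
        name: (status >> shift) & ((1 << width) - 1)
        for name, (shift, width) in FIELDS.items()
    }


if __name__ == "__main__":
    for raw in (0x00640051, 0x00641051, 0x00000000):
        print(hex(raw), decode_fault_status(raw))
```

Under this layout, the only difference between 0x00640051 and 0x00641051 is the client ID (0x0, CB/DB, versus 0x8, TCP), which matches the log.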
Let me know if more information is needed.