Amd GPU freeze
Brief summary of the problem:
Sometimes I experience a system freeze.
The journal of the previous log show me:
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: RW: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x00008001025bc000 from IH client 0x1b (UTCL2)
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00101031
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x1
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x3
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: RW: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x00008001025bd000 from IH client 0x1b (UTCL2)
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00101031
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x1
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x3
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: RW: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x000080010259e000 from IH client 0x1b (UTCL2)
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00101031
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x1
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x3
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: RW: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: in page starting at address 0x000080010259f000 from IH client 0x1b (UTCL2)
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00101031
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: MORE_FAULTS: 0x1
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: WALKER_ERROR: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: PERMISSION_FAULTS: 0x3
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: MAPPING_ERROR: 0x0
Nov 13 18:13:45 svd026p15s kernel: amdgpu 0000:04:00.0: amdgpu: RW: 0x0
Nov 13 18:13:50 svd026p15s kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Hardware description:
- CPU: AMD Ryzen 7 PRO 2700U
- GPU: Radeon Vega Mobile Gfx
- System Memory: 16Gb
- Display(s): Laptop display + External 24" monitor
- Type of Display Connection: HDMI
System information:
- Distro name and Version: Linux Mint 19.3 (Ubuntu 18.04)
- Kernel version: 5.15.0 (mainline), 5.15.1 (custom), 5.15.2 (custom)
- AMD official driver version: N/A
How to reproduce the issue:
It happened 3 times after intensive PC usage (multimedia, and development homework).