amdgpu no-retry page fault under 5.19
Aug 02 15:31:59 w7700 kernel: amdgpu 0000:03:00.0: amdgpu: [mmhub0] no-retry page fault (src_id:0 ring:40 vmid:1 pasid:32773, for process teams pid 3495 thread teams:cs0 pid 3544)
Aug 02 15:31:59 w7700 kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800102a24000 from IH client 0x12 (VMC)
Aug 02 15:31:59 w7700 kernel: amdgpu 0000:03:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00140050
Aug 02 15:31:59 w7700 kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: MP1 (0x0)
Aug 02 15:31:59 w7700 kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
Aug 02 15:31:59 w7700 kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
Aug 02 15:31:59 w7700 kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x5
Aug 02 15:31:59 w7700 kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
Aug 02 15:31:59 w7700 kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x1
Brief summary of the problem:
After upgrading to 5.19 (locally built from the Ubuntu mainline https://git.launchpad.net/~ubuntu-kernel-test/ubuntu/+source/linux/+git/mainline-crack) I experienced two lockups where the display was unresponsive and the machine could not be cleanly shut down, even with SysRq keys. I could change VT but could not login and I had to force power off both times.
Hardware description:
- CPU: AMD Ryzen 7 PRO 4750U with Radeon Graphics
- GPU: 03:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Renoir [1002:1636] (rev d1)
- System Memory: 32 GB
- Display(s): laptop panel and ViewSonic 24in monitor
- Type of Display Connection: eDP and DP
System information:
- Distro name and Version: Ubuntu 22.04.1 LTS (with a custom kernel)
- Kernel version: 5.19.0
- Custom kernel: Ubuntu mainline-crack
- AMD official driver version: ?
How to reproduce the issue:
- Install kernel 5.19
- Wait. (the first time, the screen was locked and when I came back, nothing worked. The second time, I was on a teams call and everything locked up. Audio continued to work until the end of the call so about half an hour, then I rebooted)
Attached files:
Log files (for system lockups / game freezes / crashes)
- Dmesg log (full log)
- Xorg log
- Any other log