amdgpu kernel page fault: [drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
Brief summary of the problem:
Reproducible hard crash on Arch Linux (frozen, flowery colorful squares) upon loading a certain mod on Tabletop Simulator.
Hardware description:
- CPU: i7 6700k
- GPU: Vega 56
- System Memory: 32G
- Display(s): 1
- Type of Diplay Connection: DP
System infomration:
- Distro name and Version: Arch
- Kernel version: Linux 5.7.10-zen1-1-zen
- AMD package version: Mesa/vulkan-radeon 20.1.4-1
How to reproduce the issue:
Launch Tabletop Simulator with Proton, try to load https://steamcommunity.com/sharedfiles/filedetails/?id=918727568.
Attached files:
dmesg:
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700070000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00600C31
amdgpu 0000:03:00.0: MORE_FAULTS: 0x1
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x3
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700071000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
amdgpu 0000:03:00.0: MORE_FAULTS: 0x0
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x0
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700072000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
amdgpu 0000:03:00.0: MORE_FAULTS: 0x0
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x0
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700073000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
amdgpu 0000:03:00.0: MORE_FAULTS: 0x0
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x0
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700074000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
amdgpu 0000:03:00.0: MORE_FAULTS: 0x0
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x0
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700075000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
amdgpu 0000:03:00.0: MORE_FAULTS: 0x0
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x0
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700076000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
amdgpu 0000:03:00.0: MORE_FAULTS: 0x0
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x0
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700075000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
amdgpu 0000:03:00.0: MORE_FAULTS: 0x0
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x0
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700076000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
amdgpu 0000:03:00.0: MORE_FAULTS: 0x0
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x0
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700077000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
amdgpu 0000:03:00.0: MORE_FAULTS: 0x0
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x0
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700070000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x006012B1
amdgpu 0000:03:00.0: MORE_FAULTS: 0x1
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0xb
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
amdgpu 0000:03:00.0: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:6 pasid:32782, for process Tabletop Si>Jul 26 15:49:36 machina kernel: amdgpu 0000:03:00.0: in page starting at address 0x0000000700077000 from client 27
amdgpu 0000:03:00.0: VM_L2_PROTECTION_FAULT_STATUS:0x00000000
amdgpu 0000:03:00.0: MORE_FAULTS: 0x0
amdgpu 0000:03:00.0: WALKER_ERROR: 0x0
amdgpu 0000:03:00.0: PERMISSION_FAULTS: 0x0
amdgpu 0000:03:00.0: MAPPING_ERROR: 0x0
amdgpu 0000:03:00.0: RW: 0x0
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
[drm:amdgpu_vm_handle_fault [amdgpu]] *ERROR* Can't handle page fault (-12)
With the last few lines repeated a couple thousand times.