Window system hang due to GPU Fault
Submitted by hjp..@..il.com
Assigned to Default DRI bug account
Link to original bug (#105819)
Description
Created attachment 138449
dmesg output with the GPU fault.
When displaying a PNG image with ImageMagick's display command, the system quite often hangs.
The screen is no longer updated, and X cannot be terminated.
In dmesg I got this error:
[ 1338.134608] amdgpu 0000:20:00.0: GPU fault detected: 147 0x03684402
[ 1338.134611] amdgpu 0000:20:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0050306D
[ 1338.134614] amdgpu 0000:20:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0C044002
[ 1338.134618] amdgpu 0000:20:00.0: VM fault (0x02, vmid 6, pasid 32773) at page 5255277, read from 'TC1' (0x54433100) (68)
[ 1348.576412] [drm:amdgpu_job_timedout [amdgpu]] ERROR ring gfx timeout, last signaled seq=20984, last emitted seq=20986
[ 1348.576423] [drm] IP block:gfx_v8_0 is hung!
[ 1348.576472] [drm] GPU recovery disabled.
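For anyone triaging this, the register values in the log can be cross-checked against the human-readable "VM fault" line with a few lines of Python. This is only a sketch and not part of the original report: the bit positions in decode_status are an assumption, chosen because they reproduce the values the kernel printed (0x02, vmid 6, read, 68). The 4 KiB page size used for the byte address is also an assumption. The helper names are made up for illustration.

```python
# Sketch: cross-check the raw register values from the fault log.
# ASSUMPTION: the field positions below are inferred from the log output,
# not taken from hardware documentation.

def decode_client(tag: int) -> str:
    """Read a 32-bit client tag as big-endian ASCII, dropping NUL padding."""
    return tag.to_bytes(4, "big").rstrip(b"\x00").decode("ascii")


def decode_status(status: int) -> dict:
    """Split the fault status word into the fields shown in the VM fault line."""
    return {
        "protections": status & 0xFF,          # low byte (assumed)
        "mc_id": (status >> 12) & 0xFF,        # memory client id (assumed)
        "is_write": bool((status >> 24) & 1),  # 0 = read, 1 = write (assumed)
        "vmid": (status >> 25) & 0xF,          # VM context id (assumed)
    }


fault_addr = 0x0050306D    # VM_CONTEXT1_PROTECTION_FAULT_ADDR from the log
fault_status = 0x0C044002  # VM_CONTEXT1_PROTECTION_FAULT_STATUS from the log

fields = decode_status(fault_status)
client = decode_client(0x54433100)

print(f"page       : {fault_addr}")
print(f"byte addr  : {fault_addr << 12:#x} (assuming 4 KiB pages)")
print(f"client     : {client}")
print(f"status     : {fields}")
```

The decoded fields agree with the kernel's own summary line, which suggests the log is internally consistent and that the fault was a read by the 'TC1' client in VM context 6.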
I could log in using ssh but could not kill X.
I was using the "drm-next-4.17wip" branch from 29/03/2018, but the problem also happens with 4.15.X.
Full dmesg is attached.
Attachment 138449, "dmesg output with the GPU fault.":
dmesg-amdgpu-fault.txt