[drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v7_0> failed -110
Brief summary of the problem:
The GPU fails to reset after a hang (which obviously shouldn't happen in the first place, but that's a different issue). First the screen freezes, and after 10-20 seconds the monitors turn off because the signal is lost. After a hard reset, I found this in the journal:
Jun 13 07:13:41 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=7776150, emitted seq=7776152
Jun 13 07:13:41 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process cinnamon pid 1188 thread cinnamon:cs0 pid 1193
Jun 13 07:13:41 kernel: amdgpu 0000:01:00.0: amdgpu: GPU reset begin!
Jun 13 07:13:45 kernel: amdgpu 0000:01:00.0: amdgpu: failed to suspend display audio
Jun 13 07:13:45 kernel: amdgpu: VI should always have 2 performance levels
Jun 13 07:13:46 kernel: amdgpu 0000:01:00.0: amdgpu: BACO reset
Jun 13 07:13:46 kernel: amdgpu 0000:01:00.0: amdgpu: GPU reset succeeded, trying to resume
Jun 13 07:13:46 kernel: [drm] PCIE gen 3 link speeds already enabled
Jun 13 07:13:46 kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000F400000000).
Jun 13 07:13:46 kernel: [drm] VRAM is lost due to GPU reset!
Jun 13 07:13:46 kernel: amdgpu 0000:01:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring gfx test failed (-110)
Jun 13 07:13:46 kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v7_0> failed -110
Jun 13 07:13:46 kernel: amdgpu 0000:01:00.0: amdgpu: GPU reset(2) failed
Jun 13 07:13:46 kernel: SW scheduler is used
Jun 13 07:13:46 kernel: amdgpu 0000:01:00.0: amdgpu: GPU reset end with ret = -110
Jun 13 07:13:56 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Jun 13 07:14:06 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Jun 13 07:14:24 systemd[1]: man-db.service: Succeeded.
Jun 13 07:14:24 systemd[1]: Finished Daily man-db regeneration.
Jun 13 07:14:49 systemd-logind[475]: Power key pressed.
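For anyone comparing similar logs: the -110 at the end of the failing lines is a negated Linux errno, and 110 is ETIMEDOUT on Linux, so the gfx ring test timed out after the BACO reset. Below is a rough sketch of how the failing steps could be pulled out of a journal excerpt like the one above. The helper name `summarize` and the hardcoded errno table are my own for illustration and are not part of any amdgpu tooling.

```python
import re

# Linux errno value seen in the log above; 110 is ETIMEDOUT on Linux.
# Hardcoded (assumption: Linux numbering) so the result does not depend on the host OS.
LINUX_ERRNO = {110: "ETIMEDOUT"}

# Matches a negative return code at the end of a message,
# e.g. "failed -110", "test failed (-110)", "ret = -110".
CODE_RE = re.compile(r"\(?-(\d+)\)?\s*$")

def summarize(lines):
    """Return (timestamp, errno_name, message) for kernel lines ending in a negative code."""
    out = []
    for line in lines:
        stamp, sep, msg = line.partition(" kernel: ")
        if not sep:
            continue  # not a kernel message (e.g. systemd lines)
        m = CODE_RE.search(msg)
        if not m:
            continue  # kernel message without a trailing error code
        code = int(m.group(1))
        out.append((stamp, LINUX_ERRNO.get(code, f"errno {code}"), msg))
    return out

# A few lines copied from the journal excerpt above.
sample = [
    "Jun 13 07:13:46 kernel: amdgpu 0000:01:00.0: amdgpu: GPU reset succeeded, trying to resume",
    "Jun 13 07:13:46 kernel: amdgpu 0000:01:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring gfx test failed (-110)",
    "Jun 13 07:13:46 kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <gfx_v7_0> failed -110",
    "Jun 13 07:13:46 kernel: amdgpu 0000:01:00.0: amdgpu: GPU reset end with ret = -110",
    "Jun 13 07:14:24 systemd[1]: man-db.service: Succeeded.",
]

for stamp, name, msg in summarize(sample):
    print(stamp, name, "|", msg)
```

On the sample this keeps only the ring test, the gfx_v7_0 resume and the final "reset end" lines, which is the part of the sequence where the recovery actually fails.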
Hardware description:
- CPU: Intel Core i5-3450
- GPU: PowerColor PCS+ Radeon R9 390
- System Memory: 16GiB
- Display(s): HP x27i (1440p) + Asus VS248H (1080p)
- Type of Display Connection: HDMI (HP) + DVI (Asus)
System information:
- Distro name and Version: Manjaro
- Kernel version: 5.12.8-1-MANJARO
- Custom kernel: No
- AMD package version: No package
- Mesa version: 21.1.2
How to reproduce the issue:
Hard to say. Sometimes the GPU hangs a few minutes (2-15) after resume or boot. After hard-resetting the machine, I find either #1326 (closed) or this problem in the logs.