Any OpenCL application causes "*ERROR* ring gfx timeout" on Vega 64
Submitted by Alexander Mezin
Assigned to Default DRI bug account
Link to original bug (#110637)
Description
Created attachment 144191 kernel log
Open LibreOffice, enable OpenCL in settings, restart it. Result:
[drm:amdgpu_job_timedout [amdgpu]] ERROR ring gfx timeout, signaled seq=698, emitted seq=700 [drm:amdgpu_job_timedout [amdgpu]] ERROR Process information: process soffice.bin pid 2517 thread soffice.bi:cs0 pid 2545 amdgpu 0000:67:00.0: GPU reset begin! amdgpu 0000:67:00.0: GPU BACO reset amdgpu: [powerplay] Failed message: 0x5, input parameter: 0x2000000, error code: 0xffffffff amdgpu 0000:67:00.0: GPU reset succeeded, trying to resume [drm] PCIE GART of 512M enabled (table at 0x000000F400900000). [drm:amdgpu_device_gpu_recover [amdgpu]] ERROR VRAM is lost! [drm] PSP is resuming... [drm] reserve 0x400000 from 0xf400d00000 for PSP TMR SIZE [drm] UVD and UVD ENC initialized successfully. [drm] VCE initialized successfully. [drm] recover vram bo from shadow start [drm] recover vram bo from shadow done [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! amdgpu 0000:67:00.0: GPU reset(2) succeeded! [drm:amdgpu_cs_ioctl [amdgpu]] ERROR Failed to initialize parser -125! [drm:amdgpu_cs_ioctl [amdgpu]] ERROR Failed to initialize parser -125! [drm:amdgpu_cs_ioctl [amdgpu]] ERROR Failed to initialize parser -125! [drm:amdgpu_cs_ioctl [amdgpu]] ERROR Failed to initialize parser -125! [drm:amdgpu_cs_ioctl [amdgpu]] ERROR Failed to initialize parser -125! [drm:amdgpu_cs_ioctl [amdgpu]] ERROR Failed to initialize parser -125! ...
Also the same problem with multiple games, so probably not OpenCL-related, just the easiest way to trigger it.
linux 5.1.arch1-1 (same results with 5.0.13, will also retest with 4.9) linux-firmware 20190502.92e17d0-1 (same results with 20190424.4b6cf2b-1) opencl-mesa 19.0.3-1 libdrm 2.4.98-1 libreoffice-fresh 6.2.3-2
GNOME on X.org with modesetting driver
Sapphire Vega 64 Nitro+, no overclocking
Attachment 144191, "kernel log":
amdgpu-fail2.log