(Recoverable) GPU hangs with GfxBench Manhattan GL tests
Submitted by Eero Tamminen
Assigned to Default DRI bug account
Link to original bug (#108898)
Description
Setup:
- FullHD monitor (through HDMI KVM)
- HadesCanyon KBL i7-8809G ([AMD/ATI] Vega [Radeon RX Vega M] (rev c0))
- Ubuntu 18.04
- drm-tip git kernel v4.20-rc4 (i.e. kernel.org v4.20-rc4 kernel + latest drm code from yesterday)
- Mesa git (c120dbfe) with AMD VEGAM renderer
- X server git version
- Proprietary GfxBench v5, but the public GfxBench v4 should have the same tests: http://gfxbench.com
Test-cases:
- Manhattan 3.0 offscreen: bin/testfw_app --gfx glfw --gl_api desktop_core --width 1920 --height 1080 --fullscreen 1 --test_id gl_manhattan_off
- Manhattan 3.1 onscreen: bin/testfw_app --gfx glfw --gl_api desktop_core --width 1920 --height 1080 --fullscreen 1 --test_id gl_manhattan31
Expected outcome:
- No GPU timeouts
Actual outcome:
- 1 out of 3 runs logs the following in dmesg: [ 2817.689624] [drm:drm_sched_job_timedout [gpu_sched]] ERROR ring gfx timeout, but soft recovered
NOTE: These hangs were already happening when we started testing this machine in mid-October, with Mesa 18cc65ed and drm-tip kernel v4.19-rc8.
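Since the timeout shows up in only some runs, it may help to count the hits in dmesg automatically after each run. Below is a minimal sketch, not part of the original test setup. It assumes the kernel message keeps the exact format quoted above; the names TIMEOUT_RE and find_gpu_timeouts are made up for illustration.

```python
import re

# Pattern modelled on the single dmesg line quoted in this report:
# [ 2817.689624] [drm:drm_sched_job_timedout [gpu_sched]] ERROR ring gfx timeout, but soft recovered
# Assumption: other kernel versions may word this message differently.
TIMEOUT_RE = re.compile(
    r"^\[\s*(?P<ts>\d+\.\d+)\]\s+"
    r"\[drm:drm_sched_job_timedout \[gpu_sched\]\] "
    r".*ring (?P<ring>\S+) timeout(?P<soft>, but soft recovered)?"
)


def find_gpu_timeouts(dmesg_text):
    """Return a (timestamp, ring, soft_recovered) tuple per timeout line."""
    hits = []
    for line in dmesg_text.splitlines():
        match = TIMEOUT_RE.search(line)
        if match:
            hits.append((
                float(match.group("ts")),           # seconds since boot
                match.group("ring"),                # e.g. "gfx"
                match.group("soft") is not None,    # True if soft recovered
            ))
    return hits
```

To use it, feed it the output of dmesg captured after each benchmark run (reading dmesg may need root, depending on the kernel.dmesg_restrict setting) and compare the timestamps against the previous run to see whether a new timeout was logged.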
Version: git