GPU crash on VegaM (amdgpu: The CS has been rejected)
Submitted by Rémi Verschelde
Assigned to Default DRI bug account
Link to original bug (#111921)
Description
Created attachment 145680
dmesg output after GPU crash with "amdgpu: The CS has been rejected"
Might be related to bug 111860.
In my case, the GPU crashes or fails to resume when I use the Godot Engine FOSS application: https://github.com/godotengine/godot
The application works fine for a time, but eventually it will freeze and this gets printed to the terminal:
amdgpu: The CS has been rejected, see dmesg for more information (-2).
amdgpu: The CS has been rejected, see dmesg for more information (-19).
(attaching dmesg)
At this point, I have to kill the application, and reboot if I want to use the GPU again.
This seems to happen mainly when alt-tabbing between Godot and the desktop or terminal (both of which run on the Intel HD 630 IGP), so it might be an issue with context switching?
I don't have precise steps to reproduce yet apart from using Godot (debug build from git master branch) and other applications in parallel, to eventually see it crash within 5-10 min.
I think the bug started to happen when I upgraded to kernel 5.2.x (now running 5.3.2, still having the bug). That's what bug 111860 claims too, so I'll attempt running 5.1.20 for a while to see if the bug still happens.
System info:
$ inxi
CPU: Quad Core Intel Core i7-8705G (-MT MCP-) speed/min/max: 1347/800/4100 MHz Kernel: 5.3.2-desktop-1.mga7 x86_64 Up: 2h 44m
Mem: 3451.1/15767.7 MiB (21.9%) Storage: 953.87 GiB (58.3% used) Procs: 241 Shell: bash 4.4.23 inxi: 3.0.33
$ inxi -G
Graphics: Device-1: Intel HD Graphics 630 driver: i915 v: kernel
Device-2: Advanced Micro Devices [AMD/ATI] Polaris 22 XL [Radeon RX Vega M GL] driver: amdgpu v: kernel
Display: x11 server: Mageia X.org 1.20.4 driver: amdgpu,intel FAILED: ati unloaded: fbdev,modesetting,vesa tty: N/A
OpenGL: renderer: Mesa DRI Intel HD Graphics 630 (Kaby Lake GT2) v: 4.5 Mesa 19.1.7
Attachment 145680, "dmesg output after GPU crash with "amdgpu: The CS has been rejected"":
dmesg-amdgpu-cs-rejected.log