Due to an influx of spam, we have had to impose restrictions on new accounts. Please see this wiki page for instructions on how to get full permissions. Sorry for the inconvenience.
Admin message
Our infrastructure migration is complete. Please remember to update your SSH remote to point to ssh.gitlab.freedesktop.org; SSH to the old hostname will time out. You should not see any problems apart from that. Please let us know if you do have any other issues.
Tried running Unreal Engine 4 demos from https://wiki.unrealengine.com/Linux_Demos
'Mobile Temple Demo' started, worked bad (image was too dark, possible to see almost nothing), but did not crash anything.
Next tried 'Effects Cave Demo': in a few seconds after first start of demo X server became not responding (not even possible to switch with ctrl+alt+f1), it was flickering between grey fill and normal desktop (seems kernel tried to restart GPU), then system became fully unresponsive: ssh did not work, power button did not send system to suspend. Attached kernel messages saved by syslog.
I never had GPU related stability problems with 3.12.x-3.14.x kernels before: several opengl 3d games worked fine, opengl based mpv video output worked fine, glxgears and stellarium worked fine too.
Did unreal engine try to use some non implemented or unstable features? Why kernel allowed to crash itself instead of killing userspace app?
I tried the Effects Cave Demo on a Radeon HD 6950: After it loaded the sound played but the screen was going into standby and the system wasn't even reacting to MagSysRq keys. Would love to give logs but they where wiped with the system reset.
OpenGL renderer string: Gallium 0.4 on AMD CAYMAN
OpenGL core profile version string: 3.3 (Core Profile) Mesa 10.3.0-devel (git-3c77d2a)
I can also reproduce this with the effects demo on this card:
[AMD/ATI] Redwood XT [Radeon HD 5670/5690/5730]
I was able to ssh into the machine and kill the application, which restored the display on the X server.
The following output was seen repeating in the terminal output of the application:
EE r600_shader.c:157 r600_pipe_shader_create - translation from TGSI failed !
EE r600_state_common.c:751 r600_shader_select - Failed to build shader variant (type=1) -1
as well as GPU lockups in the kernel like this:
[3114375.931718] radeon 0000:01:00.0: ring 0 stalled for more than 10086msec
[3114375.931722] radeon 0000:01:00.0: ring 0 stalled for more than 10086msec
[3114375.931728] radeon 0000:01:00.0: GPU lockup (waiting for 0x0000000004e7c894 last fence id 0x0000000004e7c891 on ring 0)
[3114375.931735] radeon 0000:01:00.0: GPU lockup (waiting for 0x0000000004e7c894 last fence id 0x0000000004e7c891 on ring 0)
[3114375.931816] radeon 0000:01:00.0: failed to get a new IB (-35)
[3114375.931822] [drm:radeon_cs_ib_fill] ERROR Failed to get ib !
[3114375.931831] radeon 0000:01:00.0: failed to get a new IB (-35)
[3114375.931837] [drm:radeon_cs_ib_fill] ERROR Failed to get ib !
[3114375.943817] radeon 0000:01:00.0: Saved 1623 dwords of commands on ring 0.
[3114375.943831] radeon 0000:01:00.0: GPU softreset: 0x00000009
[3114375.943833] radeon 0000:01:00.0: GRBM_STATUS = 0xF0001828
[3114375.943834] radeon 0000:01:00.0: GRBM_STATUS_SE0 = 0x80000003
[3114375.943836] radeon 0000:01:00.0: GRBM_STATUS_SE1 = 0x00000007
[3114375.943838] radeon 0000:01:00.0: SRBM_STATUS = 0x20000AC0
[3114375.943840] radeon 0000:01:00.0: SRBM_STATUS2 = 0x00000000
[3114375.943841] radeon 0000:01:00.0: R_008674_CP_STALLED_STAT1 = 0x00000000
[3114375.943843] radeon 0000:01:00.0: R_008678_CP_STALLED_STAT2 = 0x40040000
[3114375.943845] radeon 0000:01:00.0: R_00867C_CP_BUSY_STAT = 0x00048006
[3114375.943847] radeon 0000:01:00.0: R_008680_CP_STAT = 0x80268647
[3114375.943848] radeon 0000:01:00.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
[3114375.944771] radeon 0000:01:00.0: GRBM_SOFT_RESET=0x00007F6B
[3114375.944824] radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00000100
[3114375.945982] radeon 0000:01:00.0: GRBM_STATUS = 0x00003828
[3114375.945984] radeon 0000:01:00.0: GRBM_STATUS_SE0 = 0x00000007
[3114375.945986] radeon 0000:01:00.0: GRBM_STATUS_SE1 = 0x00000007
[3114375.945987] radeon 0000:01:00.0: SRBM_STATUS = 0x200000C0
[3114375.945989] radeon 0000:01:00.0: SRBM_STATUS2 = 0x00000000
[3114375.945991] radeon 0000:01:00.0: R_008674_CP_STALLED_STAT1 = 0x00000000
[3114375.945992] radeon 0000:01:00.0: R_008678_CP_STALLED_STAT2 = 0x00000000
[3114375.945994] radeon 0000:01:00.0: R_00867C_CP_BUSY_STAT = 0x00000000
[3114375.945996] radeon 0000:01:00.0: R_008680_CP_STAT = 0x00000000
[3114375.945998] radeon 0000:01:00.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
[3114375.946012] radeon 0000:01:00.0: GPU reset succeeded, trying to resume
[3114376.017053] [drm] PCIE gen 2 link speeds already enabled
[3114376.019321] [drm] PCIE GART of 1024M enabled (table at 0x000000000025D000).
[3114376.019441] radeon 0000:01:00.0: WB enabled
[3114376.019443] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000040000c00 and cpu addr 0xffff8800b9bafc00
[3114376.019444] radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000040000c0c and cpu addr 0xffff8800b9bafc0c
[3114376.020040] radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 0x000000000005c418 and cpu addr 0xffffc9001249c418
[3114376.036607] [drm] ring test on 0 succeeded in 1 usecs
[3114376.036665] [drm] ring test on 3 succeeded in 1 usecs
[3114376.213817] [drm] ring test on 5 succeeded in 1 usecs
[3114376.213821] [drm] UVD initialized successfully.
This was using kernel 3.16.1 and Mesa 10.4.0-devel from today (g45b104e).
However - there is some severe and very annoying stuttering. Which I can
also replicate on games using that engine...
Can someone else confirm?
Yes, it's because of shader recompiles. This is being addressed and will hopefully be fixed before too long. It's not related to this bug report though.
However - there is some severe and very annoying stuttering. Which I can
also replicate on games using that engine...
Can someone else confirm?
Yes, it's because of shader recompiles. This is being addressed and will
hopefully be fixed before too long. It's not related to this bug report
though.
Thanks for the info. By adressed you mean in the engine or in the driver?
Also is there a bug report about this / can I track this somewhere?
@reporter:
Please check again, this bug can probably be closed...
This issue hasn't had any activity since 2019-11-19. The AMD driver stack changes rapidly and contains lots of shared code across products so it's possible that it has already been fixed. Please upgrade to a current stable kernel and userspace stack and try again. If you still experience this issue with the latest driver stack, please capture relevant logging and open a new issue referring back to this one.