Display freezes after "gr: TRAP ch 6"
Submitted by Tony Novak
Assigned to Nouveau Project
Created attachment 128783
dmesg showing error
Intermittently (a few times a day), my display will completely freeze and doesn't recover. Although the kernel doesn't hang and I can ssh in, I can't chvt to a non-graphical VT.
Whenever this occurs, I see a message like this in dmesg:
[12535.260195] nouveau 0000:01:00.0: gr: TRAP ch 6 [007f778000 Xwayland]
[12535.260211] nouveau 0000:01:00.0: gr: GPC0/TPC1/MP trap: global 00000000  warp 3d0001 [STACK_ERROR]
[12539.595312] nouveau 0000:01:00.0: fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]
[12539.595318] nouveau 0000:01:00.0: fifo: gr engine fault on channel 5, recovering...
Despite the "recovering..." message it never actually recovers and only a reboot solves the problem.
I'm using GNOME on Wayland, and I'm typically running gnome-terminal, Firefox, and/or Chromium. So far I haven't identified any specific action that triggers this failure.
My computer is a mid-2014 Macbook Pro with a GeForce 750M (GK107). I'm running nouveau with "nouveau.nofbaccel=1". I've tried adding "nouveau.config=NvGrUseFW=1" but it complains about not finding /lib/firmware/nvidia/gk107/fecs_inst.bin. Is there an external firmware blob available for this card?
I recently updated my kernel and several relevant packages, and have seen no difference in behavior. I'm running linux 4.9.0, and the latest version of mesa from git (36b5f1d200).
See my attached dmesg output. Is there any debug flag I can enable to shed more light on the situation?
Attachment 128783, "dmesg showing error":