Due to an influx of spam, we have had to impose restrictions on new accounts. Please see this wiki page for instructions on how to get full permissions. Sorry for the inconvenience.
Admin message
The migration is almost done, at least the rest should happen in the background. There are still a few technical difference between the old cluster and the new ones, and they are summarized in this issue. Please pay attention to the TL:DR at the end of the comment.
I've been having GPU hangs and picture corruption issues recently.
When the GPU hang happens, Sway will freeze and I am no longer able to interact or update my desktop. I have to switch to another TTY to pkill sway and then run until the next hang.
The issues don't seem to happen consistently with opening any particular program or after a certain period of time.
Similar setup with the same issue.
Happens at random times, but I notice it is mostly while using Google Meet over the day to pair with people.
Here the systemd output:
Feb 03 18:05:01 kernel: i915 0000:00:02.0: [drm] Resetting rcs0 for invalid CSB eventFeb 03 18:05:01 kernel: i915 0000:00:02.0: [drm] sway[1484] context reset due to GPU hangFeb 03 18:05:01 kernel: i915 0000:00:02.0: [drm] GPU HANG: ecode 11:1:85d7fffb, in sway [1484]Feb 03 18:05:01 kernel: GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.Feb 03 18:05:01 kernel: Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.Feb 03 18:05:01 kernel: Please see https://gitlab.freedesktop.org/drm/intel/-/wikis/How-to-file-i915-bugs for details.Feb 03 18:05:01 kernel: drm/i915 developers can then reassign to the right component if it's not a kernel issue.Feb 03 18:05:01 kernel: The GPU crash dump is required to analyze GPU hangs, so please always attach it.Feb 03 18:05:01 kernel: GPU crash dump saved to /sys/class/drm/card0/errorFeb 03 18:05:15 kernel: i915 0000:00:02.0: [drm] Resetting rcs0 for preemption time outFeb 03 18:05:15 kernel: i915 0000:00:02.0: [drm] sway[1484] context reset due to GPU hangFeb 03 18:05:15 kernel: i915 0000:00:02.0: [drm] GPU HANG: ecode 11:1:85dfbfff, in sway [1484]
This is a different architecture, Icelake (gen11). The other crashes in this bug report are specific to Haswell gt2 (gen7). That indicates you have a different problem that requires a fresh bug report and include the error state.
@ickle Turning off mitigations seems to have stopped my DE crashing, thanks for that.
Am I right in thinking that this would just be a temporary solution?
Is there any information I can give to help getting this fixed for when mitigations are on?
Same problem on thinkpad T450
Intel i5 5300U with iGPU HD-5500 with kernel 5.10.x and long term kernel 5.4.99
Fedora 33 x86_64
Gnome 3.38.3 wayland and video_2021-02-26_18-25-36
I have the same effects on debian/buster desktop machine on a Asus Z87M-PLUS with Intel(R) Core(TM) i5-4440 CPU
uname -m: x86_64 uname -r: 5.10.0-3-amd64. Unfortunately, I've added my logs to another thread:
Issue 3123