Due to an influx of spam, we have had to impose restrictions on new accounts. Please see this wiki page for instructions on how to get full permissions. Sorry for the inconvenience.
Admin message
The migration is almost done, at least the rest should happen in the background. There are still a few technical difference between the old cluster and the new ones, and they are summarized in this issue. Please pay attention to the TL:DR at the end of the comment.
Project 'drm/intel' was moved to 'drm/i915/kernel'. Please update any links and bookmarks that may still have the old path.
[i915] GPU HANG: ecode 7:1:8edcfc79, in systemd-logind [411]
I'm using Archlinux. After upgrading to linux-5.7.2, the GPU hangs right after Xorg starts. The system becomes unresponsive and visual artifacts are visible (the screen starts blinking randomly).
The issue is not reproducible with linux-lts-5.4.46 (from which I'm currently writing this message). And was also not reproducible with linux-5.6.15, which makes me believe it's something about linux-5.7.x that is not compatible with i915. (??)
Apologies if this has been reported before.
Edited
To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Child items
0
Show closed items
No child items are currently assigned. Use child items to break down this issue into smaller parts.
git bisect start# bad: [cc328351e1e44c4373fabb6f98f0b5d636890225] drm/i915/gt: Wait for the wa batch to be pinnedgit bisect bad cc328351e1e44c4373fabb6f98f0b5d636890225# good: [50148a25f841dc1e2cffe1669d4782dfa75d85d6] drm/i915/tgl: Move and restrict Wa_1408615072git bisect good 50148a25f841dc1e2cffe1669d4782dfa75d85d6# good: [c834d03cca61bf88aa8c2f9524360cce484686ac] drm/i915: Implement display w/a 1140 for glk/cnlgit bisect good c834d03cca61bf88aa8c2f9524360cce484686ac# bad: [26fc4e4ba1605e365482a86ec8f33d91552db271] drm/i915: Always propagate the invocation to i915_schedulegit bisect bad 26fc4e4ba1605e365482a86ec8f33d91552db271# good: [ee2413eeed767d1fb794651e63e45c5e292c8180] drm/i915: Add mechanism to submit a context WA on ring submissiongit bisect good ee2413eeed767d1fb794651e63e45c5e292c8180# bad: [64dc802aa5accfe36d431814ffdf15303b7f7b2b] drm/i915/phys: unconditionally call release_memory_regiongit bisect bad 64dc802aa5accfe36d431814ffdf15303b7f7b2b# bad: [47f8253d2b8947d79fd3196bf96c1959c0f25f20] drm/i915/gen7: Clear all EU/L3 residual contextsgit bisect bad 47f8253d2b8947d79fd3196bf96c1959c0f25f20# first bad commit: [47f8253d2b8947d79fd3196bf96c1959c0f25f20] drm/i915/gen7: Clear all EU/L3 residual contexts
I was getting the same error code in most 3D accelerated programs like Blender or even glxgears. Using drm-tip commit 999bc17a2471df17a3af3001d094cf6d5d4849b0 made the system freeze completely when launching those programs and I was unable to even switch to console to save a GPU crash dump. But recompiling with && 0 seems to have prevented the errors.
I also use an haswell cpu (celeron G1820) and can't start X11 session of KDE Plasma on Arch Linux after upgrade to kernel 5.7.2.
After I enter my password in login manager (SDDM), I only see a black screen with cursor during ~2 minutes, then my system goes back to login screen.
FWIW, I am also getting this issue on a Sandybridge CPU/GPU (Intel i5-2520M, Intel HD 3000). I'm on kernel 5.7.3 on Arch Linux, but I am not able to make the module crash (though it managed to reset once with 5.7.2). It just hangs the Xorg process to become an unkillable process (even with -9).
I also noticed this new release (5.7) has caused frequent artifacting when playing video through VA-API on MPV. Here's a video of the artifacting if that helps. I can reproduce the issue even with the compositor turned off, it just becomes less visible.
Edit: None of these are an issue with kernel 5.6.15
Edit2: I managed to track down the ecode from the journal log. Here's what it showed:
Operating System: Arch Linux
KDE Plasma Version: 5.18.5
KDE Frameworks Version: 5.70.0
Qt Version: 5.15.0
Kernel Version (that does NOT hang): 5.6.15-arch1-1
OS Type: 64-bit
Processors: 2 × Intel® Pentium® CPU G3258 @ 3.20GHz
Memory: 15.3 GiB of RAM
/sys/class/drm/card0/error reports: No error state collected
Here is excerpt (full journal linked also). Note on this install the HANG occurred in sddm-greeter:
Jun 23 17:10:09 archmain sddm[595]: Message received from greeter: ConnectJun 23 17:10:09 archmain sddm-greeter[647]: Loading file:///usr/share/sddm/themes/maldives/Main.qml...Jun 23 17:10:09 archmain sddm-greeter[647]: file:///usr/lib/qt/qml/SddmComponents/LayoutBox.qml:35:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }Jun 23 17:10:09 archmain sddm-greeter[647]: file:///usr/share/sddm/themes/maldives/Main.qml:40:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }Jun 23 17:10:09 archmain sddm-greeter[647]: Adding view for "HDMI1" QRect(0,0 1920x1080)Jun 23 17:10:09 archmain sddm-greeter[647]: Loading file:///usr/share/sddm/themes/maldives/Main.qml...Jun 23 17:10:09 archmain sddm-greeter[647]: file:///usr/lib/qt/qml/SddmComponents/LayoutBox.qml:35:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }Jun 23 17:10:09 archmain sddm-greeter[647]: file:///usr/share/sddm/themes/maldives/Main.qml:40:5: QML Connections: Implicitly defined onFoo properties in Connections are deprecated. Use this syntax instead: function onFoo(<arguments>) { ... }Jun 23 17:10:09 archmain sddm-greeter[647]: Adding view for "HDMI2" QRect(0,0 1920x1080)Jun 23 17:10:15 archmain kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffd, in sddm-greeter [647]Jun 23 17:10:15 archmain kernel: GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.Jun 23 17:10:15 archmain kernel: Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.Jun 23 17:10:15 archmain kernel: Please see https://gitlab.freedesktop.org/drm/intel/-/wikis/How-to-file-i915-bugs for details.Jun 23 17:10:15 archmain kernel: drm/i915 developers can then reassign to the right component if it's not a kernel issue.Jun 23 17:10:15 archmain kernel: The GPU crash dump is required to analyze GPU hangs, so please always attach it.Jun 23 17:10:15 archmain kernel: GPU crash dump saved to /sys/class/drm/card0/errorJun 23 17:10:15 archmain kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jun 23 17:10:15 archmain kernel: i915 0000:00:02.0: sddm-greeter[647] context reset due to GPU hangJun 23 17:10:15 archmain sddm-greeter[647]: Message received from daemon: CapabilitiesJun 23 17:10:15 archmain sddm-greeter[647]: Message received from daemon: HostNameJun 23 17:10:19 archmain dhcpcd[508]: enp2s0: no IPv6 Routers availableJun 23 17:10:21 archmain kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffd, in sddm-greeter [647]Jun 23 17:10:21 archmain kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jun 23 17:10:21 archmain kernel: i915 0000:00:02.0: sddm-greeter[647] context reset due to GPU hangJun 23 17:10:21 archmain sddm[595]: Message received from greeter: PowerOffJun 23 17:10:21 archmain systemd-logind[505]: System is powering down.Jun 23 17:10:21 archmain sddm[595]: Error from greeter session: "Process crashed"Jun 23 17:10:21 archmain sddm[595]: Auth: sddm-helper crashed (exit code 15)Jun 23 17:10:21 archmain sddm[595]: Error from greeter session: "Process crashed"Jun 23 17:10:21 archmain sddm[595]: Auth: sddm-helper exited with 15Jun 23 17:10:21 archmain sddm[595]: Greeter stopped.Jun 23 17:10:21 archmain systemd[1]: Stopping Session c1 of user sddm.
I see that kernel 5.7.6 had some updates for i915, but I still have the GPU hang issue as reported in #2063 (closed)
Jun 25 22:20:18 archless kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffd, in Xorg [551]Jun 25 22:20:18 archless kernel: GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.Jun 25 22:20:18 archless kernel: Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.Jun 25 22:20:18 archless kernel: Please see https://gitlab.freedesktop.org/drm/intel/-/wikis/How-to-file-i915-bugs for details.Jun 25 22:20:18 archless kernel: drm/i915 developers can then reassign to the right component if it's not a kernel issue.Jun 25 22:20:18 archless kernel: The GPU crash dump is required to analyze GPU hangs, so please always attach it.Jun 25 22:20:18 archless kernel: GPU crash dump saved to /sys/class/drm/card0/errorJun 25 22:20:18 archless kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jun 25 22:20:18 archless kernel: i915 0000:00:02.0: Xorg[551] context reset due to GPU hangJun 25 22:20:21 archless kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffd, in kscreenlocker_g [3310]Jun 25 22:20:21 archless kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0
I'm running a full Arch install on an Acer C720 with 4G RAM, & I'm having the same hang as the others. Everything ran perfectly through Kernel 5.6. I included the dmesg with Kernel 5.7.8-arch1-1. I can get more logs if you need them but would need to know what commands to run.dmesg_dump.txt
5.7.10-arch1-1 kernel, the same issue still on G3220
gpu_dump.txt
[ 22.545127] i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in plasmashell [702][ 22.545129] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.[ 22.545129] Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.[ 22.545129] Please see https://gitlab.freedesktop.org/drm/intel/-/wikis/How-to-file-i915-bugs for details.[ 22.545130] drm/i915 developers can then reassign to the right component if it's not a kernel issue.[ 22.545130] The GPU crash dump is required to analyze GPU hangs, so please always attach it.[ 22.545131] GPU crash dump saved to /sys/class/drm/card0/error[ 22.545169] i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0[ 22.648000] i915 0000:00:02.0: plasmashell[702] context reset due to GPU hang[ 29.586964] i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in Xorg [553][ 29.587007] i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0[ 29.688047] i915 0000:00:02.0: Xorg[553] context reset due to GPU hang[ 32.574413] i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in kscreenlocker_g [691][ 32.574453] i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0[ 32.677250] i915 0000:00:02.0: kscreenlocker_g[691] context reset due to GPU hang[ 38.547064] i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffd, in QSGRenderThread [840][ 38.547688] i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0[ 38.650605] i915 0000:00:02.0: QSGRenderThread[840] context reset due to GPU hang[ 44.520284] i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in Xorg [553][ 44.520325] i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0[ 44.623911] i915 0000:00:02.0: Xorg[553] context reset due to GPU hang
Acer C720 with Fedora 32 and Kernel 5.7.10 does have the error when playing back video with mpv and hw accel/vaapi, using 5.6.19 atm since Kernel 5.7.x are unusable with that error because graphics corruption+lags on Desktop upon hang
i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in mpv/vo [2831]GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.Please see https://gitlab.freedesktop.org/drm/intel/-/wikis/How-to-file-i915-bugs for details.drm/i915 developers can then reassign to the right component if it's not a kernel issue.The GPU crash dump is required to analyze GPU hangs, so please always attach it.GPU crash dump saved to /sys/class/drm/card0/errori915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0i915 0000:00:02.0: mpv/vo[2831] context reset due to GPU hangi915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in mpv [2818]i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0i915 0000:00:02.0: mpv[2818] context reset due to GPU hangi915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in mpv/vo [2831]i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0i915 0000:00:02.0: mpv/vo[2831] context reset due to GPU hangi915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffd, in mpv/vo [2831]i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0i915 0000:00:02.0: mpv/vo[2831] context reset due to GPU hangi915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in mpv [2818]i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0i915 0000:00:02.0: mpv[2818] context reset due to GPU hang
[ 569.195453] i915 0000:00:02.0: [drm] GPU HANG: ecode 7:1:85ddfffd, in mpv [2694][ 569.195455] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.[ 569.195455] Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.[ 569.195456] Please see https://gitlab.freedesktop.org/drm/intel/-/wikis/How-to-file-i915-bugs for details.[ 569.195457] drm/i915 developers can then reassign to the right component if it's not a kernel issue.[ 569.195457] The GPU crash dump is required to analyze GPU hangs, so please always attach it.[ 569.195458] GPU crash dump saved to /sys/class/drm/card0/error[ 569.198346] i915 0000:00:02.0: [drm] Resetting chip for stopped heartbeat on rcs0[ 569.301155] i915 0000:00:02.0: [drm] mpv[2694] context reset due to GPU hang
I tried booting with several combinations of kernel options on the 5.7.12-arch1-1 kernel, such as nomodeset, i915.modeset=0, intel_iommu=on, intel_iommu=igfs_off, intel_iommu=on,igfx_off.
Nothing I tried improved anything, and some options caused my C720 to not boot. Everything still works fine with the 5.4.55-1 lts kernel
I am also having this issue as of the past 1-2 months on a system I'm working on. I will update my post when I can access the system... The system is on Arch linux latest pacman updates. I tried various kernel boot options as well such as
intel_iommu=on
intel_iommu=off
i915.enable_fbc=1
Kernel current version is 5.7.12-arch1-1.
The system becomes unstable/unresponsive and often needs to be manually restarted after this. It seems to happen randomly, but with audio and web browsers. With web browsers it happens constantly with Brave, and frequently (but not as much) with firefox.
Jul 26 23:07:38 fred audit: BPF prog-id=16 op=UNLOADJul 26 23:07:38 fred audit: BPF prog-id=15 op=UNLOADJul 26 23:07:38 fred kernel: audit: type=1334 audit(1595822858.563:73): prog-id=16 op=UNLOADJul 26 23:07:38 fred kernel: audit: type=1334 audit(1595822858.563:74): prog-id=15 op=UNLOADJul 26 23:07:42 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in brave [1422]Jul 26 23:07:42 fred kernel: GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.Jul 26 23:07:42 fred kernel: Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.Jul 26 23:07:42 fred kernel: Please see https://gitlab.freedesktop.org/drm/intel/-/wikis/How-to-file-i915-bugs for details.Jul 26 23:07:42 fred kernel: drm/i915 developers can then reassign to the right component if it's not a kernel issue.Jul 26 23:07:42 fred kernel: The GPU crash dump is required to analyze GPU hangs, so please always attach it.Jul 26 23:07:42 fred kernel: GPU crash dump saved to /sys/class/drm/card0/errorJul 26 23:07:42 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jul 26 23:07:42 fred kernel: i915 0000:00:02.0: brave[1422] context reset due to GPU hangJul 26 23:07:45 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79Jul 26 23:07:45 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jul 26 23:07:51 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffdJul 26 23:07:51 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jul 26 23:07:53 fred dbus-daemon[315]: [system] Failed to activate service 'org.bluez': timed out (service_start_timeout=25000ms)Jul 26 23:07:54 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffd, in brave [1544]Jul 26 23:07:54 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jul 26 23:08:00 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffd, in brave [1544]Jul 26 23:08:00 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jul 26 23:08:00 fred kernel: i915 0000:00:02.0: brave[1544] context reset due to GPU hangJul 26 23:08:06 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffdJul 26 23:08:06 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffdJul 26 23:08:06 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on bcs0Jul 26 23:08:09 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc7b, in Xorg [338]Jul 26 23:08:09 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc7b, in Xorg [338]Jul 26 23:08:09 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jul 26 23:08:09 fred kernel: i915 0000:00:02.0: Xorg[338] context reset due to GPU hangJul 26 23:08:12 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in brave [1573]Jul 26 23:08:12 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jul 26 23:08:15 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in Xorg [338]Jul 26 23:08:15 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on bcs0Jul 26 23:08:18 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in brave [1573]Jul 26 23:08:18 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jul 26 23:08:18 fred kernel: i915 0000:00:02.0: brave[1573] context reset due to GPU hangJul 26 23:08:24 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:85ddfffd, in brave [1573]Jul 26 23:08:24 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jul 26 23:08:24 fred kernel: i915 0000:00:02.0: brave[1573] context reset due to GPU hangJul 26 23:08:30 fred kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc7b, in Xorg [338]Jul 26 23:08:30 fred kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Jul 26 23:08:30 fred kernel: i915 0000:00:02.0: Xorg[338] context reset due to GPU hangJul 26 23:21:42 fred systemd[1]: Starting Cleanup of Temporary Directories...Jul 26 23:21:43 fred systemd[1]: systemd-tmpfiles-clean.service: Succeeded.Jul 26 23:21:43 fred systemd[1]: Finished Cleanup of Temporary Directories.Jul 26 23:21:43 fred audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295```
[ 19.540009] i915 0000:00:02.0: [drm] GPU HANG: ecode 7:1:8edcfc79, in kscreenlocker_g [718][ 19.540011] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.[ 19.540012] Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.[ 19.540012] Please see https://gitlab.freedesktop.org/drm/intel/-/wikis/How-to-file-i915-bugs for details.[ 19.540013] drm/i915 developers can then reassign to the right component if it's not a kernel issue.[ 19.540013] The GPU crash dump is required to analyze GPU hangs, so please always attach it.[ 19.540014] GPU crash dump saved to /sys/class/drm/card0/error[ 19.540048] i915 0000:00:02.0: [drm] Resetting chip for stopped heartbeat on rcs0[ 19.642021] i915 0000:00:02.0: [drm] kscreenlocker_g[718] context reset due to GPU hang[ 25.519871] i915 0000:00:02.0: [drm] GPU HANG: ecode 7:1:8edcfc7b, in kscreenlocker_g [718][ 25.519919] i915 0000:00:02.0: [drm] Resetting chip for stopped heartbeat on rcs0[ 25.621270] i915 0000:00:02.0: [drm] kscreenlocker_g[718] context reset due to GPU hang
git bisect bad 71fa42fac5d67773d83d6bc058d992a6226dbb01git bisect good 0419c450e1bb5eb1c68ba10efeefddaf556cde85git bisect bad 57a333f8f8896ca66e9d8153930247a3cfd7b592git bisect good 2c523b344dfa65a3738e7039832044aa133c75fb
finally tested 2 times both non working:
git bisect bad 9f880327160feb695de03caa29604883b0d00087
Processor Intel G3260, Integrated video driver only, Fedora 31 (KDE Plasma), Kernel 5.7.15. After i enter my password in login manager (SDDM) i only see a black screen with cursor, then my system goes back to login screen. The same effect on the Kernel 5.7.8-100. Works fine with the Kernel 5.6.19-200.
Aug 24 19:43:55 localhost.localdomain kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Aug 24 19:43:55 localhost.localdomain kernel: i915 0000:00:02.0: kwin_x11[1404] context reset due to GPU hangAug 24 19:43:55 localhost.localdomain kernel: [drm:intel_gt_verify_workarounds [i915]] *ERROR* GT workaround lost on init! (e184=0/0, expected 2000200)Aug 24 19:43:55 localhost.localdomain korgac[1462]: org.kde.pim.akonadicore: Job error: "" for collection: QVector()Aug 24 19:43:56 localhost.localdomain plasmashell[1416]: Connecting to deprecated signal QDBusConnectionInterface::serviceOwnerChanged(QString,QString,QString)Aug 24 19:43:56 localhost.localdomain kdeconnectd[1424]: kdeconnect.core: Could not query capabilities from notifications serverAug 24 19:43:57 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:43:57 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:43:57 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:43:57 localhost.localdomain plasmashell[1416]: trying to show an empty dialogAug 24 19:43:57 localhost.localdomain plasmashell[1416]: trying to show an empty dialogAug 24 19:43:58 localhost.localdomain plasmashell[1416]: Component "DBusProperty" is specified with non-relative URL "qrc:///kdeconnect/qml/DBusProperty.qml" in a qmldir file. >Aug 24 19:43:58 localhost.localdomain plasmashell[1416]: Component "PluginChecker" is specified with non-relative URL "qrc:///kdeconnect/qml/PluginChecker.qml" in a qmldir file>Aug 24 19:43:58 localhost.localdomain plasmashell[1416]: Component "RemoteKeyboard" is specified with non-relative URL "qrc:///kdeconnect/qml/RemoteKeyboard.qml" in a qmldir fi>Aug 24 19:43:59 localhost.localdomain systemd[1233]: dbus-:1.2-org.freedesktop.Notifications@0.service: Succeeded.Aug 24 19:43:59 localhost.localdomain cupsd[797]: REQUEST localhost - - "POST / HTTP/1.1" 200 564 Create-Printer-Subscriptions successful-okAug 24 19:43:59 localhost.localdomain systemd[1]: Starting PackageKit Daemon...Aug 24 19:43:59 localhost.localdomain plasmashell[1416]: QQuickItem::stackAfter: Cannot stack StatusNotifierItem_QMLTYPE_291(0x556731b506c0, parent=0x5567317f4ec0, geometry=0,0>Aug 24 19:43:59 localhost.localdomain PackageKit[1694]: daemon startAug 24 19:43:59 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:43:59 localhost.localdomain plasmashell[1416]: file:///usr/share/plasma/plasmoids/org.kde.plasma.battery/contents/ui/CompactRepresentation.qml:53:17: Unable to assign>Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: trying to show an empty dialogAug 24 19:44:00 localhost.localdomain plasmashell[1416]: file:///usr/share/plasma/shells/org.kde.plasma.desktop/contents/views/Desktop.qml:146:19: QML Loader: Binding loop dete>Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: file:///usr/share/plasma/shells/org.kde.plasma.desktop/contents/views/Desktop.qml:146:19: QML Loader: Binding loop dete>Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: KAStatsFavoritesModel::setFavorites is ignoredAug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "kontact.desktop" QSharedPointer(0x556732fab4e0)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "org.kde.ktpcontactlist.desktop" QSharedPointer(0x556732fab4e0)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "ktpcontactlist.desktop" QSharedPointer(0x556732fab4e0)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "org.kde.kate.desktop" QSharedPointer(0x556732fa1c10)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "kate.desktop" QSharedPointer(0x556732fa1c10)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "org.kde.apper.desktop" QSharedPointer(0x556732fa1c10)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "apper.desktop" QSharedPointer(0x556732fa1c10)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "kontact.desktop" QSharedPointer(0x556732fa1c10)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "org.kde.ktpcontactlist.desktop" QSharedPointer(0x556732fa1c10)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "ktpcontactlist.desktop" QSharedPointer(0x556732fa1c10)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "org.kde.kate.desktop" QSharedPointer(0x556732fa1c10)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "kate.desktop" QSharedPointer(0x556732fa1c10)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "org.kde.apper.desktop" QSharedPointer(0x556732fa1c10)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: Entry is not valid "apper.desktop" QSharedPointer(0x556732f9f230)Aug 24 19:44:00 localhost.localdomain plasmashell[1416]: trying to show an empty dialogAug 24 19:44:01 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:44:01 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:44:01 localhost.localdomain audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=packagekit comm="systemd" ex>Aug 24 19:44:01 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:44:01 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:44:01 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:44:01 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:44:01 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:44:01 localhost.localdomain plasmashell[1416]: Both point size and pixel size set. Using pixel size.Aug 24 19:44:01 localhost.localdomain systemd[1]: Started PackageKit Daemon.Aug 24 19:44:01 localhost.localdomain plasmashell[1416]: trying to show an empty dialogAug 24 19:44:01 localhost.localdomain kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in Xorg [840]Aug 24 19:44:01 localhost.localdomain kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Aug 24 19:44:01 localhost.localdomain pulseaudio[1475]: E: [pulseaudio] bluez5-util.c: GetManagedObjects() failed: org.freedesktop.DBus.Error.NoReply: Did not receive a reply. >Aug 24 19:44:01 localhost.localdomain kernel: i915 0000:00:02.0: Xorg[840] context reset due to GPU hangAug 24 19:44:01 localhost.localdomain kernel: [drm:intel_gt_verify_workarounds [i915]] *ERROR* GT workaround lost on init! (e184=0/0, expected 2000200)Aug 24 19:44:01 localhost.localdomain PackageKit[1694]: uid 1000 is trying to obtain org.freedesktop.packagekit.system-sources-refresh auth (only_trusted:0)Aug 24 19:44:01 localhost.localdomain PackageKit[1694]: uid 1000 obtained auth for org.freedesktop.packagekit.system-sources-refreshAug 24 19:44:03 localhost.localdomain PackageKit[1694]: refresh-cache transaction /1503_ccbbbcee from uid 1000 finished with success after 1912msAug 24 19:44:05 localhost.localdomain kwin_x11[1404]: qt.qpa.xcb: QXcbConnection: XCB error: 3 (BadWindow), sequence: 1092, resource id: 52428805, major code: 18 (ChangePropert>Aug 24 19:44:07 localhost.localdomain kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc79, in Xorg [840]Aug 24 19:44:07 localhost.localdomain kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Aug 24 19:44:07 localhost.localdomain kernel: i915 0000:00:02.0: Xorg[840] context reset due to GPU hangAug 24 19:44:07 localhost.localdomain kernel: [drm:intel_gt_verify_workarounds [i915]] *ERROR* GT workaround lost on init! (e184=0/0, expected 2000200)Aug 24 19:44:10 localhost.localdomain kernel: i915 0000:00:02.0: GPU HANG: ecode 7:1:8edcfc7b, in Xorg [840]Aug 24 19:44:10 localhost.localdomain kernel: i915 0000:00:02.0: Resetting chip for stopped heartbeat on rcs0Aug 24 19:44:10 localhost.localdomain kernel: i915 0000:00:02.0: Xorg[840] context reset due to GPU hangAug 24 19:44:10 localhost.localdomain kernel: [drm:intel_gt_verify_workarounds [i915]] *ERROR* GT workaround lost on init! (e184=0/0, expected 2000200)Aug 24 19:44:10 localhost.localdomain korgac[1462]: The X11 connection broke (error 1). Did the X11 server die?
7:1:85ddfffd with kernel 5.7.17-200.fc32.x86_64 x86_64
7:1:8edcfc79 with 5.8.6-201.fc32.x86_64 x86_64
Results in a machine entirely without graphics from boot, the only means of interaction being ttys2-6.
GPU HANG: ecode 7:1:8edcfc79, in ksplashqml [1229]Kernel: 5.8.6-201.fc32.x86_64 x86_64Driver: 20200515Time: 1599592553 s 982336 usBoottime: 43 s 835163 usUptime: 39 s 394040 usCapture: 4294711040 jiffies; 335255 ms agoActive process (on ring rcs0): ksplashqml [1229]Reset count: 0Suspend count: 0Platform: HASWELL
GPU HANG: ecode 7:1:85ddfffd, in kwin_x11 [1215]Kernel: 5.7.17-200.fc32.x86_64 x86_64Driver: 20200313Time: 1598715941 s 22107 usBoottime: 43 s 837975 usUptime: 39 s 471519 usCapture: 4294711040 jiffies; 934957 ms agoActive process (on ring rcs0): kwin_x11 [1215]Reset count: 0Suspend count: 0Platform: HASWELL
Forget it. Freeloaders in Intel are even too lazy to revert their broken and obsecure "security fix". Basically they just throw out garbage (both chip and code) that doesn't work and say "hey I fixed my broken chip already", and expect you to buy newer (but also likely-broken) chip before you need to.
So it won't help even if we were all willing to give the world to them, if the world has nothing to do with money.
@tomty89: Once Ubuntu 20.10 releases (which has 5.8), I imagine there will be quite a few people affected. I hope that will encourage the solution for this problem. ):
Since I am using kernel release 5.7 (version 5.7.19 atm) for several weeks I was wondering why my system is not affected by this bug. Apparently all users reporting this bug are using the "HSW GT1" chipset according to the IDs posted here which I compared with https://cgit.freedesktop.org/mesa/mesa/tree/include/pci_ids/i965_pci_ids.h
My system uses the "HSW GT2" chipset as quoted from kodi.log:
In my case the intel_iommu=on flag does not help. I still have the crash. Also, I read in one of the other threads that this causes power consumption to go up significantly, can you confirm?
I am starting to wonder if we have 2 or 3 different issues going on: a Braswell that you and @guajardoscar are having, a Haswell (possibly exclusively GT1) and maybe a Sandy Bridge one (although could be related to either one of the other ones). My clues are that intel_iommu works on some cases, but not others and that @guajardoscar stated that his issue happens with 5.6 as well, while in my cases (and I think others too), kernel 5.6 works perfectly fine.
What happens if you apply the patch to 5.7? Does it also work? If it doesn't, maybe Cherryview/Braswell has two bugs. I have found the post that reported higher power consumption, it is here: https://bbs.archlinux.org/viewtopic.php?id=256520&p=3
I think intel_iommu enabled additional virtualization abilities. I remember seeing it for GPU passthrough (connecting a GPU directly to a VM), which would require Vt-d, but I don't know if intel_iommu requires Vt-d. In that case, it wouldn't make sense for my 2955U (also no Vt-d). But I am not sure exactly what all options even mean (soft, on, pt, igfx_off, etc).
As a summary. Seems like there are multiple problems (perhaps leading to the same effect - GPU hang):
The braswell/cherryview related one, which seems to be fixed in 5.9.1 (according to #2024 (comment 669575)) and can be worked around with intel_iommu=on
A haswell one (possibly affecting only GT1) one that is not fixed in 5.9.1 and where the intel_iommu workaround does not help, but some have reported that the patch in #2024 (comment 530965) helps, but at least one person reported a clocking issue.
Tried the && 0 patch on kernel 5.9 this time and no crash but got the clocking error again. My G3258 is only running at 800MHz instead of 4.2GHz. 5.6.14 remains the last working version.
G3220 on 5.10.0-rc1 still hangs. Part of my config:
CONFIG_INTEL_IOMMU_DEFAULT_ON=yCONFIG_INTEL_IOMMU_FLOPPY_WA=y# CONFIG_INTEL_IOMMU_SCALABLE_MODE_DEFAULT_ON is not setCONFIG_DRM_I915=mCONFIG_DRM_I915_FORCE_PROBE="*"CONFIG_DRM_I915_CAPTURE_ERROR=yCONFIG_DRM_I915_COMPRESS_ERROR=yCONFIG_DRM_I915_USERPTR=yCONFIG_DRM_I915_GVT=y# CONFIG_DRM_I915_WERROR is not set# CONFIG_DRM_I915_DEBUG is not set# CONFIG_DRM_I915_DEBUG_MMIO is not set# CONFIG_DRM_I915_SW_FENCE_DEBUG_OBJECTS is not set# CONFIG_DRM_I915_SW_FENCE_CHECK_DAG is not set# CONFIG_DRM_I915_DEBUG_GUC is not set# CONFIG_DRM_I915_SELFTEST is not set# CONFIG_DRM_I915_LOW_LEVEL_TRACEPOINTS is not set# CONFIG_DRM_I915_DEBUG_VBLANK_EVADE is not set# CONFIG_DRM_I915_DEBUG_RUNTIME_PM is not setCONFIG_DRM_I915_FENCE_TIMEOUT=10000CONFIG_DRM_I915_USERFAULT_AUTOSUSPEND=250CONFIG_DRM_I915_HEARTBEAT_INTERVAL=2500CONFIG_DRM_I915_PREEMPT_TIMEOUT=640CONFIG_DRM_I915_MAX_REQUEST_BUSYWAIT=8000CONFIG_DRM_I915_STOP_TIMEOUT=100CONFIG_DRM_I915_TIMESLICE_DURATION=1CONFIG_SND_HDA_I915=y
With the && 0 patch it works fine. And the CPU is not pegged to a frequency, I can see it going to 3GHz and lower.
My frequency scaling problem was fixed by setting the default scheduler to "ondemand" rather than "userspace". Turns out this was nothing to do with the patches or anything discussed here, but an unrelated change to the way intel_pstate works on older CPUs.
I have an n3050 and have been affected from 5.4.4x up to 5.4.52. 5.4.51 was a safe haven for me until I updated some packages (more than likely linux firmware)
[878391.747986] i915 0000:00:02.0: [drm] GPU HANG: ecode 7:1:85dfbff8, in Compositor [25772][878391.747987] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.[878391.747988] Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.[878391.747988] Please see https://gitlab.freedesktop.org/drm/intel/-/wikis/How-to-file-i915-bugs for details.[878391.747989] drm/i915 developers can then reassign to the right component if it's not a kernel issue.[878391.747989] The GPU crash dump is required to analyze GPU hangs, so please always attach it.[878391.747990] GPU crash dump saved to /sys/class/drm/card0/error[878391.748025] i915 0000:00:02.0: [drm] Resetting chip for stopped heartbeat on rcs0[878391.850535] i915 0000:00:02.0: [drm] Compositor[25772] context reset due to GPU hang[878493.777763] i915 0000:00:02.0: [drm] GPU HANG: ecode 7:1:dddff7ff, in Xwayland [13123][878493.777984] i915 0000:00:02.0: [drm] Resetting chip for stopped heartbeat on rcs0[878493.879694] i915 0000:00:02.0: [drm] Xwayland[13123] context reset due to GPU hang[878501.777727] i915 0000:00:02.0: [drm] GPU HANG: ecode 7:1:dddff7ff, in Xwayland [13123][878501.777803] i915 0000:00:02.0: [drm] Resetting chip for stopped heartbeat on rcs0[878501.879826] i915 0000:00:02.0: [drm] Xwayland[13123] context reset due to GPU hang
It seemed to be triggered by Firefox somehow, but even after killing Firefox I had to reboot before I could use my window manager again.