Due to an influx of spam, we have had to impose restrictions on new accounts. Please see this wiki page for instructions on how to get full permissions. Sorry for the inconvenience.
Admin message
Our infrastructure migration is complete. Please remember to update your SSH remote to point to ssh.gitlab.freedesktop.org; SSH to the old hostname will time out. You should not see any problems apart from that. Please let us know if you do have any other issues.
My Laptop often experiences random complete graphical freezes, requiring a hard reboot. Sound persists, but mouse and keyboard are unresponsive.
I experience this freeze 1-2 times per day, typically when launching an application. Ironically, the most reliable way I can make the system freeze is by running lspci -nn to get information for this bug report.
I can reproduce this bug on kernels 5.12.x-5.17.4 (All kernels shipped in Fedora 35).
The process of writing this bug report has given me the most reliable way to reproduce my system freeze. I boot up, and repeatedly run the lspci -nn command, it will freeze within running the command five times.
I have since upgraded to Fedora 36, kernel 5.17.9, but this problem still persists. Is there any other information I need to provide for developers to at least look at this problem?
I upgraded to kernel 5.19.4 over the weekend, and still experience kernel freezes related to amdgpu drm. It's still random, but the freezes are notably more frequent with programs that interact with Wine/Proton (Steam & Bottles).
Running kernel 5.19.9 and linux-firmware 20220815-139
Please some developer reply so I know you've at least seen this bug report! I can ssh into my system during a freeze, maybe I can run live diagnostics or something? Anything to help find the root cause of this error.
May 06 22:32:32 fedorapc kernel: [drm] Fence fallback timer expired on ring gfxMay 06 22:32:32 fedorapc kernel: [drm] Fence fallback timer expired on ring sdma0
in your logs. This is often means that interrupts are not working properly on your platform. You might try a newer bios if one is available. Are you seeing any interrupts being delivered to any CPUs for amdgpu in /proc/interrupts?
I have been on the latest Dell BIOS 1.11.0 since April, but I see there was a new version released at the end of September, so I will upgrade and try that.
I do see interrupts for amdgpu. Here's the output of cat /proc/interrupts | grep amd:interrupts.log
I did try running 5.10.20 for a while. I even installed Debian for a few weeks and used 5.10.08.
They were definitely more stable, but I still experienced occasional freezes. The difference was those freezes had [CRTC:62:crtc-0] errors instead of [CRTC:67:crtc-0] errors.
It happens on my MSI A4DEK too. Any notebook with Renoir CPU + 6GB Navi 10 GPU suffers from this.
Me and a few tried to bisect and we found only red herrings.
I'm basically stuck on 5.11.22 which is ancient and doesn't support all the hardware on my notebook.
I'll keep adding logs to this issue. It's really hard to pin down, because I've not found any specific action to trigger the freeze. The most common triggers are launching Discord, Minecraft, and Steam. But, I can go days without issues or it freezes within minutes of booting without even launching any programs.
Here's a recent log where it froze withing a minute of a cold boot: 2022-11-16-dmesg.log
This is Fedora 36, kernel 6.0.8-200.fc36, amd-firmware 20221109-144.fc36
juuni 22 22:39:31 Zen kernel: [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:63:crtc-1] hw_done or flip_done timed outjuuni 22 22:39:41 Zen kernel: [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:66:crtc-2] hw_done or flip_done timed outjuuni 22 22:39:51 Zen kernel: [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:69:crtc-3] hw_done or flip_done timed outjuuni 22 22:40:02 Zen kernel: [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:63:crtc-1] hw_done or flip_done timed outjuuni 22 22:41:14 Zen kernel: [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:66:crtc-2] hw_done or flip_done timed outjuuni 22 22:41:28 Zen kernel: [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:66:crtc-2] hw_done or flip_done timed out
just right now, first time ever such problem, on my desktop:
Ryzen 5950X
Vega 64
linux-firmware 20230117.7e4f0ed-1 (had to downgrade linux-firmware yesterday from 20230404.2e92a49f-1 because that caused some issues with Ryzen 5950X), which previously worked fine.
I'm still using Fedora, now version 38, with GNOME, mesa 23.1.3-1.fc38. Since kernel 6.2.x, this problem has started occurring multiple times a day: just logging into the desktop, just launching Firefox, launching Steam. It seems like any random thing might cause this freeze.
But today, I had a slightly different behavior, maybe the logs can help. I restarted a game in Steam, causing the same "[CRTC:67:crtc-0] flip_done timed out" error, but the system managed to recover, then freeze again completely a few minutes later.
Unfortunately, I am still experiencing the same CRTC:67 error. For 6.4.x & 6.5.x kernels, it freezes about once every week. All freezes only occur when launching a program: launching Firefox, Thunderbird, Krita, Steam, and Discord all have caused this freeze.
This is an improvement over 6.3.x and earlier kernels, where I was experiencing this freeze several times a day, even multiple an hour.
Whatever this issue is (maybe a multitude of issues causing the same error) it goes back to previous lts kernels as well. Switching to 6.1-lts and 5.15-lts do not get rid of this error, it maybe only impacts the frequency of the error.
Unfortunately, yes it froze today with the same error. I have been running kernel 6.5.5-200.fc38.x86_64 on Fedora 38 for almost 2 weeks. Today was the first freeze on 6.5.5.
I'm running 6.5.5 right now. It's definitely better than 6.2-6.4, but only about as stable as 6.1 lts for me. The crashes are different now, so I'm pretty certain 6.5.5 fixed something, but there are more issues it seems.
I can always get a crash now by running Wayland and plugging or unplugging an extra monitor. But this does not happen on X11.
When it crashes on X11, the system is still running, just the entire GUI is unresponsive, keyboard capslock light is also not responsive, but plugging USB in/out causes notification sounds.
When it crashes on Wayland, Plasma does something and a few seconds later the UI is all back to normal.
Looking at logs in journalctl I have tons of:
[drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to process the buffer list -22!
And:
[drm:dm_plane_helper_prepare_fb [amdgpu]] *ERROR* Failed to pin framebuffer with error -22
[drm] Fence fallback timer expired on ring sdma0[drm] Fence fallback timer expired on ring gfx_high[drm] Fence fallback timer expired on ring gfx_high[drm] Fence fallback timer expired on ring gfx_low[drm] Fence fallback timer expired on ring gfx_low[drm] Fence fallback timer expired on ring sdma0[drm] Fence fallback timer expired on ring gfx_low[drm] Fence fallback timer expired on ring gfx_low[drm] Fence fallback timer expired on ring gfx_low[drm] Fence fallback timer expired on ring sdma0amdgpu 0000:09:00.0: [drm] *ERROR* [CRTC:67:crtc-0] flip_done timed out
And:
[drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_low timeout, signaled seq=1573363, emitted seq=1573365[drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process kwin_x11 pid 2296 thread kwin_x11:cs0 pid 2319
The actual CRTC:67 error isn't super common, but it's still there. These all happened on 6.5.5:
Linux version 6.5.5-gentoo (root@grigibook) (gcc (Gentoo 13.2.1_p20230826 p7) 13.2.1 20230826, GNU ld (Gentoo 2.40 p5) 2.40.0) #1 SMP PREEMPT_DYNAMIC Thu Oct 5 21:27:38 BST 2023
I suspect there are so many causes of the same symptoms, sadly.
On kernel 6.5.11, I'm still seeing the [CRTC:67:crtc-0] drm error, but the freeze behavior is slightly different.
Here's a log where the graphics froze after launching Steam, but then the screen turned on-off several times before freezing: 2023-11-14-dmesg_steam.log
And here's one where the graphics froze after launching Firefox. The screen turned on-off once then successfully recovered. However, it would later freeze after launching Steam: 2023-11-14-dmesg_firefox_steam.log