Workaround (on certain hardware) to GPU Reset with amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout
Since these issues are getting old without solution (which could be understandable as its hard to reproduce) I looked for a workaround.
Related issues :
My own hardware is Ryzen 9 7940HS w/ Radeon 780M Graphics, and Radeon RX 7600S.
Last year I had problems with this new hardware that forced me to use a specific linux-firmware commit : #2575 (comment 1928691)
I decided to give it a try and do the same thing again (as my manually installed firmware files were replaced by debian updates) :
git clone https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git
cd linux-firmware
git checkout 2e77545089e5d2481cf6dcad8d468e138f3396b9
git revert 60dc78a708ad283b9c43b8b8b91ad4cc5568f076
git revert dcd30473de546d2c20502c544ba0de6a78370f37
sudo make install
Running 6.10 kernel, 4 days now and not a single crash (happened several times a day). kwin compositor sometimes stutters, especially with transparent windows animations, but it's still better than a whole system crash.
The base commit 2e7754508 is 1 year old, so I'll try to find some time to pinpoint on which commit exactly the issue appeared.