Phoenix 780M: X.org suddenly died
7840HS here, using 6.5.6-200.fc38.x86_64
and otherwise fully updated Fedora 38.
As I was scrolling Twitter in Firefox, X.org suddenly died and I had to use SysRq reboot because the driver never recovered and I couldn't even switch to console:
Oct 21 15:35:33 hp kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=2668505, emitted seq=2668507
Oct 21 15:35:33 hp kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 1523 thread Xorg:cs0 pid 1558
Oct 21 15:35:33 hp kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset begin!
Oct 21 15:35:33 hp kernel: [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
Oct 21 15:35:33 hp kernel: [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
Oct 21 15:35:33 hp kernel: [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
Oct 21 15:35:33 hp kernel: [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
Oct 21 15:35:33 hp kernel: [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
Oct 21 15:35:33 hp kernel: [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
Oct 21 15:35:34 hp kernel: [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
Oct 21 15:35:34 hp kernel: [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
Oct 21 15:35:34 hp kernel: [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
Oct 21 15:35:34 hp kernel: [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
Oct 21 15:35:34 hp kernel: [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
Oct 21 15:35:34 hp kernel: [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
Oct 21 15:35:34 hp kernel: [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
Oct 21 15:35:34 hp kernel: [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
Oct 21 15:35:34 hp kernel: [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
Oct 21 15:35:34 hp kernel: [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
Oct 21 15:35:34 hp kernel: [drm:mes_v11_0_submit_pkt_and_poll_completion.constprop.0 [amdgpu]] *ERROR* MES failed to response msg=3
Oct 21 15:35:34 hp kernel: [drm:amdgpu_mes_unmap_legacy_queue [amdgpu]] *ERROR* failed to unmap legacy queue
Oct 21 15:35:35 hp kernel: [drm:gfx_v11_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
Oct 21 15:35:35 hp kernel: amdgpu 0000:c3:00.0: amdgpu: MODE2 reset
Oct 21 15:35:35 hp kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset succeeded, trying to resume
Oct 21 15:35:35 hp kernel: [drm] PCIE GART of 512M enabled (table at 0x000000801FD00000).
Oct 21 15:35:35 hp kernel: amdgpu 0000:c3:00.0: amdgpu: SMU is resuming...
Oct 21 15:35:35 hp kernel: amdgpu 0000:c3:00.0: amdgpu: SMU is resumed successfully!
Oct 21 15:35:35 hp kernel: [drm] DMUB hardware initialized: version=0x08002000
Oct 21 15:35:55 hp kernel: [drm:amdgpu_dm_process_dmub_aux_transfer_sync [amdgpu]] *ERROR* wait_for_completion_timeout timeout!
Oct 21 15:36:05 hp kernel: [drm:amdgpu_dm_process_dmub_aux_transfer_sync [amdgpu]] *ERROR* wait_for_completion_timeout timeout!
Oct 21 15:36:13 hp kernel: [drm:amdgpu_dm_process_dmub_aux_transfer_sync [amdgpu]] *ERROR* wait_for_completion_timeout timeout!
Oct 21 15:36:13 hp kernel: sysrq: Emergency Remount R/O
Is this a known issue? Should I try upgrading to 6.5.8?
Edited by Artem S. Tashkinov