7840h/780m system crash after update to linux kernel 6.10
I encountered crashes after update to linux kernel 6.10 (linux-cachyos-6.10.0 to be specific) on a 7840h/780m system with a 4k display. I can reproduce it every time with watching videos in browsers (firefox, chromium et.) System will crash in about 1-2mins after a video begin. I also experienced unusual choppiness while playing Dota 2 games.
Everything returned to normal after I downgraded to kernel 6.9.9.
Below are some relevant log lines.
Jul 16 10:13:32 PM kwin_wayland: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Jul 16 10:13:27 PM kwin_wayland: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Jul 16 10:13:22 PM kernel: amdgpu 0000:c6:00.0: amdgpu: RW: 0x0
Jul 16 10:13:22 PM kernel: amdgpu 0000:c6:00.0: amdgpu: MAPPING_ERROR: 0x0
Jul 16 10:13:22 PM kernel: amdgpu 0000:c6:00.0: amdgpu: PERMISSION_FAULTS: 0x1
Jul 16 10:13:22 PM kernel: amdgpu 0000:c6:00.0: amdgpu: WALKER_ERROR: 0x0
Jul 16 10:13:22 PM kernel: amdgpu 0000:c6:00.0: amdgpu: MORE_FAULTS: 0x1
Jul 16 10:13:22 PM kernel: amdgpu 0000:c6:00.0: amdgpu: Faulty UTCL2 client ID: unknown (0x1d)
Jul 16 10:13:22 PM kernel: amdgpu 0000:c6:00.0: amdgpu: MMVM_L2_PROTECTION_FAULT_STATUS:0x00103A11
Jul 16 10:13:22 PM kernel: amdgpu 0000:c6:00.0: amdgpu: in page starting at address 0x000080010b584000 from client 18
Jul 16 10:13:22 PM kernel: amdgpu 0000:c6:00.0: amdgpu: in process RDD Process pid 2433 thread cachy-brow:cs0 pid 2493)
Jul 16 10:13:22 PM kernel: amdgpu 0000:c6:00.0: amdgpu: [mmhub] page fault (src_id:0 ring:8 vmid:1 pasid:32789)
UPDATE 1
linux-firmware version is 20240703.e94a2a3b-1
result of "sudo cat /sys/kernel/debug/dri/0/amdgpu_firmware_info"
VCE feature version: 0, firmware version: 0x00000000
UVD feature version: 0, firmware version: 0x00000000
MC feature version: 0, firmware version: 0x00000000
ME feature version: 35, firmware version: 0x00000027
PFP feature version: 35, firmware version: 0x00000030
CE feature version: 0, firmware version: 0x00000000
RLC feature version: 1, firmware version: 0x00000080
RLC SRLC feature version: 0, firmware version: 0x00000000
RLC SRLG feature version: 0, firmware version: 0x00000000
RLC SRLS feature version: 0, firmware version: 0x00000000
RLCP feature version: 1, firmware version: 0x0000000f
RLCV feature version: 0, firmware version: 0x00000000
MEC feature version: 35, firmware version: 0x00000027
IMU feature version: 0, firmware version: 0x0b012d00
SOS feature version: 0, firmware version: 0x00000000
ASD feature version: 553648345, firmware version: 0x210000d9
TA XGMI feature version: 0x00000000, firmware version: 0x00000000
TA RAS feature version: 0x00000000, firmware version: 0x00000000
TA HDCP feature version: 0x00000000, firmware version: 0x1700003f
TA DTM feature version: 0x00000000, firmware version: 0x12000016
TA RAP feature version: 0x00000000, firmware version: 0x00000000
TA SECUREDISPLAY feature version: 0x00000000, firmware version: 0x00000000
SMC feature version: 0, program: 0, firmware version: 0x004c4e00 (76.78.0)
SDMA0 feature version: 60, firmware version: 0x00000012
VCN feature version: 0, firmware version: 0x08115002
DMCU feature version: 0, firmware version: 0x00000000
DMCUB feature version: 0, firmware version: 0x08003d00
TOC feature version: 0, firmware version: 0x0000000b
MES_KIQ feature version: 6, firmware version: 0x00000073
MES feature version: 1, firmware version: 0x0000005f
VPE feature version: 0, firmware version: 0x00000000
VBIOS version: 113-PHXGENERIC-001