The system dies when watching HW accelerated YouTube videos in Firefox
Sometimes when watching HW accelerated YouTube videos in Firefox the system dies
Hardware description:
- CPU :Ryzen 7 7840HS
- GPU: c3:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Phoenix1 [1002:15bf] (rev d4)
- System Memory: 32GB
- Display(s): 2560x1600@120Hz
- Type of Display Connection: eDP
System information:
- Distro name and Version: Fedora 41
- Kernel version: Linux elite 6.12.10-200.fc41.x86_64 #1 (closed) SMP PREEMPT_DYNAMIC Fri Jan 17 18:05:24 UTC 2025 x86_64 GNU/Linux
How to reproduce the issue:
Watch a YouTube video in Firefox 134.0
[ 11.195147] [drm] amdgpu kernel modesetting enabled.
[ 11.195722] amdgpu: Virtual CRAT table created for CPU
[ 11.195733] amdgpu: Topology: Add CPU node
[ 11.200636] amdgpu 0000:c3:00.0: amdgpu: Fetched VBIOS from VFCT
[ 11.200639] amdgpu: ATOM BIOS: 113-PHXGENERIC-001
[ 11.249198] amdgpu 0000:c3:00.0: vgaarb: deactivate vga console
[ 11.249204] amdgpu 0000:c3:00.0: amdgpu: Trusted Memory Zone (TMZ) feature enabled
[ 11.249284] amdgpu 0000:c3:00.0: amdgpu: VRAM: 4096M 0x0000008000000000 - 0x00000080FFFFFFFF (4096M used)
[ 11.249287] amdgpu 0000:c3:00.0: amdgpu: GART: 512M 0x00007FFF00000000 - 0x00007FFF1FFFFFFF
[ 11.249538] [drm] amdgpu: 4096M of VRAM memory ready
[ 11.249543] [drm] amdgpu: 13912M of GTT memory ready.
[ 11.275961] amdgpu 0000:c3:00.0: amdgpu: reserve 0x4000000 from 0x80f8000000 for PSP TMR
[ 11.812768] amdgpu 0000:c3:00.0: amdgpu: RAS: optional ras ta ucode is not available
[ 11.820972] amdgpu 0000:c3:00.0: amdgpu: RAP: optional rap ta ucode is not available
[ 11.820975] amdgpu 0000:c3:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[ 11.852679] amdgpu 0000:c3:00.0: amdgpu: SMU is initialized successfully!
[ 11.857917] snd_hda_intel 0000:c3:00.1: bound 0000:c3:00.0 (ops amdgpu_dm_audio_component_bind_ops [amdgpu])
[ 11.959804] kfd kfd: amdgpu: Allocated 3969056 bytes on gart
[ 11.959820] kfd kfd: amdgpu: Total number of KFD nodes to be created: 1
[ 11.960545] amdgpu: Virtual CRAT table created for GPU
[ 11.960669] amdgpu: Topology: Add dGPU node [0x15bf:0x1002]
[ 11.960670] kfd kfd: amdgpu: added device 1002:15bf
[ 11.960683] amdgpu 0000:c3:00.0: amdgpu: SE 1, SH per SE 2, CU per SH 6, active_cu_number 12
[ 11.960688] amdgpu 0000:c3:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[ 11.960690] amdgpu 0000:c3:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[ 11.960691] amdgpu 0000:c3:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[ 11.960692] amdgpu 0000:c3:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[ 11.960692] amdgpu 0000:c3:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[ 11.960693] amdgpu 0000:c3:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[ 11.960693] amdgpu 0000:c3:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[ 11.960694] amdgpu 0000:c3:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[ 11.960695] amdgpu 0000:c3:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[ 11.960695] amdgpu 0000:c3:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[ 11.960696] amdgpu 0000:c3:00.0: amdgpu: ring vcn_unified_0 uses VM inv eng 0 on hub 8
[ 11.960696] amdgpu 0000:c3:00.0: amdgpu: ring jpeg_dec uses VM inv eng 1 on hub 8
[ 11.960697] amdgpu 0000:c3:00.0: amdgpu: ring mes_kiq_3.1.0 uses VM inv eng 13 on hub 0
[ 11.967885] amdgpu 0000:c3:00.0: amdgpu: Runtime PM not available
[ 11.969128] [drm] Initialized amdgpu 3.59.0 for 0000:c3:00.0 on minor 1
[ 11.981730] fbcon: amdgpudrmfb (fb0) is primary device
[ 12.764505] amdgpu 0000:c3:00.0: [drm] fb0: amdgpudrmfb frame buffer device
Jan 20 12:08:33 elite kernel: gmc_v11_0_process_interrupt: 11 callbacks suppressed
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:6 pasid:32770)
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: in process Xorg pid 1915 thread Xorg:cs0 pid 1925)
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: in page starting at address 0x000080011be54000 from client 10
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00601031
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: MORE_FAULTS: 0x1
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: WALKER_ERROR: 0x0
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: PERMISSION_FAULTS: 0x3
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: MAPPING_ERROR: 0x0
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: RW: 0x0
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:6 pasid:32770)
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: in process Xorg pid 1915 thread Xorg:cs0 pid 1925)
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: in page starting at address 0x000080011be54000 from client 10
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: MORE_FAULTS: 0x0
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: WALKER_ERROR: 0x0
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: PERMISSION_FAULTS: 0x0
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: MAPPING_ERROR: 0x0
Jan 20 12:08:33 elite kernel: amdgpu 0000:c3:00.0: amdgpu: RW: 0x0
Jan 20 12:08:43 elite kernel: amdgpu 0000:c3:00.0: amdgpu: Dumping IP State
Jan 20 12:08:43 elite kernel: amdgpu 0000:c3:00.0: amdgpu: Dumping IP State Completed
Jan 20 12:08:43 elite kernel: amdgpu 0000:c3:00.0: amdgpu: ring gfx_0.0.0 timeout, but soft recovered
Firmware used:
cat /sys/kernel/debug/dri/128/amdgpu_firmware_info
VCE feature version: 0, firmware version: 0x00000000
UVD feature version: 0, firmware version: 0x00000000
MC feature version: 0, firmware version: 0x00000000
ME feature version: 35, firmware version: 0x00000028
PFP feature version: 35, firmware version: 0x00000030
CE feature version: 0, firmware version: 0x00000000
RLC feature version: 1, firmware version: 0x00000080
RLC SRLC feature version: 0, firmware version: 0x00000000
RLC SRLG feature version: 0, firmware version: 0x00000000
RLC SRLS feature version: 0, firmware version: 0x00000000
RLCP feature version: 1, firmware version: 0x0000000f
RLCV feature version: 0, firmware version: 0x00000000
MEC feature version: 35, firmware version: 0x00000028
IMU feature version: 0, firmware version: 0x0b012d00
SOS feature version: 0, firmware version: 0x00000000
ASD feature version: 553648363, firmware version: 0x210000eb
TA XGMI feature version: 0x00000000, firmware version: 0x00000000
TA RAS feature version: 0x00000000, firmware version: 0x00000000
TA HDCP feature version: 0x00000000, firmware version: 0x17000042
TA DTM feature version: 0x00000000, firmware version: 0x12000018
TA RAP feature version: 0x00000000, firmware version: 0x00000000
TA SECUREDISPLAY feature version: 0x00000000, firmware version: 0x00000000
SMC feature version: 0, program: 0, firmware version: 0x004c5700 (76.87.0)
SDMA0 feature version: 60, firmware version: 0x00000015
VCN feature version: 0, firmware version: 0x09117008
DMCU feature version: 0, firmware version: 0x00000000
DMCUB feature version: 0, firmware version: 0x08004900
TOC feature version: 0, firmware version: 0x0000000b
MES_KIQ feature version: 6, firmware version: 0x00000073
MES feature version: 1, firmware version: 0x0000006f
VPE feature version: 0, firmware version: 0x00000000
VBIOS version: 113-PHXGENERIC-001
An xorg.log snippet:
12542.014] (EE)
[ 12542.014] (EE) Backtrace:
[ 12542.015] (EE) unw_get_proc_name failed: no unwind info found [-10]
[ 12542.015] (EE) 0: /usr/libexec/Xorg (?+0x0) [0x52eabc]
[ 12542.046] (EE) unw_get_proc_name failed: no unwind info found [-10]
[ 12542.046] (EE) 1: /lib64/libc.so.6 (?+0x0) [0x7fedc5d8b090]
[ 12542.047] (EE) 2: /lib64/libc.so.6 (__pthread_kill_implementation+0x114) [0x7fedc5de40f4]
[ 12542.047] (EE) 3: /lib64/libc.so.6 (gsignal+0x1e) [0x7fedc5d8afde]
[ 12542.047] (EE) 4: /lib64/libc.so.6 (abort+0xdf) [0x7fedc5d72942]
[ 12542.047] (EE) unw_get_proc_name failed: no unwind info found [-10]
[ 12542.047] (EE) 5: /lib64/libgallium-24.3.2.so (?+0x0) [0x7fedc31bcac0]
[ 12542.047] (EE) unw_get_proc_name failed: no unwind info found [-10]
[ 12542.047] (EE) 6: /lib64/libgallium-24.3.2.so (?+0x0) [0x7fedc31bfdea]
[ 12542.047] (EE) unw_get_proc_name failed: no unwind info found [-10]
[ 12542.047] (EE) 7: /lib64/libgallium-24.3.2.so (?+0x0) [0x7fedc2cc54a1]
[ 12542.047] (EE) unw_get_proc_name failed: no unwind info found [-10]
[ 12542.047] (EE) 8: /lib64/libgallium-24.3.2.so (?+0x0) [0x7fedc2cfa49c]
[ 12542.048] (EE) 9: /lib64/libc.so.6 (start_thread+0x3b8) [0x7fedc5de2148]
[ 12542.048] (EE) 10: /lib64/libc.so.6 (__clone3+0x2c) [0x7fedc5e660cc]
[ 12542.048] (EE)
[ 12542.048] (EE)
Fatal server error:
[ 12542.048] (EE) Caught signal 6 (Aborted). Server aborting
Edited by Artem S. Tashkinov