The screen occasionally froze with amdgpu errors starting with GPU fault detected
Brief summary of the problem:
The screen froze twice when I was using Firefox Nightly 126.0a1 (and earlier) on Wayland in Plasma 6.0.2 (and earlier) on Wayland in a Fedora 40 KDE Plasma installation. The first crash was with 6.8-rc2 and the second with 6.8.1. I was looking at my Instagram feed pages in Firefox which had videos shown at the times of the crashes. The screen froze for at least 2 minutes then it went black. amdgpu errors were in the journal starting with "amdgpu: GPU fault detected" in the RDD child process of firefox which plays videos. Errors like amdgpu: IH ring buffer overflow were then shown. "[drm] ERROR [CRTC:51:crtc-0] flip_done timed out", "kwin_wayland_drm: Pageflip timed out! This is a kernel bug" and warnings in amdgpu_dm_atomic_commit_tail were repeated for minutes. The following is first couple minutes of the journal when the crash happened with 6.8.1.
Mar 26 08:43:00 kernel: gmc_v8_0_process_interrupt: 484 callbacks suppressed
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: GPU fault detected: 147 0x00020002 for process RDD Process pid 9307 thread firefox-bi:cs0 pid 9338
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00112400
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B000002
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM fault (0x02, vmid 5, pasid 32807) at page 1123328, write from 'CB0' (0x43423000) (0)
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: GPU fault detected: 147 0x00024002 for process RDD Process pid 9307 thread firefox-bi:cs0 pid 9338
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00112408
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B040002
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM fault (0x02, vmid 5, pasid 32807) at page 1123336, write from 'CB1' (0x43423100) (64)
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: GPU fault detected: 147 0x00020002 for process RDD Process pid 9307 thread firefox-bi:cs0 pid 9338
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00112432
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B040002
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM fault (0x02, vmid 5, pasid 32807) at page 1123378, write from 'CB1' (0x43423100) (64)
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: GPU fault detected: 147 0x00024002 for process RDD Process pid 9307 thread firefox-bi:cs0 pid 9338
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00112446
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B000002
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM fault (0x02, vmid 5, pasid 32807) at page 1123398, write from 'CB0' (0x43423000) (0)
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: GPU fault detected: 147 0x000a4002 for process RDD Process pid 9307 thread firefox-bi:cs0 pid 9338
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0011245A
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B040002
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM fault (0x02, vmid 5, pasid 32807) at page 1123418, write from 'CB1' (0x43423100) (64)
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: GPU fault detected: 147 0x000a0002 for process RDD Process pid 9307 thread firefox-bi:cs0 pid 9338
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0011246E
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B040002
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM fault (0x02, vmid 5, pasid 32807) at page 1123438, write from 'CB1' (0x43423100) (64)
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: GPU fault detected: 147 0x00120002 for process RDD Process pid 9307 thread firefox-bi:cs0 pid 9338
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00112480
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B040002
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM fault (0x02, vmid 5, pasid 32807) at page 1123456, write from 'CB1' (0x43423100) (64)
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: GPU fault detected: 147 0x000a4002 for process RDD Process pid 9307 thread firefox-bi:cs0 pid 9338
Mar 26 08:43:00 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00112491
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B000002
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: VM fault (0x02, vmid 5, pasid 32807) at page 1123473, write from 'CB0' (0x43423000) (0)
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: GPU fault detected: 147 0x00124002 for process RDD Process pid 9307 thread firefox-bi:cs0 pid 9338
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x001124A3
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B040002
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: VM fault (0x02, vmid 5, pasid 32807) at page 1123491, write from 'CB1' (0x43423100) (64)
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: GPU fault detected: 147 0x00024002 for process RDD Process pid 9307 thread firefox-bi:cs0 pid 9338
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x001124B4
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B040002
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: VM fault (0x02, vmid 5, pasid 32807) at page 1123508, write from 'CB1' (0x43423100) (64)
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: IH ring buffer overflow (0x00085390, 0x00003F00, 0x000053A0)
Mar 26 08:43:01 kernel: amdgpu 0000:00:01.0: amdgpu: IH ring buffer overflow (0x00084760, 0x00006130, 0x00004770)
Mar 26 08:43:11 kernel: amdgpu 0000:00:01.0: [drm] *ERROR* [CRTC:51:crtc-0] flip_done timed out
Mar 26 08:43:15 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:43:20 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:43:25 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:43:53 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:43:58 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:03 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:08 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:16 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:20 kwin_wayland[5832]: This plugin does not support raise()
Mar 26 08:44:20 kwin_wayland_wrapper[5832]: amdgpu: amdgpu_cs_ctx_create2 failed. (-13)
Mar 26 08:44:20 kwin_wayland_wrapper[5832]: amdgpu: amdgpu_cs_ctx_create2 failed. (-13)
Mar 26 08:44:21 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:26 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:27 uresourced[1249]: Setting resources on user-1000.slice (MemoryMin: 0, MemoryLow: 0, CPUWeight: 100, IOWeight: 100)
Mar 26 08:44:27 uresourced[1249]: Setting resources on user@1000.service (MemoryMin: 0, MemoryLow: 0, CPUWeight: 100, IOWeight: 100)
Mar 26 08:44:27 uresourced[1249]: Setting resources on user.slice (MemoryMin: 0, MemoryLow: 0, CPUWeight: -, IOWeight: -)
Mar 26 08:44:29 uresourced[1249]: Setting resources on user.slice (MemoryMin: 262144000, MemoryLow: 0, CPUWeight: -, IOWeight: -)
Mar 26 08:44:29 uresourced[1249]: Setting resources on user-1000.slice (MemoryMin: 262144000, MemoryLow: 0, CPUWeight: 500, IOWeight: 500)
Mar 26 08:44:29 uresourced[1249]: Setting resources on user@1000.service (MemoryMin: 262144000, MemoryLow: 0, CPUWeight: 500, IOWeight: 500)
Mar 26 08:44:29 wireplumber[5831]: GetManagedObjects() failed: org.freedesktop.DBus.Error.NameHasNoOwner
Mar 26 08:44:29 wireplumber[5831]: <WpPortalPermissionStorePlugin:0x564126971e20> Failed to call Lookup: GDBus.Error:org.freedesktop.portal.Error.NotFound: No entry for camera
Mar 26 08:44:31 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:36 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:41 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:46 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:51 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:56 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:44:58 kwin_wayland[5832]: kwin_wayland_drm: No drm events for gpu "/dev/dri/card1" within last 30 seconds
Mar 26 08:44:59 kwin_wayland[5832]: kwin_core: Could not find window with uuid "{5028cc34-e5ad-469f-b7d8-3d5d61201a65}"
Mar 26 08:44:59 kwin_wayland[5832]: kwin_core: Could not find window with uuid "{5028cc34-e5ad-469f-b7d8-3d5d61201a65}"
Mar 26 08:44:59 systemd[1]: Started getty@tty4.service - Getty on tty4.
Mar 26 08:44:59 audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=getty@tty4 comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Mar 26 08:44:59 agetty[10613]: failed to open credentials directory
Mar 26 08:44:59 kernel: fbcon: Taking over console
Mar 26 08:44:59 systemd[1]: systemd-vconsole-setup.service: Deactivated successfully.
Mar 26 08:44:59 systemd[1]: Stopped systemd-vconsole-setup.service - Virtual Console Setup.
Mar 26 08:44:59 systemd[1]: Stopping systemd-vconsole-setup.service - Virtual Console Setup...
Mar 26 08:44:59 audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=systemd-vconsole-setup comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Mar 26 08:44:59 kwin_wayland[5832]: kwin_core: Could not find window with uuid "{5028cc34-e5ad-469f-b7d8-3d5d61201a65}"
Mar 26 08:44:59 systemd[1]: Starting systemd-vconsole-setup.service - Virtual Console Setup...
Mar 26 08:44:59 uresourced[1249]: Setting resources on user-1000.slice (MemoryMin: 0, MemoryLow: 0, CPUWeight: 100, IOWeight: 100)
Mar 26 08:44:59 uresourced[1249]: Setting resources on user@1000.service (MemoryMin: 0, MemoryLow: 0, CPUWeight: 100, IOWeight: 100)
Mar 26 08:44:59 uresourced[1249]: Setting resources on user.slice (MemoryMin: 0, MemoryLow: 0, CPUWeight: -, IOWeight: -)
Mar 26 08:45:04 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:45:09 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:45:09 kernel: amdgpu 0000:00:01.0: [drm] *ERROR* flip_done timed out
Mar 26 08:45:09 kernel: amdgpu 0000:00:01.0: [drm] *ERROR* [CRTC:51:crtc-0] commit wait timed out
Mar 26 08:45:14 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:45:19 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:45:19 kernel: amdgpu 0000:00:01.0: [drm] *ERROR* flip_done timed out
Mar 26 08:45:19 kernel: amdgpu 0000:00:01.0: [drm] *ERROR* [PLANE:46:plane-2] commit wait timed out
Mar 26 08:45:24 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:45:29 kwin_wayland[5832]: kwin_wayland_drm: Pageflip timed out! This is a kernel bug
Mar 26 08:45:29 kernel: amdgpu 0000:00:01.0: [drm] *ERROR* flip_done timed out
Mar 26 08:45:29 kernel: amdgpu 0000:00:01.0: [drm] *ERROR* [PLANE:49:plane-3] commit wait timed out
Mar 26 08:45:29 kernel: ------------[ cut here ]------------
Mar 26 08:45:29 kernel: WARNING: CPU: 0 PID: 5137 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:8446 amdgpu_dm_atomic_commit_tail+0x3c9b/0x3e30 [amdgpu]
Mar 26 08:45:29 kernel: Modules linked in: uinput snd_seq_dummy snd_hrtimer nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nf_log_syslog nft_log nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables sunrpc binfmt_misc snd_ctl_led ledtrig_audio btusb snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi edac_mce_amd btrtl iwlmvm snd_hda_intel snd_intel_dspcfg btintel snd_intel_sdw_acpi kvm_amd btbcm ccp uvcvideo snd_hda_codec btmtk mac80211 uvc libarc4 snd_hda_core kvm snd_hwdep videobuf2_vmalloc videobuf2_memops hp_wmi videobuf2_v4l2 sparse_keymap videobuf2_common irqbypass bluetooth snd_seq iwlwifi platform_profile videodev snd_seq_device mc pcspkr snd_pcm wmi_bmof acpi_cpufreq cfg80211 i2c_piix4 fam15h_power k10temp rfkill snd_timer snd vfat soundcore fat i2c_scmi wireless_hotkey joydev loop dm_multipath nfnetlink zram amdgpu hid_logitech_hidpp crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni polyval_generic amdxcp
Mar 26 08:45:29 kernel: i2c_algo_bit drm_ttm_helper ghash_clmulni_intel ttm sha512_ssse3 drm_exec sha256_ssse3 sha1_ssse3 sp5100_tco wdat_wdt r8169 gpu_sched drm_suballoc_helper drm_buddy drm_display_helper realtek cec video wmi hid_multitouch hid_logitech_dj serio_raw scsi_dh_rdac scsi_dh_emc scsi_dh_alua ip6_tables ip_tables fuse i2c_dev
Mar 26 08:45:29 kernel: CPU: 0 PID: 5137 Comm: kworker/0:0 Not tainted 6.8.1-300.fc40.x86_64 #1
Mar 26 08:45:29 kernel: Hardware name: HP HP Laptop 15-bw0xx/8332, BIOS F.52 12/03/2019
Mar 26 08:45:29 kernel: Workqueue: events fbcon_register_existing_fbs
Mar 26 08:45:29 kernel: RIP: 0010:amdgpu_dm_atomic_commit_tail+0x3c9b/0x3e30 [amdgpu]
Mar 26 08:45:29 kernel: Code: 88 fe ff ff 48 8d 95 c4 fe ff ff 48 8b b1 50 01 00 00 48 8b b8 e0 a8 03 00 e8 91 df 26 00 4c 8b 9d 70 fe ff ff e9 91 f7 ff ff <0f> 0b e9 a5 f4 ff ff 0f 0b 0f 0b e9 9c c9 ff ff 0f 0b e9 c2 f4 ff
Mar 26 08:45:29 kernel: RSP: 0018:ffffbb9cc2c17950 EFLAGS: 00010002
Mar 26 08:45:29 kernel: RAX: 0000000000000286 RBX: 0000000000000286 RCX: ffff95585059b118
Mar 26 08:45:29 kernel: RDX: 0000000000000001 RSI: 0000000000000293 RDI: ffff95584c380178
Mar 26 08:45:29 kernel: RBP: ffffbb9cc2c17ba0 R08: ffffbb9cc2c178b4 R09: ffff9558453e7f60
Mar 26 08:45:29 kernel: R10: ffffbb9cc2c178ac R11: 0000000000000002 R12: ffff95585059b118
Mar 26 08:45:29 kernel: R13: 0000000000000000 R14: ffff95585059b000 R15: ffff955752874a00
Mar 26 08:45:29 kernel: FS: 0000000000000000(0000) GS:ffff955937400000(0000) knlGS:0000000000000000
Mar 26 08:45:29 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 26 08:45:29 kernel: CR2: 00007f18e03fa000 CR3: 00000001bf358000 CR4: 00000000001506f0
Mar 26 08:45:29 kernel: Call Trace:
Mar 26 08:45:29 kernel: <TASK>
Mar 26 08:45:29 kernel: ? amdgpu_dm_atomic_commit_tail+0x3c9b/0x3e30 [amdgpu]
Mar 26 08:45:29 kernel: ? __warn+0x81/0x130
Mar 26 08:45:29 kernel: ? amdgpu_dm_atomic_commit_tail+0x3c9b/0x3e30 [amdgpu]
Mar 26 08:45:29 kernel: ? report_bug+0x16f/0x1a0
Mar 26 08:45:29 kernel: ? handle_bug+0x3c/0x80
Mar 26 08:45:29 kernel: ? exc_invalid_op+0x17/0x70
Mar 26 08:45:29 kernel: ? asm_exc_invalid_op+0x1a/0x20
Mar 26 08:45:29 kernel: ? amdgpu_dm_atomic_commit_tail+0x3c9b/0x3e30 [amdgpu]
Mar 26 08:45:29 kernel: ? mutex_lock_io+0x10/0x50
Mar 26 08:45:29 kernel: ? __pfx_cpumask_weight.constprop.0+0x1/0x10
Mar 26 08:45:29 kernel: commit_tail+0x94/0x130
Mar 26 08:45:29 kernel: drm_atomic_helper_commit+0x11a/0x140
Mar 26 08:45:29 kernel: drm_atomic_commit+0x98/0xd0
Mar 26 08:45:29 kernel: ? __pfx___drm_printfn_info+0x10/0x10
Mar 26 08:45:29 kernel: drm_client_modeset_commit_atomic+0x203/0x250
Mar 26 08:45:29 kernel: drm_client_modeset_commit_locked+0x5a/0x160
Mar 26 08:45:29 kernel: drm_client_modeset_commit+0x25/0x40
Mar 26 08:45:29 kernel: __drm_fb_helper_restore_fbdev_mode_unlocked+0x85/0xd0
Mar 26 08:45:29 kernel: drm_fb_helper_set_par+0x30/0x40
Mar 26 08:45:29 kernel: fbcon_init+0x297/0x540
Mar 26 08:45:29 kernel: visual_init+0xcc/0x120
Mar 26 08:45:29 kernel: do_bind_con_driver.isra.0+0x1b3/0x3b0
Mar 26 08:45:29 kernel: do_take_over_console+0x107/0x190
Mar 26 08:45:29 kernel: do_fbcon_takeover+0x5a/0xc0
Mar 26 08:45:29 kernel: fbcon_register_existing_fbs+0x3f/0x70
Mar 26 08:45:29 kernel: process_one_work+0x170/0x330
Mar 26 08:45:29 kernel: worker_thread+0x273/0x3c0
Mar 26 08:45:29 kernel: ? __pfx_worker_thread+0x10/0x10
Mar 26 08:45:29 kernel: kthread+0xe8/0x120
Mar 26 08:45:29 kernel: ? __pfx_kthread+0x10/0x10
Mar 26 08:45:29 kernel: ret_from_fork+0x34/0x50
Mar 26 08:45:29 kernel: ? __pfx_kthread+0x10/0x10
Mar 26 08:45:29 kernel: ret_from_fork_asm+0x1b/0x30
Mar 26 08:45:29 kernel: </TASK>
Mar 26 08:45:29 kernel: ---[ end trace 0000000000000000 ]---
Mar 26 08:45:29 kernel: ------------[ cut here ]------------
Mar 26 08:45:29 kernel: WARNING: CPU: 0 PID: 5137 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:7977 amdgpu_dm_atomic_commit_tail+0x3cab/0x3e30 [amdgpu]
Mar 26 08:45:29 kernel: Modules linked in: uinput snd_seq_dummy snd_hrtimer nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nf_log_syslog nft_log nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables sunrpc binfmt_misc snd_ctl_led ledtrig_audio btusb snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi edac_mce_amd btrtl iwlmvm snd_hda_intel snd_intel_dspcfg btintel snd_intel_sdw_acpi kvm_amd btbcm ccp uvcvideo snd_hda_codec btmtk mac80211 uvc libarc4 snd_hda_core kvm snd_hwdep videobuf2_vmalloc videobuf2_memops hp_wmi videobuf2_v4l2 sparse_keymap videobuf2_common irqbypass bluetooth snd_seq iwlwifi platform_profile videodev snd_seq_device mc pcspkr snd_pcm wmi_bmof acpi_cpufreq cfg80211 i2c_piix4 fam15h_power k10temp rfkill snd_timer snd vfat soundcore fat i2c_scmi wireless_hotkey joydev loop dm_multipath nfnetlink zram amdgpu hid_logitech_hidpp crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni polyval_generic amdxcp
Mar 26 08:45:29 kernel: i2c_algo_bit drm_ttm_helper ghash_clmulni_intel ttm sha512_ssse3 drm_exec sha256_ssse3 sha1_ssse3 sp5100_tco wdat_wdt r8169 gpu_sched drm_suballoc_helper drm_buddy drm_display_helper realtek cec video wmi hid_multitouch hid_logitech_dj serio_raw scsi_dh_rdac scsi_dh_emc scsi_dh_alua ip6_tables ip_tables fuse i2c_dev
Mar 26 08:45:29 kernel: CPU: 0 PID: 5137 Comm: kworker/0:0 Tainted: G W 6.8.1-300.fc40.x86_64 #1
Mar 26 08:45:29 kernel: Hardware name: HP HP Laptop 15-bw0xx/8332, BIOS F.52 12/03/2019
Mar 26 08:45:29 kernel: Workqueue: events fbcon_register_existing_fbs
Mar 26 08:45:29 kernel: RIP: 0010:amdgpu_dm_atomic_commit_tail+0x3cab/0x3e30 [amdgpu]
Mar 26 08:45:29 kernel: Code: 00 00 48 8b b8 e0 a8 03 00 e8 91 df 26 00 4c 8b 9d 70 fe ff ff e9 91 f7 ff ff 0f 0b e9 a5 f4 ff ff 0f 0b 0f 0b e9 9c c9 ff ff <0f> 0b e9 c2 f4 ff ff 48 89 f9 49 8b 7d 28 48 39 79 28 41 0f 95 c1
Mar 26 08:45:29 kernel: RSP: 0018:ffffbb9cc2c17950 EFLAGS: 00010086
Mar 26 08:45:29 kernel: RAX: ffff95585059b000 RBX: 0000000000000286 RCX: ffff95585059b118
Mar 26 08:45:29 kernel: RDX: 0000000000000001 RSI: 0000000000000293 RDI: ffff95584c380178
Mar 26 08:45:29 kernel: RBP: ffffbb9cc2c17ba0 R08: ffffbb9cc2c178b4 R09: ffff9558453e7f60
Mar 26 08:45:29 kernel: R10: ffffbb9cc2c178ac R11: 0000000000000002 R12: ffff95585059b118
Mar 26 08:45:29 kernel: R13: 0000000000000000 R14: ffff95585059b000 R15: ffff955752874a00
Mar 26 08:45:29 kernel: FS: 0000000000000000(0000) GS:ffff955937400000(0000) knlGS:0000000000000000
Mar 26 08:45:29 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 26 08:45:29 kernel: CR2: 00007f18e03fa000 CR3: 00000001bf358000 CR4: 00000000001506f0
Mar 26 08:45:29 kernel: Call Trace:
Mar 26 08:45:29 kernel: <TASK>
Mar 26 08:45:29 kernel: ? amdgpu_dm_atomic_commit_tail+0x3cab/0x3e30 [amdgpu]
Mar 26 08:45:29 kernel: ? __warn+0x81/0x130
Mar 26 08:45:29 kernel: ? amdgpu_dm_atomic_commit_tail+0x3cab/0x3e30 [amdgpu]
Mar 26 08:45:29 kernel: ? report_bug+0x16f/0x1a0
Mar 26 08:45:29 kernel: ? handle_bug+0x3c/0x80
Mar 26 08:45:29 kernel: ? exc_invalid_op+0x17/0x70
Mar 26 08:45:29 kernel: ? asm_exc_invalid_op+0x1a/0x20
Mar 26 08:45:29 kernel: ? amdgpu_dm_atomic_commit_tail+0x3cab/0x3e30 [amdgpu]
Mar 26 08:45:29 kernel: ? mutex_lock_io+0x10/0x50
Mar 26 08:45:29 kernel: ? __pfx_cpumask_weight.constprop.0+0x1/0x10
Mar 26 08:45:29 kernel: commit_tail+0x94/0x130
Mar 26 08:45:29 kernel: drm_atomic_helper_commit+0x11a/0x140
Mar 26 08:45:29 kernel: drm_atomic_commit+0x98/0xd0
Mar 26 08:45:29 kernel: ? __pfx___drm_printfn_info+0x10/0x10
Mar 26 08:45:29 kernel: drm_client_modeset_commit_atomic+0x203/0x250
Mar 26 08:45:29 kernel: drm_client_modeset_commit_locked+0x5a/0x160
Mar 26 08:45:29 kernel: drm_client_modeset_commit+0x25/0x40
Mar 26 08:45:29 kernel: __drm_fb_helper_restore_fbdev_mode_unlocked+0x85/0xd0
Mar 26 08:45:29 kernel: drm_fb_helper_set_par+0x30/0x40
Mar 26 08:45:29 kernel: fbcon_init+0x297/0x540
Mar 26 08:45:29 kernel: visual_init+0xcc/0x120
Mar 26 08:45:29 kernel: do_bind_con_driver.isra.0+0x1b3/0x3b0
Mar 26 08:45:29 kernel: do_take_over_console+0x107/0x190
Mar 26 08:45:29 kernel: do_fbcon_takeover+0x5a/0xc0
Mar 26 08:45:29 kernel: fbcon_register_existing_fbs+0x3f/0x70
Mar 26 08:45:29 kernel: process_one_work+0x170/0x330
Mar 26 08:45:29 kernel: worker_thread+0x273/0x3c0
Mar 26 08:45:29 kernel: ? __pfx_worker_thread+0x10/0x10
Mar 26 08:45:29 kernel: kthread+0xe8/0x120
Mar 26 08:45:29 kernel: ? __pfx_kthread+0x10/0x10
Mar 26 08:45:29 kernel: ret_from_fork+0x34/0x50
Mar 26 08:45:29 kernel: ? __pfx_kthread+0x10/0x10
Mar 26 08:45:29 kernel: ret_from_fork_asm+0x1b/0x30
Mar 26 08:45:29 kernel: </TASK>
Mar 26 08:45:29 kernel: ---[ end trace 0000000000000000 ]---
I tried to switch to another VT which was shown minutes later but was unresponsive. I used Sysrq+alt+r,e,i,s,u,b. The problem would be difficult to bisect since it happened infrequently.
Hardware description:
- CPU: AMD A10-9620P
- GPU: integrated AMD Radeon R5. 00:01.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Wani [Radeon R5/R6/R7 Graphics] [1002:9874] (rev ca)
- System Memory: 8 GB
- Display(s): Integrated Elan touchscreen
- Type of Display Connection: eDP
System information:
- Distro name and Version: Fedora 40
- Kernel version: 6.8.1
- Custom kernel: N/A
- AMD official driver version: N/A
How to reproduce the issue:
- Boot a Fedora 40 KDE Plasma installation updated to 2024-3-26 with updates-testing enabled
- Log in to Plasma 6.0.2 on Wayland
- Install and enable the RPM Fusion free repository with sudo dnf install https://mirrors.rpmfusion.org/free/fedora/rpmfusion-free-release-$(rpm -E %fedora).noarch.rpm
- I have mesa-va-drivers-freeworld-24.0.0-3.fc40 from RPM Fusion installed which has h264 and h265 hardware acceleration enabled for AMD GPUs. If mesa-va-drivers from the Fedora mesa package isn't installed, install mesa-va-drivers-freeworld with sudo dnf install mesa-va-drivers-freeworld If mesa-va-drivers from the Fedora mesa package is installed, use sudo dnf swap mesa-va-drivers mesa-va-drivers-freeworld
- Download Firefox Nightly 126.0a1 (2024-3-26) from https://www.mozilla.org/en-US/firefox/all/#product-desktop-nightly
- Decompress the Firefox Nightly file
- Run Firefox Nightly on Wayland
- Log in to your Instagram account in Firefox if you have one
- scroll up and down the Instagram news feed which contains videos. The crash might take a long time to reproduce since it doesn't normally happen.