RX6800XT occasionally freezes for minutes
Hi, I my RX6800XT PC recently freezes occasionally, about one time a day on average. I checked several similar issues and they seems specific to 780m, 880m or other laptops.
Specs: os: archlinux kernel: 6.12.6 PC specs: 12600kf + 6800xt + 48gb dram
Here is the log from journalctl
Dec 23 23:20:42 ykpcx kernel: amdgpu 0000:03:00.0: [drm] *ERROR* [CRTC:91:crtc-0] flip_done timed out
Dec 23 23:21:37 ykpcx kernel: amdgpu 0000:03:00.0: [drm] *ERROR* flip_done timed out
Dec 23 23:21:37 ykpcx kernel: amdgpu 0000:03:00.0: [drm] *ERROR* [CRTC:91:crtc-0] commit wait timed out
Dec 23 23:21:47 ykpcx kernel: amdgpu 0000:03:00.0: [drm] *ERROR* flip_done timed out
Dec 23 23:21:47 ykpcx kernel: amdgpu 0000:03:00.0: [drm] *ERROR* [PLANE:88:plane-8] commit wait timed out
Dec 23 23:21:47 ykpcx kernel: ------------[ cut here ]------------
Dec 23 23:21:47 ykpcx kernel: WARNING: CPU: 12 PID: 659 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:8609 amdgpu_dm_atomic_commit_tail+0x3b4f/0x3c30 [amdgpu]
Dec 23 23:21:47 ykpcx kernel: Modules linked in: tcp_diag inet_diag snd_seq_dummy rfcomm snd_hrtimer snd_seq cmac algif_hash algif_skcipher af_alg bnep nct6683 snd_sof_pci_intel_tgl snd_sof_pci_intel_cnl snd_sof_intel_hda_generic soundwire_intel soundwire_cadence vfat fat snd_sof_intel_hda_common intel_rapl_msr snd_soc_hdac_hda intel_rapl_common snd_sof_intel_hda_mlink intel_uncore_frequency snd_sof_intel_hda intel_uncore_frequency_common snd_sof_pci intel_tcc_cooling snd_sof_xtensa_dsp x86_pkg_temp_thermal snd_sof intel_powerclamp amdgpu snd_sof_utils iwlmvm snd_soc_acpi_intel_match coretemp soundwire_generic_allocation snd_soc_acpi soundwire_bus kvm_intel snd_soc_avs snd_soc_hda_codec snd_hda_codec_realtek snd_hda_ext_core kvm snd_hda_codec_generic crct10dif_pclmul mac80211 snd_hda_scodec_component snd_soc_core crc32_pclmul polyval_clmulni snd_compress polyval_generic snd_hda_codec_hdmi ac97_bus libarc4 ghash_clmulni_intel snd_pcm_dmaengine sha512_ssse3 amdxcp sha1_ssse3 snd_hda_intel drm_exec aesni_intel snd_intel_dspcfg
Dec 23 23:21:47 ykpcx kernel: snd_usb_audio gpu_sched btusb gf128mul iwlwifi snd_intel_sdw_acpi btrtl crypto_simd drm_buddy snd_usbmidi_lib cryptd snd_hda_codec btintel iTCO_wdt i2c_algo_bit btbcm snd_ump intel_pmc_bxt drm_suballoc_helper snd_hda_core snd_rawmidi rapl iTCO_vendor_support ee1004 wmi_bmof mxm_wmi igc snd_seq_device snd_hwdep btmtk drm_ttm_helper intel_cstate cfg80211 bluetooth snd_pcm ucsi_ccg spi_nor ttm i2c_i801 intel_uncore ptp mei_me typec_ucsi i2c_smbus drm_display_helper mtd pcspkr snd_timer mei pps_core i2c_mux typec cec snd rfkill roles mc soundcore crc16 video intel_pmc_core intel_vsec mousedev joydev serial_multi_instantiate pmt_telemetry wmi pmt_class acpi_pad pinctrl_alderlake acpi_tad mac_hid sg crypto_user loop dm_mod nfnetlink zram 842_decompress 842_compress lz4hc_compress lz4_compress ip_tables x_tables hid_generic usbhid nvme nvme_core sha256_ssse3 spi_intel_pci spi_intel nvme_auth uas usb_storage btrfs blake2b_generic libcrc32c crc32c_generic crc32c_intel xor raid6_pq
Dec 23 23:21:47 ykpcx kernel: CPU: 12 UID: 0 PID: 659 Comm: systemd-logind Tainted: G W 6.12.4-arch1-1 #1 7c6e9657ff18c9ac01abfce09529b4cbe186f917
Dec 23 23:21:47 ykpcx kernel: Tainted: [W]=WARN
Dec 23 23:21:47 ykpcx kernel: Hardware name: Micro-Star International Co., Ltd. MS-7D25/PRO Z690-A WIFI DDR4(MS-7D25), BIOS 1.A0 01/09/2023
Dec 23 23:21:47 ykpcx kernel: RIP: 0010:amdgpu_dm_atomic_commit_tail+0x3b4f/0x3c30 [amdgpu]
Dec 23 23:21:47 ykpcx kernel: Code: 49 e4 e9 dc fd ff ff 49 8d 87 50 31 04 00 c6 85 38 fe ff ff 00 48 89 85 48 fe ff ff e9 d8 cb ff ff 0f 0b e9 fc f2 ff ff 0f 0b <0f> 0b e9 12 f3 ff ff 0f 0b e9 11 cc ff ff 48 c7 85 28 fe ff ff 00
Dec 23 23:21:47 ykpcx kernel: RSP: 0018:ffffbf7788e0b470 EFLAGS: 00010082
Dec 23 23:21:47 ykpcx kernel: RAX: 0000000000000001 RBX: 0000000000000286 RCX: ffffa0fd861e5118
Dec 23 23:21:47 ykpcx kernel: RDX: 0000000000000001 RSI: 0000000000000297 RDI: ffffa0fda6180178
Dec 23 23:21:47 ykpcx kernel: RBP: ffffbf7788e0b6c0 R08: ffffbf7788e0b35c R09: 0000000000000000
Dec 23 23:21:47 ykpcx kernel: R10: ffffbf7788e0b3c8 R11: ffffbf7788e0b3cc R12: ffffbf7788e0b528
Dec 23 23:21:47 ykpcx kernel: R13: 0000000000000000 R14: ffffa1005f7ad000 R15: ffffa0fd861e5000
Dec 23 23:21:47 ykpcx kernel: FS: 0000763102ef5900(0000) GS:ffffa108cfc00000(0000) knlGS:0000000000000000
Dec 23 23:21:47 ykpcx kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Dec 23 23:21:47 ykpcx kernel: CR2: 00007450e0d8fe68 CR3: 000000012f9d2000 CR4: 0000000000f52ef0
Dec 23 23:21:47 ykpcx kernel: PKRU: 55555554
Dec 23 23:21:47 ykpcx kernel: Call Trace:
Dec 23 23:21:47 ykpcx kernel: <TASK>
Dec 23 23:21:47 ykpcx kernel: ? amdgpu_dm_atomic_commit_tail+0x3b4f/0x3c30 [amdgpu f86366d5e18d6e04f864e4ac42180b398aeb6a75]
Dec 23 23:21:47 ykpcx kernel: ? __warn.cold+0x93/0xf6
Dec 23 23:21:47 ykpcx kernel: ? amdgpu_dm_atomic_commit_tail+0x3b4f/0x3c30 [amdgpu f86366d5e18d6e04f864e4ac42180b398aeb6a75]
Dec 23 23:21:47 ykpcx kernel: ? report_bug+0xff/0x140
Dec 23 23:21:47 ykpcx kernel: ? handle_bug+0x58/0x90
Dec 23 23:21:47 ykpcx kernel: ? exc_invalid_op+0x17/0x70
Dec 23 23:21:47 ykpcx kernel: ? asm_exc_invalid_op+0x1a/0x20
Dec 23 23:21:47 ykpcx kernel: ? amdgpu_dm_atomic_commit_tail+0x3b4f/0x3c30 [amdgpu f86366d5e18d6e04f864e4ac42180b398aeb6a75]
Dec 23 23:21:47 ykpcx kernel: commit_tail+0x91/0x130
Dec 23 23:21:47 ykpcx kernel: drm_atomic_helper_commit+0x11a/0x140
Dec 23 23:21:47 ykpcx kernel: drm_atomic_commit+0xa6/0xe0
Dec 23 23:21:47 ykpcx kernel: ? __pfx___drm_printfn_info+0x10/0x10
Dec 23 23:21:47 ykpcx kernel: drm_client_modeset_commit_atomic+0x203/0x250
Dec 23 23:21:47 ykpcx kernel: drm_client_modeset_commit_locked+0x5a/0x160
Dec 23 23:21:47 ykpcx kernel: __drm_fb_helper_restore_fbdev_mode_unlocked+0x5e/0xd0
Dec 23 23:21:47 ykpcx kernel: drm_fb_helper_set_par+0x30/0x40
Dec 23 23:21:47 ykpcx kernel: fb_set_var+0x25c/0x460
Dec 23 23:21:47 ykpcx kernel: ? update_load_avg+0x7e/0x7b0
Dec 23 23:21:47 ykpcx kernel: ? __dequeue_entity+0x3f5/0x4b0
Dec 23 23:21:47 ykpcx kernel: ? sched_clock+0x10/0x30
Dec 23 23:21:47 ykpcx kernel: ? sched_clock_cpu+0xf/0x1d0
Dec 23 23:21:47 ykpcx kernel: ? psi_group_change+0x13b/0x310
Dec 23 23:21:47 ykpcx kernel: fbcon_blank+0x271/0x330
Dec 23 23:21:47 ykpcx kernel: do_unblank_screen+0xad/0x150
Dec 23 23:21:47 ykpcx kernel: complete_change_console+0x54/0x120
Dec 23 23:21:47 ykpcx kernel: vt_ioctl+0xec3/0x12c0
Dec 23 23:21:47 ykpcx kernel: ? syscall_exit_to_user_mode+0x37/0x1c0
Dec 23 23:21:47 ykpcx kernel: tty_ioctl+0xe2/0x8a0
Dec 23 23:21:47 ykpcx kernel: ? __seccomp_filter+0x303/0x520
Dec 23 23:21:47 ykpcx kernel: __x64_sys_ioctl+0x91/0xd0
Dec 23 23:21:47 ykpcx kernel: do_syscall_64+0x82/0x190
Dec 23 23:21:47 ykpcx kernel: ? evdev_ioctl+0x6f/0x90
Dec 23 23:21:47 ykpcx kernel: ? syscall_exit_to_user_mode+0x37/0x1c0
Dec 23 23:21:47 ykpcx kernel: ? do_syscall_64+0x8e/0x190
Dec 23 23:21:47 ykpcx kernel: ? syscall_exit_to_user_mode+0x37/0x1c0
Dec 23 23:21:47 ykpcx kernel: ? do_syscall_64+0x8e/0x190
Dec 23 23:21:47 ykpcx kernel: ? syscall_exit_to_user_mode+0x37/0x1c0
Dec 23 23:21:47 ykpcx kernel: ? do_syscall_64+0x8e/0x190
Dec 23 23:21:47 ykpcx kernel: ? syscall_exit_to_user_mode+0x37/0x1c0
Dec 23 23:21:47 ykpcx kernel: ? do_syscall_64+0x8e/0x190
Dec 23 23:21:47 ykpcx kernel: ? syscall_exit_to_user_mode+0x37/0x1c0
Dec 23 23:21:47 ykpcx kernel: ? do_syscall_64+0x8e/0x190
Dec 23 23:21:47 ykpcx kernel: ? do_user_addr_fault+0x36c/0x620
Dec 23 23:21:47 ykpcx kernel: entry_SYSCALL_64_after_hwframe+0x76/0x7e
Dec 23 23:21:47 ykpcx kernel: RIP: 0033:0x763103123ced
Dec 23 23:21:47 ykpcx kernel: Code: 04 25 28 00 00 00 48 89 45 c8 31 c0 48 8d 45 10 c7 45 b0 10 00 00 00 48 89 45 b8 48 8d 45 d0 48 89 45 c0 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 1a 48 8b 45 c8 64 48 2b 04 25 28 00 00 00
Dec 23 23:21:47 ykpcx kernel: RSP: 002b:00007ffcb8ea59d0 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Dec 23 23:21:47 ykpcx kernel: RAX: ffffffffffffffda RBX: 0000000000000023 RCX: 0000763103123ced
Dec 23 23:21:47 ykpcx kernel: RDX: 0000000000000001 RSI: 0000000000005605 RDI: 0000000000000023
Dec 23 23:21:47 ykpcx kernel: RBP: 00007ffcb8ea5a20 R08: 00007ffcb8ea59b0 R09: 000060c9e07f6860
Dec 23 23:21:47 ykpcx kernel: R10: 00007ffcb8ea5a00 R11: 0000000000000246 R12: 0000000000000000
Dec 23 23:21:47 ykpcx kernel: R13: 00007ffcb8ea5ab0 R14: 000060c9e07c71c0 R15: 000060c9e07c81c0
Dec 23 23:21:47 ykpcx kernel: </TASK>
Dec 23 23:21:47 ykpcx kernel: ---[ end trace 0000000000000000 ]---