amdgpu bug
When plugging my laptop into it's USB-C Dock, amdgpu spams the journal, and I can no longer run two screens at 2560x1440. This started on 2/15/2022 after updating. I have been able to get both screens working now, but I have to run one at 1080p and one at 1440p. It used to work great with both at 1440p. When unplugging, it often will not re-wake the laptop screen. This is all new behavior since that update. I've updated to a few other kernels I'll detail below.
Hardware description:
- CPU: AMD Ryzen 5 5500U with Radeon Graphics
- GPU: 05:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Lucienne [1002:164c] (rev c2)
- System Memory: 40GB (4GB allocated to GPU)
- Display(s): 1080p laptop screen, 4k and 2k hooked to dock (run both at 2k)
- Type of Display Connection: USB-C to Dock. Displayport and HDMI
System information:
- Distro name and Version: Opensuse Tumbleweed-20220306
- Kernel version: 5.17.0-rc7-1.g04b7727-default
- Custom kernel: from OBS home_tiwai_kernel_drm-tip
- AMD official driver version: N/A (built into kernel)
Also tested with latest tumbleweed kernel, and http://download.opensuse.org/repositories/Kernel:/HEAD/standard/ kernel-repo Opensuse Bugzilla: https://bugzilla.suse.com/show_bug.cgi?id=1196778
How to reproduce the issue:
- Plug into USB-C Anker dock
Attached files:
Log files (for system lockups / game freezes / crashes)
2022-02-18T09:08:02.625936-05:00 localhost kernel: [ 91.051548][ T5215] ------------[ cut here ]------------
2022-02-18T09:08:02.625954-05:00 localhost kernel: [ 91.051554][ T5215] WARNING: CPU: 10 PID: 5215 at drivers/gpu/drm/drm_dp_mst_topology.c:4404 drm_dp_atomic_find_vcpi_slots.cold+0x0/0x24
2022-02-18T09:08:02.625958-05:00 localhost kernel: [ 91.051567][ T5215] Modules linked in: rfcomm xt_CHECKSUM xt_conntrack ipt_REJECT xt_tcpudp nf_nat_tftp nf_conntrack_tftp bridge stp llc xt_mark xt_MASQUERADE xt_comment tun nft_objref af_packet nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_tables ebtable_nat ebtable_broute ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_mangle iptable_raw iptable_security ip_set nfnetlink snd_usb_audio snd_usbmidi_lib ebtable_filter snd_rawmidi snd_seq_device ebtables ip6table_filter ip6_tables iptable_filter bpfilter cmac algif_hash algif_skcipher af_alg bnep dmi_sysfs r8153_ecm cdc_ether usbnet r8152 mii btusb btrtl btbcm btintel bluetooth uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev ecdh_generic mc intel_rapl_msr intel_rapl_common edac_mce_amd
2022-02-18T09:08:02.625960-05:00 localhost kernel: [ 91.051650][ T5215] kvm_amd kvm irqbypass snd_ctl_led snd_hda_codec_realtek pcspkr snd_hda_codec_generic snd_hda_codec_hdmi joydev snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec snd_hda_core snd_pci_acp6x r8169 snd_hwdep thinkpad_acpi snd_pcm snd_pci_acp5x realtek mdio_devres think_lmi snd_rn_pci_acp3x ledtrig_audio efi_pstore firmware_attributes_class wmi_bmof k10temp snd_timer i2c_piix4 snd_pci_acp3x libphy platform_profile snd soundcore tiny_power_button ac ucsi_acpi i2c_scmi typec_ucsi typec roles acpi_cpufreq button nls_iso8859_1 nls_cp437 vfat fat fuse configfs ip_tables x_tables dm_crypt essiv authenc trusted asn1_encoder tee hid_logitech_hidpp hid_logitech_dj hid_generic usbhid rtw89pci(OE) rtw89core(OE) amdgpu crct10dif_pclmul crc32_pclmul ghash_clmulni_intel mac80211 libarc4 xhci_pci xhci_pci_renesas cfg80211 xhci_hcd aesni_intel crypto_simd cryptd usbcore drm_ttm_helper nvme ttm serio_raw ccp iommu_v2 nvme_core sp5100_tco gpu_sched rfkill wmi battery
2022-02-18T09:08:02.625962-05:00 localhost kernel: [ 91.051742][ T5215] video btrfs blake2b_generic libcrc32c crc32c_intel xor raid6_pq sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua msr efivarfs
2022-02-18T09:08:02.625964-05:00 localhost kernel: [ 91.051759][ T5215] CPU: 10 PID: 5215 Comm: kwin_wayland Tainted: G OE 5.16.8-1-default #1 openSUSE Tumbleweed 257f8f36371552cd38032922fd021edb6811ecdc
2022-02-18T09:08:02.625967-05:00 localhost kernel: [ 91.051766][ T5215] Hardware name: LENOVO 20Y7003EUS/20Y7003EUS, BIOS R1OET31W (1.10 ) 11/29/2021
2022-02-18T09:08:02.625968-05:00 localhost kernel: [ 91.051770][ T5215] RIP: 0010:drm_dp_atomic_find_vcpi_slots.cold+0x0/0x24
2022-02-18T09:08:02.625970-05:00 localhost kernel: [ 91.051776][ T5215] Code: 28 ea 57 a4 e8 a4 7c fe ff 49 8b 45 58 48 89 ea 48 c7 c6 70 88 90 a4 48 8b 78 08 e8 78 14 00 00 b8 f4 ff ff ff e9 55 fc c5 ff <0f> 0b 49 8b 46 58 4c 89 e2 48 c7 c6 a0 88 90 a4 41 bf ea ff ff ff
2022-02-18T09:08:02.625972-05:00 localhost kernel: [ 91.051779][ T5215] RSP: 0018:ffffc35b81daf790 EFLAGS: 00010246
2022-02-18T09:08:02.625973-05:00 localhost kernel: [ 91.051783][ T5215] RAX: ffff9f65d94afd98 RBX: ffff9f65d94afd80 RCX: 0000000000000000
2022-02-18T09:08:02.625974-05:00 localhost kernel: [ 91.051786][ T5215] RDX: ffff9f65d94afcc0 RSI: ffff9f661018a500 RDI: ffff9f6660a7d580
2022-02-18T09:08:02.625976-05:00 localhost kernel: [ 91.051788][ T5215] RBP: ffff9f65d94af508 R08: 0000000000000013 R09: ffffc35b81daf8a8
2022-02-18T09:08:02.625977-05:00 localhost kernel: [ 91.051790][ T5215] R10: 00000000000001b5 R11: 0000000000000000 R12: ffff9f65d3d8b800
2022-02-18T09:08:02.625979-05:00 localhost kernel: [ 91.051792][ T5215] R13: ffff9f65d94af500 R14: ffff9f661018a500 R15: 0000000000000013
2022-02-18T09:08:02.625980-05:00 localhost kernel: [ 91.051794][ T5215] FS: 00007fa96d006e40(0000) GS:ffff9f6860080000(0000) knlGS:0000000000000000
2022-02-18T09:08:02.625981-05:00 localhost kernel: [ 91.051798][ T5215] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
2022-02-18T09:08:02.625982-05:00 localhost kernel: [ 91.051800][ T5215] CR2: 00007fa95c50d0e0 CR3: 000000014a278000 CR4: 0000000000350ee0
2022-02-18T09:08:02.626005-05:00 localhost kernel: [ 91.051803][ T5215] Call Trace:
2022-02-18T09:08:02.626007-05:00 localhost kernel: [ 91.051806][ T5215] <TASK>
2022-02-18T09:08:02.626008-05:00 localhost kernel: [ 91.051812][ T5215] ? dc_link_should_enable_fec+0x3d/0x60 [amdgpu 24ef79240fc120e4583a1855d40c25a864867216]
2022-02-18T09:08:02.630215-05:00 localhost kernel: [ 91.052250][ T5215] compute_mst_dsc_configs_for_link+0x7dd/0xa20 [amdgpu 24ef79240fc120e4583a1855d40c25a864867216]
2022-02-18T09:08:02.630227-05:00 localhost kernel: [ 91.052432][ T5215] compute_mst_dsc_configs_for_state+0x1eb/0x340 [amdgpu 24ef79240fc120e4583a1855d40c25a864867216]
2022-02-18T09:08:02.630230-05:00 localhost kernel: [ 91.052605][ T5215] amdgpu_dm_atomic_check+0xce4/0xf00 [amdgpu 24ef79240fc120e4583a1855d40c25a864867216]
2022-02-18T09:08:02.630232-05:00 localhost kernel: [ 91.052777][ T5215] drm_atomic_check_only+0x595/0x8d0
2022-02-18T09:08:02.630234-05:00 localhost kernel: [ 91.052781][ T5215] drm_mode_atomic_ioctl+0x73c/0xaf0
2022-02-18T09:08:02.630236-05:00 localhost kernel: [ 91.052784][ T5215] ? drm_plane_get_damage_clips.cold+0x1c/0x1c
2022-02-18T09:08:02.630237-05:00 localhost kernel: [ 91.052788][ T5215] ? drm_atomic_set_property+0xb50/0xb50
2022-02-18T09:08:02.630238-05:00 localhost kernel: [ 91.052789][ T5215] drm_ioctl_kernel+0xb0/0x140
2022-02-18T09:08:02.630239-05:00 localhost kernel: [ 91.052794][ T5215] drm_ioctl+0x220/0x3e0
2022-02-18T09:08:02.630241-05:00 localhost kernel: [ 91.052796][ T5215] ? drm_atomic_set_property+0xb50/0xb50
2022-02-18T09:08:02.630243-05:00 localhost kernel: [ 91.052797][ T5215] ? amdgpu_drm_ioctl+0x6a/0x80 [amdgpu 24ef79240fc120e4583a1855d40c25a864867216]
2022-02-18T09:08:02.630246-05:00 localhost kernel: [ 91.052938][ T5215] amdgpu_drm_ioctl+0x49/0x80 [amdgpu 24ef79240fc120e4583a1855d40c25a864867216]
2022-02-18T09:08:02.630247-05:00 localhost kernel: [ 91.053078][ T5215] __x64_sys_ioctl+0x82/0xb0
2022-02-18T09:08:02.630249-05:00 localhost kernel: [ 91.053081][ T5215] do_syscall_64+0x5c/0x80
2022-02-18T09:08:02.630250-05:00 localhost kernel: [ 91.053084][ T5215] ? __x64_sys_ioctl+0x9a/0xb0
2022-02-18T09:08:02.630251-05:00 localhost kernel: [ 91.053085][ T5215] ? syscall_exit_to_user_mode+0x18/0x40
2022-02-18T09:08:02.630252-05:00 localhost kernel: [ 91.053088][ T5215] ? do_syscall_64+0x69/0x80
2022-02-18T09:08:02.630254-05:00 localhost kernel: [ 91.053089][ T5215] ? do_syscall_64+0x69/0x80
2022-02-18T09:08:02.630265-05:00 localhost kernel: [ 91.053091][ T5215] ? do_syscall_64+0x69/0x80
2022-02-18T09:08:02.630267-05:00 localhost kernel: [ 91.053092][ T5215] ? exc_page_fault+0x68/0x150
2022-02-18T09:08:02.630268-05:00 localhost kernel: [ 91.053094][ T5215] entry_SYSCALL_64_after_hwframe+0x44/0xae
2022-02-18T09:08:02.630270-05:00 localhost kernel: [ 91.053097][ T5215] RIP: 0033:0x7fa96f8a688b
2022-02-18T09:08:02.630272-05:00 localhost kernel: [ 91.053099][ T5215] Code: ff ff ff 85 c0 79 9b 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 5d 75 0f 00 f7 d8 64 89 01 48
2022-02-18T09:08:02.630274-05:00 localhost kernel: [ 91.053101][ T5215] RSP: 002b:00007ffc93f8fbb8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
2022-02-18T09:08:02.630275-05:00 localhost kernel: [ 91.053103][ T5215] RAX: ffffffffffffffda RBX: 00007ffc93f8fc00 RCX: 00007fa96f8a688b
2022-02-18T09:08:02.630277-05:00 localhost kernel: [ 91.053104][ T5215] RDX: 00007ffc93f8fc00 RSI: 00000000c03864bc RDI: 0000000000000013
2022-02-18T09:08:02.630278-05:00 localhost kernel: [ 91.053105][ T5215] RBP: 00000000c03864bc R08: 0000000000000000 R09: 0000000000000000
2022-02-18T09:08:02.630279-05:00 localhost kernel: [ 91.053106][ T5215] R10: 000000000000000d R11: 0000000000000246 R12: 000055c2e2d4dbf0
2022-02-18T09:08:02.630281-05:00 localhost kernel: [ 91.053106][ T5215] R13: 0000000000000013 R14: 000055c2e2d87840 R15: 000055c2e2d201f0
2022-02-18T09:08:02.630282-05:00 localhost kernel: [ 91.053108][ T5215] </TASK>
2022-02-18T09:08:02.630283-05:00 localhost kernel: [ 91.053109][ T5215] ---[ end trace dd8544f5a3a3564e ]---
2022-02-18T09:08:02.630285-05:00 localhost kernel: [ 91.053111][ T5215] amdgpu 0000:04:00.0: [drm] *ERROR* cannot allocate and release VCPI on [MST PORT:00000000224748f2] in the same state
The error I'm seeing now from journalctl -f -g amdgpu is:
amdgpu 0000:05:00.0: [drm] *ERROR* cannot allocate and release VCPI on [MST PORT:0000000050490ba5] in the same state
Mar 04 12:18:28 localhost.localdomain kernel: snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi think_lmi wmi_bmof firmware_at>
Mar 04 12:18:28 localhost.localdomain kernel: ? dc_link_should_enable_fec+0x3d/0x60 [amdgpu a112560892c61ad1d7b16506e460ce7b00266659]
Mar 04 12:18:28 localhost.localdomain kernel: compute_mst_dsc_configs_for_link+0x949/0xa20 [amdgpu a112560892c61ad1d7b16506e460ce7b00266659]
Mar 04 12:18:28 localhost.localdomain kernel: compute_mst_dsc_configs_for_state+0x1eb/0x340 [amdgpu a112560892c61ad1d7b16506e460ce7b00266659]
Mar 04 12:18:28 localhost.localdomain kernel: amdgpu_dm_atomic_check+0xce4/0xf00 [amdgpu a112560892c61ad1d7b16506e460ce7b00266659]
Mar 04 12:18:28 localhost.localdomain kernel: amdgpu_drm_ioctl+0x49/0x80 [amdgpu a112560892c61ad1d7b16506e460ce7b00266659]