WARNING: CPU: 14 PID: 104 at drivers/gpu/drm/amd/amdgpu/../display/dc/dce/dmub_psr.c:123 dmub_psr_get_state+0xc6/0xd0 [amdgpu]
[58387.044010] ------------[ cut here ]------------
[58387.044028] WARNING: CPU: 14 PID: 104 at drivers/gpu/drm/amd/amdgpu/../display/dc/dce/dmub_psr.c:123 dmub_psr_get_state+0xc6/0xd0 [amdgpu]
[58387.046385] Modules linked in: exfat cdc_acm snd_usb_audio snd_usbmidi_lib snd_rawmidi uas usb_storage hid_lenovo snd_seq_dummy snd_hrtimer xt_MASQUERADE xt_conntrack ipt_REJECT nft_compat n
f_nat_tftp nf_conntrack_tftp nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib qrtr_mhi nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat sunrpc nf_nat nf_conntrack nf_
defrag_ipv6 nf_defrag_ipv4 bridge stp llc ip_set nf_tables nfnetlink uvcvideo uvc videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev mc binfmt_misc vfat fat snd_acp6x_p
dm_dma snd_soc_dmic snd_soc_acp6x_mach snd_sof_amd_rembrandt snd_sof_amd_renoir qrtr snd_sof_amd_acp ath11k_pci snd_sof_pci snd_sof_xtensa_dsp ath11k snd_sof snd_ctl_led qmi_helpers snd_sof_uti
ls snd_hda_codec_realtek snd_hda_codec_generic snd_soc_core snd_hda_codec_hdmi mac80211 intel_rapl_msr snd_hda_intel intel_rapl_common snd_compress snd_intel_dspcfg snd_intel_sdw_acpi ac97_bus edac_mce_amd snd_pcm_dmaengine snd_hda_codec snd_pci_ps
[58387.046632] snd_rpl_pci_acp6x snd_pci_acp6x kvm_amd snd_hda_core snd_hwdep snd_seq libarc4 snd_seq_device kvm snd_pcm cfg80211 thinkpad_acpi irqbypass ledtrig_audio snd_pci_acp5x snd_timer platform_profile snd_rn_pci_acp3x rapl nxp_nci_i2c snd_acp_config nxp_nci snd snd_soc_acpi think_lmi nci firmware_attributes_class pcspkr snd_pci_acp3x wmi_bmof nfc mhi soundcore k10temp i2c_piix4 rfkill acpi_tad amd_pmc joydev squashfs loop zram dm_crypt amdgpu i2c_algo_bit drm_ttm_helper ttm iommu_v2 drm_buddy nvme gpu_sched drm_display_helper nvme_core crct10dif_pclmul crc32_pclmul crc32c_intel video polyval_clmulni ucsi_acpi hid_multitouch polyval_generic ghash_clmulni_intel typec_ucsi sha512_ssse3 sp5100_tco ccp cec r8169 typec nvme_common wmi i2c_hid_acpi i2c_hid serio_raw ip6_tables ip_tables i2c_dev fuse
[58387.046858] CPU: 14 PID: 104 Comm: kworker/14:0H Not tainted 6.3.8-200.fc38.x86_64 #1
[58387.046873] Hardware name: LENOVO 21CFCTO1WW/21CFCTO1WW, BIOS R23ET65W (1.35 ) 03/21/2023
[58387.046881] Workqueue: events_highpri dm_irq_work_func [amdgpu]
[58387.048958] RIP: 0010:dmub_psr_get_state+0xc6/0xd0 [amdgpu]
[58387.051075] Code: 00 00 74 b4 48 8b 44 24 08 65 48 2b 04 25 28 00 00 00 75 1a 48 83 c4 10 5b 5d 41 5c 41 5d c3 cc cc cc cc 3d ff 00 00 00 75 da <0f> 0b eb d6 e8 81 18 63 ea 90 90 90 90 90 90 90 90 90 90 90 90 90
[58387.051087] RSP: 0018:ffffa41ac0513ca0 EFLAGS: 00010246
[58387.051099] RAX: 00000000000000ff RBX: 00000000000003e9 RCX: 000000000000000e
[58387.051108] RDX: 0000000000000000 RSI: 0000000000001681 RDI: ffff8a3ad7080000
[58387.051115] RBP: ffff8a3ad6c46000 R08: 0000000000000000 R09: ffffa41b009e3900
[58387.051121] R10: 0000000000000000 R11: fefefefefefefeff R12: 0000000000000000
[58387.051125] R13: ffffa41ac0513cdc R14: ffff8a3ac44af140 R15: 0000000000000000
[58387.051132] FS: 0000000000000000(0000) GS:ffff8a41df180000(0000) knlGS:0000000000000000
[58387.051140] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[58387.051147] CR2: 00007f7f60000020 CR3: 00000001de022000 CR4: 0000000000750ee0
[58387.051155] PKRU: 55555554
[58387.051160] Call Trace:
[58387.051173] <TASK>
[58387.051179] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu]
[58387.053141] ? __warn+0x81/0x130
[58387.053175] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu]
[58387.055331] ? report_bug+0x171/0x1a0
[58387.055358] ? handle_bug+0x3c/0x80
[58387.055376] ? exc_invalid_op+0x17/0x70
[58387.055390] ? asm_exc_invalid_op+0x1a/0x20
[58387.055429] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu]
[58387.057544] dmub_psr_enable+0xd4/0x120 [amdgpu]
[58387.059461] dc_link_set_psr_allow_active+0x27e/0x3b0 [amdgpu]
[58387.061459] dc_link_handle_hpd_rx_irq+0x318/0x350 [amdgpu]
[58387.063369] handle_hpd_rx_irq+0xca/0x490 [amdgpu]
[58387.065428] ? __schedule+0x451/0x14a0
[58387.065443] ? blk_mq_run_hw_queues+0x8a/0x110
[58387.065471] process_one_work+0x1c7/0x3d0
[58387.065489] worker_thread+0x51/0x390
[58387.065500] ? __pfx_worker_thread+0x10/0x10
[58387.065509] kthread+0xde/0x110
[58387.065526] ? __pfx_kthread+0x10/0x10
[58387.065541] ret_from_fork+0x2c/0x50
[58387.065579] </TASK>
[58387.065583] ---[ end trace 0000000000000000 ]---
Designs
- Show closed items
Activity
-
Newest first Oldest first
-
Show all activity Show comments only Show history only
- Developer
Two things:
- This is new, right? Was it caused by a kernel upgrade or a linux-firmware upgrade? Can you bisect it?
- Can you please upgrade to the latest DMUB F/W at linux-firmware.git. This specifically has a variety of PSR related fixes that might be relevant for the issue.
- Mario Limonciello added PSR Rembrandt labels
- Mario Limonciello added DC label
added DC label
- Developer
Also; please run this script and provide the output to confirm what TCON is in your panel. https://gitlab.freedesktop.org/drm/amd/-/blob/master/scripts/psr.py
Edited by Mario Limonciello - Author
> 1. This is new, right? Was it caused by a kernel upgrade or a linux-firmware upgrade? Not new. > 2. Can you please upgrade to the latest DMUB F/W at linux-firmware.git. I think I already have the latest version. > ls -l /usr/lib/firmware/amdgpu/yellow_carp_dmcub.bin.xz -rw-r--r--. 1 root root 144680 May 16 02:00 /usr/lib/firmware/amdgpu/yellow_carp_dmcub.bin.xz
Fedora 38 Linux 6.3.8-200
> doas ./psr.py DRI device 1 DMCUB F/W version: 0x0400002e Could not read DPCD, is the panel on? > doas dmesg | grep -i dpcd [ 3.511047] [drm] PSR support 1, DC PSR ver 1, sink PSR ver 3 DPCD caps 0x7b su_y_granularity 4
Edited by Patch Adam - Developer
That's a bit weird that it's not able to read the DPCD. Presumably you did do that with the panel on?
But that is the older firmware. The newer one should be
0x0400003c
. Can you please fetch it from linux-firmware.git and manually update it? - Author
I'm on KDE, so no Gnome panel.
Edited by Patch Adam Collapse replies - Developer
"Panel" meant the display panel. Like is your monitor on, or did you do this with lid closed/over SSH.
- Author
The display is on for the user.
The display is not available through sudo.
- Author
I have the binary now. Let me see what happens.
- Author
> ls -l /usr/lib/firmware/amdgpu/yellow_carp_dmcub.bin.xz* -rw-r--r--. 1 root root 145528 Jun 22 16:26 /usr/lib/firmware/amdgpu/yellow_carp_dmcub.bin.xz -rw-r--r--. 1 root root 144680 Jun 22 16:25 /usr/lib/firmware/amdgpu/yellow_carp_dmcub.bin.xz_old I see no difference after reboot. > doas ./psr.py DRI device 1 DMCUB F/W version: 0x0400002e Could not read DPCD, is the panel on? > doas dmesg | grep -i dpcd [ 3.379811] [drm] PSR support 1, DC PSR ver 1, sink PSR ver 3 DPCD caps 0x7b su_y_granularity 4
Edited by Patch Adam - Developer
For Feodra you'll need to rebuild your initramfs to get the new update included.
- Author
> doas /usr/bin/dracut --force
- Author
> doas ./psr.py failed to find drm_dp_aux_dev > doas dmesg | grep -i dpcd [no answer]
And my external displays are offline now.
- Author
> doas dmesg | grep amdgpu [ 3.214556] [drm] amdgpu kernel modesetting enabled. [ 3.220024] amdgpu: Virtual CRAT table created for CPU [ 3.220033] amdgpu: Topology: Add CPU node [ 3.220111] amdgpu 0000:04:00.0: enabling device (0006 -> 0007) [ 3.221765] amdgpu 0000:04:00.0: amdgpu: Fetched VBIOS from VFCT [ 3.221767] amdgpu: ATOM BIOS: 113-REMBRANDT-X37 [ 3.225095] amdgpu 0000:04:00.0: xz decompression failed (xz_ret=6) [ 3.225099] amdgpu 0000:04:00.0: Direct firmware load for amdgpu/yellow_carp_dmcub.bin failed with error -22 [ 3.225101] [drm:dm_early_init [amdgpu]] *ERROR* DMUB firmware loading failed: -19 [ 3.225318] [drm:amdgpu_device_init [amdgpu]] *ERROR* early_init of IP block <dm> failed -19 [ 3.254338] amdgpu 0000:04:00.0: amdgpu: Fatal error during GPU init [ 3.254339] amdgpu 0000:04:00.0: amdgpu: amdgpu: finishing device.
- Author
What compression level is required here? The default for xz is 6.
- Developer
I don't know; I've never compressed my firmware. You can just use it uncompressed if you want.
- Author
> doas dmesg | grep amdgpu [ 2.945378] [drm] amdgpu kernel modesetting enabled. [ 2.950958] amdgpu: Virtual CRAT table created for CPU [ 2.950966] amdgpu: Topology: Add CPU node [ 2.951045] amdgpu 0000:04:00.0: enabling device (0006 -> 0007) [ 2.952633] amdgpu 0000:04:00.0: amdgpu: Fetched VBIOS from VFCT [ 2.952634] amdgpu: ATOM BIOS: 113-REMBRANDT-X37 [ 2.998955] amdgpu 0000:04:00.0: vgaarb: deactivate vga console [ 2.998963] amdgpu 0000:04:00.0: amdgpu: Trusted Memory Zone (TMZ) feature disabled as experimental (default) [ 2.999014] amdgpu 0000:04:00.0: amdgpu: VRAM: 1024M 0x000000F400000000 - 0x000000F43FFFFFFF (1024M used) [ 2.999016] amdgpu 0000:04:00.0: amdgpu: GART: 1024M 0x0000000000000000 - 0x000000003FFFFFFF [ 2.999018] amdgpu 0000:04:00.0: amdgpu: AGP: 267419648M 0x000000F800000000 - 0x0000FFFFFFFFFFFF [ 2.999734] [drm] amdgpu: 1024M of VRAM memory ready [ 2.999737] [drm] amdgpu: 15418M of GTT memory ready. [ 3.001567] amdgpu 0000:04:00.0: amdgpu: Will use PSP to load VCN firmware [ 3.187068] amdgpu 0000:04:00.0: amdgpu: RAS: optional ras ta ucode is not available [ 3.199397] amdgpu 0000:04:00.0: amdgpu: RAP: optional rap ta ucode is not available [ 3.199398] amdgpu 0000:04:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available [ 3.201575] amdgpu 0000:04:00.0: amdgpu: SMU is initialized successfully! [ 3.306217] kfd kfd: amdgpu: Allocated 3969056 bytes on gart [ 3.306299] amdgpu: sdma_bitmap: 3 [ 3.315933] amdgpu: HMM registered 1024MB device memory [ 3.316716] amdgpu: SRAT table not found [ 3.316718] amdgpu: Virtual CRAT table created for GPU [ 3.316844] amdgpu: Topology: Add dGPU node [0x1681:0x1002] [ 3.316846] kfd kfd: amdgpu: added device 1002:1681 [ 3.316856] amdgpu 0000:04:00.0: amdgpu: SE 1, SH per SE 2, CU per SH 6, active_cu_number 12 [ 3.316923] amdgpu 0000:04:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0 [ 3.316925] amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0 [ 3.316926] amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0 [ 3.316927] amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0 [ 3.316928] amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0 [ 3.316928] amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0 [ 3.316929] amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0 [ 3.316930] amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0 [ 3.316931] amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0 [ 3.316931] amdgpu 0000:04:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0 [ 3.316932] amdgpu 0000:04:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0 [ 3.316933] amdgpu 0000:04:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1 [ 3.316934] amdgpu 0000:04:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1 [ 3.316935] amdgpu 0000:04:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1 [ 3.316936] amdgpu 0000:04:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 1 [ 3.320770] [drm] Initialized amdgpu 3.52.0 20150101 for 0000:04:00.0 on minor 1 [ 3.328503] fbcon: amdgpudrmfb (fb0) is primary device [ 3.328507] amdgpu 0000:04:00.0: [drm] fb0: amdgpudrmfb frame buffer device [ 13.498622] snd_hda_intel 0000:04:00.1: bound 0000:04:00.0 (ops amdgpu_dm_audio_component_bind_ops [amdgpu])
Oh look....
> doas ./psr.py DRI device 1 DMCUB F/W version: 0x0400003c ○ PSR 2 with Y coordinates (eDP 1.4a) [3] ○ Sink OUI: Parade ○ resv_40f: 01 ○ ID String: 08-03 ○ PSR Status: 00-00-02
and...
> doas dmesg | grep -i dpcd [ 3.297678] [drm] PSR support 1, DC PSR ver 1, sink PSR ver 3 DPCD caps 0x7b su_y_granularity 4 > doas dmesg | grep DMUB [ 3.000323] [drm] Loading DMUB firmware via PSP: version=0x0400003C [ 3.204805] [drm] DMUB hardware initialized: version=0x0400003C
Edited by Patch Adam - Developer
OK this looks good now. You do have the same TCON in your panel as the other PSR issue with the hangs (#2443 (closed)), and this firmware upgrade should be helping it. If you get that warning again with the updated firmware, can you please run the script again when it happens to see what
PSR Status
showsEdited by Mario Limonciello - Author
OK
- Author
This is what happened.
I unplugged everything from the T14, then entered sleep mode, re-opened the lid, attached a USB drive directly to the T14, and worked on it while watching dmesgs.
[ 1740.859086] scsi 0:0:0:0: Direct-Access MS 0 PQ: 0 ANSI: 6 [ 1740.862758] sd 0:0:0:0: Attached scsi generic sg0 type 0 [ 1740.863387] sd 0:0:0:0: [sda] Spinning up disk... [ 1741.880663] ...ready [ 1743.953986] sd 0:0:0:0: [sda] 3907029168 512-byte logical blocks: (2.00 TB/1.82 TiB) [ 1743.954009] sd 0:0:0:0: [sda] 4096-byte physical blocks [ 1743.954289] sd 0:0:0:0: [sda] Write Protect is off [ 1743.954300] sd 0:0:0:0: [sda] Mode Sense: 43 00 00 00 [ 1743.954698] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA [ 1743.954985] sd 0:0:0:0: [sda] Preferred minimum I/O size 512 bytes not a multiple of physical block size (4096 bytes) [ 1743.955004] sd 0:0:0:0: [sda] Optimal transfer size 33553920 bytes not a multiple of physical block size (4096 bytes) [ 1743.983530] sda: sda1 [ 1743.984182] sd 0:0:0:0: [sda] Attached SCSI disk [ 1750.203191] exFAT-fs (sda1): Volume was not properly unmounted. Some data may be corrupt. Please run fsck. [ 1750.211322] ------------[ cut here ]------------ [ 1750.211336] WARNING: CPU: 14 PID: 6589 at drivers/gpu/drm/amd/amdgpu/../display/dc/dce/dmub_psr.c:123 dmub_psr_get_state+0xc6/0xd0 [amdgpu] [ 1750.213751] Modules linked in: exfat snd_usb_audio snd_usbmidi_lib snd_rawmidi uas usb_storage hid_lenovo snd_seq_dummy snd_hrtimer xt_MASQUERADE xt_conntrack ipt_REJECT nft_compat nf_nat_tftp nf_conntrack_tftp nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct qrtr_mhi nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 sunrpc bridge stp llc ip_set nf_tables nfnetlink uvcvideo uvc videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev mc binfmt_misc vfat fat qrtr snd_soc_dmic snd_acp6x_pdm_dma snd_soc_acp6x_mach ath11k_pci snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp ath11k snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_ctl_led qmi_helpers snd_sof_utils snd_hda_codec_realtek mac80211 snd_soc_core snd_hda_codec_generic snd_hda_codec_hdmi intel_rapl_msr intel_rapl_common snd_compress ac97_bus snd_pcm_dmaengine edac_mce_amd snd_hda_intel snd_pci_ps snd_intel_dspcfg snd_rpl_pci_acp6x snd_intel_sdw_acpi kvm_amd [ 1750.213976] snd_hda_codec libarc4 snd_hda_core snd_pci_acp6x kvm snd_hwdep snd_seq irqbypass cfg80211 snd_seq_device snd_pci_acp5x snd_pcm think_lmi thinkpad_acpi rapl nxp_nci_i2c snd_rn_pci_acp3x ledtrig_audio nxp_nci pcspkr firmware_attributes_class wmi_bmof snd_acp_config nci platform_profile snd_timer snd_soc_acpi snd snd_pci_acp3x k10temp mhi nfc i2c_piix4 soundcore rfkill acpi_tad amd_pmc joydev squashfs loop zram dm_crypt amdgpu i2c_algo_bit drm_ttm_helper ttm nvme iommu_v2 drm_buddy gpu_sched nvme_core drm_display_helper crct10dif_pclmul crc32_pclmul crc32c_intel video polyval_clmulni hid_multitouch polyval_generic ucsi_acpi ghash_clmulni_intel sha512_ssse3 typec_ucsi sp5100_tco ccp cec r8169 typec nvme_common wmi i2c_hid_acpi i2c_hid serio_raw ip6_tables ip_tables i2c_dev fuse [ 1750.214227] CPU: 14 PID: 6589 Comm: kworker/14:9H Not tainted 6.3.8-200.fc38.x86_64 #1 [ 1750.214243] Hardware name: LENOVO 21CFCTO1WW/21CFCTO1WW, BIOS R23ET65W (1.35 ) 03/21/2023 [ 1750.214251] Workqueue: events_highpri dm_irq_work_func [amdgpu] [ 1750.216436] RIP: 0010:dmub_psr_get_state+0xc6/0xd0 [amdgpu] [ 1750.218544] Code: 00 00 74 b4 48 8b 44 24 08 65 48 2b 04 25 28 00 00 00 75 1a 48 83 c4 10 5b 5d 41 5c 41 5d c3 cc cc cc cc 3d ff 00 00 00 75 da <0f> 0b eb d6 e8 81 18 3c f3 90 90 90 90 90 90 90 90 90 90 90 90 90 [ 1750.218556] RSP: 0018:ffffb8fa4b99bca0 EFLAGS: 00010246 [ 1750.218568] RAX: 00000000000000ff RBX: 00000000000003e9 RCX: 000000000000000e [ 1750.218577] RDX: 0000000000000000 RSI: 0000000000001681 RDI: ffff92d78ed00000 [ 1750.218584] RBP: ffff92d782111c00 R08: 0000000000000000 R09: ffffb8fa809e3e00 [ 1750.218590] R10: 0000000000000000 R11: 0000000000000100 R12: 0000000000000000 [ 1750.218595] R13: ffffb8fa4b99bcdc R14: ffff92d7814eb5c0 R15: 0000000000000000 [ 1750.218602] FS: 0000000000000000(0000) GS:ffff92de9f180000(0000) knlGS:0000000000000000 [ 1750.218611] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 1750.218618] CR2: 00007f3e4c01fdd0 CR3: 0000000150388000 CR4: 0000000000750ee0 [ 1750.218626] PKRU: 55555554 [ 1750.218631] Call Trace: [ 1750.218643] <TASK> [ 1750.218653] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] [ 1750.220715] ? __warn+0x81/0x130 [ 1750.220751] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] [ 1750.222750] ? report_bug+0x171/0x1a0 [ 1750.222777] ? handle_bug+0x3c/0x80 [ 1750.222794] ? exc_invalid_op+0x17/0x70 [ 1750.222808] ? asm_exc_invalid_op+0x1a/0x20 [ 1750.222831] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] [ 1750.224881] dmub_psr_enable+0xd4/0x120 [amdgpu] [ 1750.226859] dc_link_set_psr_allow_active+0x27e/0x3b0 [amdgpu] [ 1750.228803] dc_link_handle_hpd_rx_irq+0x318/0x350 [amdgpu] [ 1750.230854] handle_hpd_rx_irq+0xca/0x490 [amdgpu] [ 1750.232996] ? __schedule+0x451/0x14a0 [ 1750.233016] process_one_work+0x1c7/0x3d0 [ 1750.233037] worker_thread+0x51/0x390 [ 1750.233048] ? __pfx_worker_thread+0x10/0x10 [ 1750.233057] kthread+0xde/0x110 [ 1750.233074] ? __pfx_kthread+0x10/0x10 [ 1750.233088] ret_from_fork+0x2c/0x50 [ 1750.233118] </TASK> [ 1750.233123] ---[ end trace 0000000000000000 ]--- [ 1763.158829] sd 0:0:0:0: [sda] Synchronizing SCSI cache [ 1763.279703] sd 0:0:0:0: [sda] Synchronize Cache(10) failed: Result: hostbyte=DID_ERROR driverbyte=DRIVER_OK
Edited by Patch Adam - Developer
@effective-light can you take a look at this? A PSR timeout warning is happening on the Parade 08-03 TCON even with the newer DMUB F/W. No hang with the newer F/W though.
- Mario Limonciello mentioned in issue #2453 (closed)
mentioned in issue #2453 (closed)
- Mario Limonciello mentioned in issue #2443 (closed)
mentioned in issue #2443 (closed)
- Author
Linux version 6.3.8-200.fc38.x86_64
[31939.860379] r8169 0000:01:00.0 enp1s0f0: Link is Up - 1Gbps/Full - flow control off [31940.423834] ------------[ cut here ]------------ [31940.423840] WARNING: CPU: 4 PID: 2635 at drivers/gpu/drm/amd/amdgpu/../display/dc/dce/dmub_psr.c:123 dmub_psr_get_state+0xc6/0xd0 [amdgpu] [31940.424531] Modules linked in: exfat uas usb_storage cdc_acm rfcomm snd_seq_dummy snd_hrtimer bnep xt_MASQUERADE xt_conntrack ipt_REJECT nft_compat btusb btrtl btbcm btintel btmtk bluetooth nf_nat_tftp nf_conntrack_tftp nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib snd_usb_audio nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject snd_usbmidi_lib snd_rawmidi nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 sunrpc bridge stp llc ip_set nf_tables nfnetlink qrtr_mhi uvcvideo uvc videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev mc binfmt_misc vfat fat snd_acp6x_pdm_dma snd_soc_dmic snd_soc_acp6x_mach snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_ctl_led qrtr snd_hda_codec_realtek snd_sof_utils ath11k_pci snd_hda_codec_generic snd_soc_core snd_hda_codec_hdmi ath11k intel_rapl_msr intel_rapl_common snd_hda_intel snd_compress edac_mce_amd snd_intel_dspcfg ac97_bus snd_intel_sdw_acpi snd_pcm_dmaengine snd_hda_codec [31940.424657] snd_pci_ps qmi_helpers snd_rpl_pci_acp6x kvm_amd snd_hda_core snd_pci_acp6x snd_hwdep mac80211 kvm snd_seq libarc4 irqbypass rapl thinkpad_acpi snd_seq_device think_lmi firmware_attributes_class snd_pcm wmi_bmof cfg80211 pcspkr ledtrig_audio nxp_nci_i2c snd_pci_acp5x snd_timer snd_rn_pci_acp3x snd_acp_config nxp_nci platform_profile snd nci snd_soc_acpi mhi snd_pci_acp3x k10temp i2c_piix4 nfc soundcore joydev rfkill amd_pmc acpi_tad squashfs loop zram dm_crypt amdgpu hid_lenovo i2c_algo_bit drm_ttm_helper ttm iommu_v2 drm_buddy gpu_sched drm_display_helper ccp cec nvme nvme_core crct10dif_pclmul crc32_pclmul video hid_multitouch ucsi_acpi crc32c_intel polyval_clmulni polyval_generic ghash_clmulni_intel typec_ucsi sha512_ssse3 r8169 sp5100_tco typec nvme_common wmi i2c_hid_acpi i2c_hid serio_raw ip6_tables ip_tables i2c_dev fuse [31940.424738] CPU: 4 PID: 2635 Comm: Xorg Not tainted 6.3.8-200.fc38.x86_64 #1 [31940.424743] Hardware name: LENOVO 21CFCTO1WW/21CFCTO1WW, BIOS R23ET65W (1.35 ) 03/21/2023 [31940.424746] RIP: 0010:dmub_psr_get_state+0xc6/0xd0 [amdgpu] [31940.425412] Code: 00 00 74 b4 48 8b 44 24 08 65 48 2b 04 25 28 00 00 00 75 1a 48 83 c4 10 5b 5d 41 5c 41 5d c3 cc cc cc cc 3d ff 00 00 00 75 da <0f> 0b eb d6 e8 81 58 52 d7 90 90 90 90 90 90 90 90 90 90 90 90 90 [31940.425417] RSP: 0018:ffffb93dccccf678 EFLAGS: 00010246 [31940.425423] RAX: 00000000000000ff RBX: 00000000000003e9 RCX: 0000000000000004 [31940.425427] RDX: 0000000000000000 RSI: 0000000000001681 RDI: ffff8ee1d6900000 [31940.425431] RBP: ffff8ee1d1586800 R08: 0000000000000000 R09: ffffb93e009e3e00 [31940.425435] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 [31940.425438] R13: ffffb93dccccf6b4 R14: ffff8ee1c4498a80 R15: 0000000000000000 [31940.425442] FS: 00007f46fd93fac0(0000) GS:ffff8ee8def00000(0000) knlGS:0000000000000000 [31940.425447] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [31940.425450] CR2: 00007f3c2447c000 CR3: 0000000141b92000 CR4: 0000000000750ee0 [31940.425455] PKRU: 55555554 [31940.425457] Call Trace: [31940.425464] <TASK> [31940.425467] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] [31940.426364] ? __warn+0x81/0x130 [31940.426375] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] [31940.427021] ? report_bug+0x171/0x1a0 [31940.427033] ? handle_bug+0x3c/0x80 [31940.427042] ? exc_invalid_op+0x17/0x70 [31940.427049] ? asm_exc_invalid_op+0x1a/0x20 [31940.427061] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] [31940.427728] ? dmub_psr_get_state+0x7a/0xd0 [amdgpu] [31940.428360] dmub_psr_enable+0xd4/0x120 [amdgpu] [31940.428990] dc_link_set_psr_allow_active+0x27e/0x3b0 [amdgpu] [31940.429644] dc_set_psr_allow_active+0x91/0xf0 [amdgpu] [31940.430219] amdgpu_dm_atomic_commit_tail+0x581/0x3620 [amdgpu] [31940.430865] ? dcn31_populate_dml_pipes_from_context+0x5c/0x320 [amdgpu] [31940.431479] ? dcn30_internal_validate_bw+0xf1/0x9d0 [amdgpu] [31940.432043] ? dcn31_calculate_wm_and_dlg_fp+0x55/0x540 [amdgpu] [31940.432690] ? dcn31_calculate_wm_and_dlg+0x45/0x60 [amdgpu] [31940.433278] ? dcn31_validate_bandwidth+0x12f/0x2d0 [amdgpu] [31940.433840] ? dc_validate_global_state+0x30a/0x3e0 [amdgpu] [31940.434419] ? ww_mutex_lock+0x18/0x80 [31940.434429] ? dma_resv_iter_first_unlocked+0x66/0x70 [31940.434439] ? dma_resv_get_fences+0x61/0x220 [31940.434450] ? dma_resv_get_singleton+0x46/0x140 [31940.434459] ? wait_for_completion_timeout+0x13e/0x170 [31940.434465] ? wait_for_completion_interruptible+0x139/0x1e0 [31940.434475] commit_tail+0x94/0x130 [31940.434485] drm_atomic_helper_commit+0x11a/0x140 [31940.434492] drm_atomic_commit+0x9a/0xd0 [31940.434499] ? __pfx___drm_printfn_info+0x10/0x10 [31940.434510] drm_atomic_connector_commit_dpms+0xce/0xf0 [31940.434517] drm_mode_obj_set_property_ioctl+0x197/0x3d0 [31940.434530] ? __pfx_drm_connector_property_set_ioctl+0x10/0x10 [31940.434539] drm_connector_property_set_ioctl+0x3d/0x60 [31940.434547] drm_ioctl_kernel+0xcd/0x170 [31940.434555] drm_ioctl+0x26d/0x4b0 [31940.434563] ? __pfx_drm_connector_property_set_ioctl+0x10/0x10 [31940.434575] amdgpu_drm_ioctl+0x4e/0x90 [amdgpu] [31940.435085] __x64_sys_ioctl+0x94/0xd0 [31940.435093] do_syscall_64+0x60/0x90 [31940.435098] ? do_writev+0x117/0x160 [31940.435106] ? syscall_exit_to_user_mode+0x1b/0x40 [31940.435111] ? do_syscall_64+0x6c/0x90 [31940.435114] ? syscall_exit_to_user_mode+0x1b/0x40 [31940.435118] ? do_syscall_64+0x6c/0x90 [31940.435121] ? syscall_exit_to_user_mode+0x1b/0x40 [31940.435124] ? do_syscall_64+0x6c/0x90 [31940.435127] ? do_syscall_64+0x6c/0x90 [31940.435130] entry_SYSCALL_64_after_hwframe+0x72/0xdc [31940.435137] RIP: 0033:0x7f46fe00bedd [31940.435187] Code: 04 25 28 00 00 00 48 89 45 c8 31 c0 48 8d 45 10 c7 45 b0 10 00 00 00 48 89 45 b8 48 8d 45 d0 48 89 45 c0 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 1a 48 8b 45 c8 64 48 2b 04 25 28 00 00 00 [31940.435190] RSP: 002b:00007ffd687f3ac0 EFLAGS: 00000246 ORIG_RAX: 0000000000000010 [31940.435194] RAX: ffffffffffffffda RBX: 000055a676483e10 RCX: 00007f46fe00bedd [31940.435196] RDX: 00007ffd687f3b50 RSI: 00000000c01064ab RDI: 000000000000000e [31940.435198] RBP: 00007ffd687f3b10 R08: 0000000000000000 R09: 000055a676487dc0 [31940.435200] R10: 000055a677060da0 R11: 0000000000000246 R12: 00007ffd687f3b50 [31940.435202] R13: 00000000c01064ab R14: 000000000000000e R15: 000055a676483bf0 [31940.435206] </TASK> [31940.435208] ---[ end trace 0000000000000000 ]--- [31943.489663] usb 7-1: new high-speed USB device number 14 using xhci_hcd
I upgraded linux-firmware and have kind of the same issue. The difference being dmub_psr_enable+0x10a/0x120.
amdgpu_warning amdgpu_firmware_info_july
WARNING: CPU: 10 PID: 18529 at drivers/gpu/drm/amd/amdgpu/../display/dc/dce/dmub_psr.c:223 dmub_psr_enable+0x10a/0x120 [amdgpu] CPU: 10 PID: 18529 Comm: kworker/10:2H Not tainted 6.3.8-200.fc38.x86_64 #1 Hardware name: LENOVO 21CFCTO1WW/21CFCTO1WW, BIOS R23ET65W (1.35 ) 03/21/2023 Workqueue: events_highpri dm_irq_work_func [amdgpu] RIP: 0010:dmub_psr_enable+0x10a/0x120 [amdgpu] Code: cf 81 fb e8 03 00 00 74 21 48 8b 44 24 48 65 48 2b 04 25 28 00 00 00 75 15 48 83 c4 50 5b 5d 41 5c 41 5d 41 5e c3 cc cc cc cc <0f> 0b eb db e8 4d 36 52 c4 66 66 2e 0f 1f 84 00 00 00 00 00 66 90 RSP: 0018:ffffb4b288ad3cd8 EFLAGS: 00010246 RAX: 000118e5b6aeee4b RBX: 00000000000003e9 RCX: 000000000000000a RDX: 0000000000161d92 RSI: 0000000000161535 RDI: 000118e5b698d0b9 RBP: 0000000000000000 R08: 0000000000000000 R09: ffffb4b2ffbe3e00 R10: 0000000000000000 R11: fefefefefefefeff R12: ffff8ff216e94cb0 R13: 0000000000000000 R14: ffff8ff2102b7b40 R15: 0000000000000000 FS: 0000000000000000(0000) GS:ffff8ff52f080000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 00007f861da10000 CR3: 00000003a1022000 CR4: 0000000000750ee0 PKRU: 55555554 Call Trace: <TASK> ? dmub_psr_enable+0x10a/0x120 [amdgpu] ? __warn+0x81/0x130 ? dmub_psr_enable+0x10a/0x120 [amdgpu] ? report_bug+0x171/0x1a0 ? handle_bug+0x3c/0x80 ? exc_invalid_op+0x17/0x70 ? asm_exc_invalid_op+0x1a/0x20 ? dmub_psr_enable+0x10a/0x120 [amdgpu] ? dmub_psr_enable+0xbd/0x120 [amdgpu] dc_link_set_psr_allow_active+0x27e/0x3b0 [amdgpu] dc_link_handle_hpd_rx_irq+0x318/0x350 [amdgpu] handle_hpd_rx_irq+0xca/0x490 [amdgpu] ? __schedule+0x451/0x14a0 process_one_work+0x1c7/0x3d0 worker_thread+0x51/0x390 ? __pfx_worker_thread+0x10/0x10 kthread+0xde/0x110 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK>
- Author
Following yesterday's update (appended), the problem seems to have found a solution. It is too early to close the ticket, however. Let see what happens next week.
> doas ./psr.py DRI device 1 DMCUB F/W version: 0x0400003c Could not read DPCD, is the panel on?
> doas dmesg | grep -i dpcd [ 3.411082] [drm] PSR support 1, DC PSR ver 1, sink PSR ver 3 DPCD caps 0x7b su_y_granularity 4
> doas dmesg | grep DMUB [ 3.118128] [drm] Loading DMUB firmware via PSP: version=0x0400003C [ 3.323307] [drm] DMUB hardware initialized: version=0x0400003C
> kernel Upgrade to new version 6.3.11-200.fc38 Release notes: The 6.3.11 stable kernel update contains a number of important fixes across the tree. > linux-firmware-whence Upgrade to new version 20230625-151.fc38 Release notes: Update to upstream 20230625 release: wilc1000: update WILC1000 firmware to v16.0 ice: update ice DDP wireless_edge package to 1.3.10.0 amdgpu: DMCUB updates for DCN 3.1.4 and 3.1.5 amdgpu: update DMCUB to v0.0.172.0 for various AMDGPU ASICs qcom: Update the microcode files for Adreno a630 GPUs. qcom: sdm845: rename the modem firmware qcom: sdm845: update remoteproc firmware rtl_bt: Update RTL8852A BT USB firmware to 0xDAC7_480D rtl_bt: Update RTL8852C BT USB firmware to 0x040D_7225 update firmware for MT7921/MT7922 WiFi device update firmware for mediatek MT7921/MT7922 bluetooth chip (MT7922) i915: Add HuC v8.5.0 for MTL mediatek: Update mt8195 SCP firmware to support hevc qcom: apq8016: add Dragonboard 410c WiFi and modem firmware cirrus: Add firmware for new Asus ROG Laptops brcm: Add symlinks from Pine64 devices to AW-CM256SM.txt amdgpu: Update GC 11.0.1 and 11.0.4 rtw89: 8851b: add firmware v0.29.41.0 amdgpu: various firmware updates for amd.5.5 release ice: update ice DDP comms package to 1.3.40.0 rtlwifi: Add firmware v6.0 for RTL8192FU rtlwifi: Update firmware for RTL8188EU to v28.0 cxgb4: Update firmware to revision 1.27.3.0 > linux-firmware Upgrade to new version 20230625-151.fc38 Release notes: Update to upstream 20230625 release: wilc1000: update WILC1000 firmware to v16.0 ice: update ice DDP wireless_edge package to 1.3.10.0 amdgpu: DMCUB updates for DCN 3.1.4 and 3.1.5 amdgpu: update DMCUB to v0.0.172.0 for various AMDGPU ASICs qcom: Update the microcode files for Adreno a630 GPUs. qcom: sdm845: rename the modem firmware qcom: sdm845: update remoteproc firmware rtl_bt: Update RTL8852A BT USB firmware to 0xDAC7_480D rtl_bt: Update RTL8852C BT USB firmware to 0x040D_7225 update firmware for MT7921/MT7922 WiFi device update firmware for mediatek MT7921/MT7922 bluetooth chip (MT7922) i915: Add HuC v8.5.0 for MTL mediatek: Update mt8195 SCP firmware to support hevc qcom: apq8016: add Dragonboard 410c WiFi and modem firmware cirrus: Add firmware for new Asus ROG Laptops brcm: Add symlinks from Pine64 devices to AW-CM256SM.txt amdgpu: Update GC 11.0.1 and 11.0.4 rtw89: 8851b: add firmware v0.29.41.0 amdgpu: various firmware updates for amd.5.5 release ice: update ice DDP comms package to 1.3.40.0 rtlwifi: Add firmware v6.0 for RTL8192FU rtlwifi: Update firmware for RTL8188EU to v28.0 cxgb4: Update firmware to revision 1.27.3.0 > amd-gpu-firmware Upgrade to new version 20230625-151.fc38 Release notes: Update to upstream 20230625 release: wilc1000: update WILC1000 firmware to v16.0 ice: update ice DDP wireless_edge package to 1.3.10.0 amdgpu: DMCUB updates for DCN 3.1.4 and 3.1.5 amdgpu: update DMCUB to v0.0.172.0 for various AMDGPU ASICs qcom: Update the microcode files for Adreno a630 GPUs. qcom: sdm845: rename the modem firmware qcom: sdm845: update remoteproc firmware rtl_bt: Update RTL8852A BT USB firmware to 0xDAC7_480D rtl_bt: Update RTL8852C BT USB firmware to 0x040D_7225 update firmware for MT7921/MT7922 WiFi device update firmware for mediatek MT7921/MT7922 bluetooth chip (MT7922) i915: Add HuC v8.5.0 for MTL mediatek: Update mt8195 SCP firmware to support hevc qcom: apq8016: add Dragonboard 410c WiFi and modem firmware cirrus: Add firmware for new Asus ROG Laptops brcm: Add symlinks from Pine64 devices to AW-CM256SM.txt amdgpu: Update GC 11.0.1 and 11.0.4 rtw89: 8851b: add firmware v0.29.41.0 amdgpu: various firmware updates for amd.5.5 release ice: update ice DDP comms package to 1.3.40.0 rtlwifi: Add firmware v6.0 for RTL8192FU rtlwifi: Update firmware for RTL8188EU to v28.0 cxgb4: Update firmware to revision 1.27.3.0 > intel-gpu-firmware Upgrade to new version 20230625-151.fc38 Release notes: Update to upstream 20230625 release: wilc1000: update WILC1000 firmware to v16.0 ice: update ice DDP wireless_edge package to 1.3.10.0 amdgpu: DMCUB updates for DCN 3.1.4 and 3.1.5 amdgpu: update DMCUB to v0.0.172.0 for various AMDGPU ASICs qcom: Update the microcode files for Adreno a630 GPUs. qcom: sdm845: rename the modem firmware qcom: sdm845: update remoteproc firmware rtl_bt: Update RTL8852A BT USB firmware to 0xDAC7_480D rtl_bt: Update RTL8852C BT USB firmware to 0x040D_7225 update firmware for MT7921/MT7922 WiFi device update firmware for mediatek MT7921/MT7922 bluetooth chip (MT7922) i915: Add HuC v8.5.0 for MTL mediatek: Update mt8195 SCP firmware to support hevc qcom: apq8016: add Dragonboard 410c WiFi and modem firmware cirrus: Add firmware for new Asus ROG Laptops brcm: Add symlinks from Pine64 devices to AW-CM256SM.txt amdgpu: Update GC 11.0.1 and 11.0.4 rtw89: 8851b: add firmware v0.29.41.0 amdgpu: various firmware updates for amd.5.5 release ice: update ice DDP comms package to 1.3.40.0 rtlwifi: Add firmware v6.0 for RTL8192FU rtlwifi: Update firmware for RTL8188EU to v28.0 cxgb4: Update firmware to revision 1.27.3.0
Edited by Patch Adam 1 Just tried with 6.3.11 fedora 38 kernel and with firmware updates and still got the warning (and I confirm that only after the warning the flickering restarts for me as told in #2453 (comment 1858139) And I confirm it is the same flickering that I used to have previously.)
Edited by naguamCan confirm the PSR error messages and flickering are still there in 6.5.0-rc2, with the latest firmware running on a T14s gen 3 AMD, 6850u.
sudo python Downloads/psr.py DRI device 0 DMCUB F/W version: 0x0400003c ○ PSR 2 with Y coordinates (eDP 1.4a) [3] ○ Sink OUI: Parade ○ resv_40f: 01 ○ ID String: 08-03 ○ PSR Status: 00-00-02
Edited by Roemer ClaasenI'm facing regular hangs on amdgpu. Searching I came on this thread, but I'm already running the latest DMCUB F/W
$ sudo python ./psr.py DRI device 1 DMCUB F/W version: 0x0400003c ○ PSR 2 with Y coordinates (eDP 1.4a) [3] ○ Sink OUI: Parade ○ resv_40f: 01 ○ ID String: 08-03 ○ PSR Status: 00-00-03
Kernel is 6.4.4-200.fc38.x86_64 on Fedora 38. Machine is Lenovo Thinkpad T14s Gen3
Kernel oops very familiar to what i've seen above here.
Jul 28 10:31:53 loki.home.lan kernel: ------------[ cut here ]------------ Jul 28 10:31:53 loki.home.lan kernel: WARNING: CPU: 5 PID: 7209 at drivers/gpu/drm/amd/amdgpu/../display/dc/dce/dmub_psr.c:126 dmub_psr_get_state+0xc6/0xd0 [amdgpu] Jul 28 10:31:53 loki.home.lan kernel: Modules linked in: uinput bnep michael_mic snd_seq_dummy snd_hrtimer nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables nfnetlink qrtr_mhi sunrpc binfmt_misc vfat fat squashfs qrtr snd_soc_dmic snd_acp6x_pdm_dma snd_soc_acp6x_mach snd_ctl_led snd_sof_amd_rembrandt snd_sof_amd_renoir snd_hda_codec_realtek snd_sof_amd_acp snd_sof_pci snd_sof_xtensa_dsp snd_hda_codec_generic ath11k_pci snd_hda_codec_hdmi snd_sof intel_rapl_msr intel_rapl_common ath11k snd_hda_intel edac_mce_amd snd_sof_utils snd_intel_dspcfg snd_intel_sdw_acpi uvcvideo snd_soc_core snd_hda_codec uvc kvm_amd qmi_helpers videobuf2_vmalloc btusb videobuf2_memops snd_hda_core snd_compress videobuf2_v4l2 mac80211 btrtl videobuf2_common kvm btbcm ac97_bus snd_pcm_dmaengine btintel snd_pci_ps libarc4 btmtk videodev snd_hwdep snd_rpl_pci_acp6x irqbypass bluetooth Jul 28 10:31:53 loki.home.lan kernel: snd_seq mc think_lmi snd_pci_acp6x snd_seq_device cfg80211 thinkpad_acpi rapl firmware_attributes_class ledtrig_audio snd_pci_acp5x wmi_bmof snd_pcm thunderbolt platform_profile snd_rn_pci_acp3x k10temp snd_acp_config snd_timer rfkill snd_soc_acpi mhi i2c_piix4 snd snd_pci_acp3x soundcore amd_pmc acpi_tad joydev loop zram amdgpu i2c_algo_bit drm_ttm_helper ttm drm_suballoc_helper iommu_v2 drm_buddy nvme gpu_sched hid_logitech_hidpp nvme_core drm_display_helper crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni video hid_multitouch ucsi_acpi polyval_generic ghash_clmulni_intel sha512_ssse3 typec_ucsi sp5100_tco ccp cec typec nvme_common wmi i2c_hid_acpi i2c_hid hid_logitech_dj serio_raw ip6_tables ip_tables fuse Jul 28 10:31:53 loki.home.lan kernel: CPU: 5 PID: 7209 Comm: kworker/5:0H Not tainted 6.4.4-200.fc38.x86_64 #1 Jul 28 10:31:53 loki.home.lan kernel: Hardware name: LENOVO 21CQCTO1WW/21CQCTO1WW, BIOS R22ET61W (1.31 ) 03/21/2023 Jul 28 10:31:53 loki.home.lan kernel: Workqueue: events_highpri dm_irq_work_func [amdgpu] Jul 28 10:31:53 loki.home.lan kernel: RIP: 0010:dmub_psr_get_state+0xc6/0xd0 [amdgpu] Jul 28 10:31:53 loki.home.lan kernel: Code: 00 00 74 b4 48 8b 44 24 08 65 48 2b 04 25 28 00 00 00 75 1a 48 83 c4 10 5b 5d 41 5c 41 5d c3 cc cc cc cc 3d ff 00 00 00 75 da <0f> 0b eb d6 e8 31 d5 40 e0 90 90 90 90 90 90 90 90 90 90 90 90 90 Jul 28 10:31:53 loki.home.lan kernel: RSP: 0018:ffffa0264be5fca0 EFLAGS: 00010246 Jul 28 10:31:53 loki.home.lan kernel: RAX: 00000000000000ff RBX: 00000000000003e9 RCX: 0000000000000005 Jul 28 10:31:53 loki.home.lan kernel: RDX: 0000000000000000 RSI: 0000000000001681 RDI: ffff94fb571c0000 Jul 28 10:31:53 loki.home.lan kernel: RBP: ffff94fb4ac06c00 R08: 0000000000000014 R09: 000000002f8034f5 Jul 28 10:31:53 loki.home.lan kernel: R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000000 Jul 28 10:31:53 loki.home.lan kernel: R13: ffffa0264be5fcdc R14: ffff94fb44516300 R15: 0000000000000000 Jul 28 10:31:53 loki.home.lan kernel: FS: 0000000000000000(0000) GS:ffff95025ef40000(0000) knlGS:0000000000000000 Jul 28 10:31:53 loki.home.lan kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jul 28 10:31:53 loki.home.lan kernel: CR2: 00007f79c655a000 CR3: 00000001e8022000 CR4: 0000000000750ee0 Jul 28 10:31:53 loki.home.lan kernel: PKRU: 55555554 Jul 28 10:31:53 loki.home.lan kernel: Call Trace: Jul 28 10:31:53 loki.home.lan kernel: <TASK> Jul 28 10:31:53 loki.home.lan kernel: ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] Jul 28 10:31:53 loki.home.lan kernel: ? __warn+0x81/0x130 Jul 28 10:31:53 loki.home.lan kernel: ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] Jul 28 10:31:53 loki.home.lan kernel: ? report_bug+0x171/0x1a0 Jul 28 10:31:53 loki.home.lan kernel: ? handle_bug+0x3c/0x80 Jul 28 10:31:53 loki.home.lan kernel: ? exc_invalid_op+0x17/0x70 Jul 28 10:31:53 loki.home.lan kernel: ? asm_exc_invalid_op+0x1a/0x20 Jul 28 10:31:53 loki.home.lan kernel: ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] Jul 28 10:31:53 loki.home.lan kernel: dmub_psr_enable+0xde/0x120 [amdgpu] Jul 28 10:31:53 loki.home.lan kernel: edp_set_psr_allow_active+0x27e/0x3b0 [amdgpu] Jul 28 10:31:53 loki.home.lan kernel: dp_handle_hpd_rx_irq+0x318/0x350 [amdgpu] Jul 28 10:31:53 loki.home.lan kernel: handle_hpd_rx_irq+0xca/0x490 [amdgpu] Jul 28 10:31:53 loki.home.lan kernel: ? __schedule+0x3f6/0x14c0 Jul 28 10:31:53 loki.home.lan kernel: process_one_work+0x1c7/0x3d0 Jul 28 10:31:53 loki.home.lan kernel: worker_thread+0x51/0x390 Jul 28 10:31:53 loki.home.lan kernel: ? __pfx_worker_thread+0x10/0x10 Jul 28 10:31:53 loki.home.lan kernel: kthread+0xe8/0x120 Jul 28 10:31:53 loki.home.lan kernel: ? __pfx_kthread+0x10/0x10 Jul 28 10:31:53 loki.home.lan kernel: ret_from_fork+0x2c/0x50 Jul 28 10:31:53 loki.home.lan kernel: </TASK> Jul 28 10:31:53 loki.home.lan kernel: ---[ end trace 0000000000000000 ]---
This happens every few days, sometimes a few times in a row. DE is KDE. What happens is that the screen turns black, i need to SSH into and reboot it to get it back to life.
Edited by devilkin- Developer
can this still be reproduced using https://lore.kernel.org/linux-firmware/20230731145446.2708759-1-aurabindo.pillai@amd.com/T/#u?
Collapse replies Firmware taken from https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/commit/amdgpu/yellow_carp_dmcub.bin?id=253cc179d849fc82489773b2b553a49858d8725f
$ sudo python ./psr.py DRI device 1 DMCUB F/W version: 0x0400003e ○ PSR 2 with Y coordinates (eDP 1.4a) [3] ○ Sink OUI: Parade ○ resv_40f: 01 ○ ID String: 08-03 ○ PSR Status: 00-00-02
edit: just encountered a glitch on the screen (the entire screen jumps for a second)
[ 199.489768] ------------[ cut here ]------------ [ 199.489771] WARNING: CPU: 3 PID: 305 at drivers/gpu/drm/amd/amdgpu/../display/dc/dce/dmub_psr.c:126 dmub_psr_get_state+0xc6/0xd0 [amdgpu] [ 199.490068] Modules linked in: michael_mic uinput rfcomm snd_seq_dummy snd_hrtimer nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables nfnetlink qrtr_mhi sunrpc bnep binfmt_misc vfat fat squashfs qrtr snd_acp6x_pdm_dma snd_soc_acp6x_mach snd_soc_dmic snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_ctl_led ath11k_pci snd_sof_pci snd_hda_codec_realtek snd_sof_xtensa_dsp ath11k intel_rapl_msr snd_sof intel_rapl_common snd_hda_codec_generic snd_hda_codec_hdmi edac_mce_amd snd_sof_utils qmi_helpers snd_hda_intel kvm_amd snd_intel_dspcfg snd_intel_sdw_acpi mac80211 snd_soc_core snd_hda_codec kvm snd_compress uvcvideo ac97_bus snd_pcm_dmaengine snd_pci_ps snd_rpl_pci_acp6x btusb uvc libarc4 videobuf2_vmalloc videobuf2_memops snd_hda_core btrtl snd_hwdep videobuf2_v4l2 snd_pci_acp6x btbcm btintel irqbypass snd_seq [ 199.490104] videobuf2_common btmtk cfg80211 snd_seq_device bluetooth videodev thinkpad_acpi rapl think_lmi snd_pcm snd_pci_acp5x mc firmware_attributes_class ledtrig_audio platform_profile wmi_bmof snd_rn_pci_acp3x thunderbolt snd_timer rfkill snd_acp_config mhi snd snd_soc_acpi i2c_piix4 snd_pci_acp3x k10temp soundcore amd_pmc acpi_tad joydev loop zram amdgpu i2c_algo_bit drm_ttm_helper ttm drm_suballoc_helper iommu_v2 drm_buddy gpu_sched drm_display_helper nvme hid_logitech_hidpp ccp cec nvme_core crct10dif_pclmul crc32_pclmul crc32c_intel video hid_multitouch polyval_clmulni ucsi_acpi polyval_generic ghash_clmulni_intel typec_ucsi sha512_ssse3 sp5100_tco typec nvme_common wmi i2c_hid_acpi i2c_hid serio_raw hid_logitech_dj ip6_tables ip_tables fuse [ 199.490143] CPU: 3 PID: 305 Comm: kworker/3:1H Not tainted 6.4.6-200.fc38.x86_64 #1 [ 199.490146] Hardware name: LENOVO 21CQCTO1WW/21CQCTO1WW, BIOS R22ET61W (1.31 ) 03/21/2023 [ 199.490147] Workqueue: events_highpri dm_irq_work_func [amdgpu] [ 199.490347] RIP: 0010:dmub_psr_get_state+0xc6/0xd0 [amdgpu] [ 199.490560] Code: 00 00 74 b4 48 8b 44 24 08 65 48 2b 04 25 28 00 00 00 75 1a 48 83 c4 10 5b 5d 41 5c 41 5d c3 cc cc cc cc 3d ff 00 00 00 75 da <0f> 0b eb d6 e8 71 23 53 c4 90 90 90 90 90 90 90 90 90 90 90 90 90 [ 199.490561] RSP: 0018:ffffb621c0a07ca8 EFLAGS: 00010246 [ 199.490562] RAX: 00000000000000ff RBX: 00000000000003e9 RCX: 0000000000000003 [ 199.490563] RDX: 0000000000000000 RSI: 0000000000001681 RDI: ffff965351bc0000 [ 199.490564] RBP: ffff9653565a5400 R08: 0000000000000000 R09: ffffb622009e4000 [ 199.490565] R10: 0000000000000000 R11: fefefefefefefeff R12: 0000000000000000 [ 199.490565] R13: ffffb621c0a07ce4 R14: ffff96534bf1f5c0 R15: 0000000000000000 [ 199.490566] FS: 0000000000000000(0000) GS:ffff965a5eec0000(0000) knlGS:0000000000000000 [ 199.490567] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 199.490567] CR2: 00001d5400858000 CR3: 00000004ec022000 CR4: 0000000000750ee0 [ 199.490568] PKRU: 55555554 [ 199.490568] Call Trace: [ 199.490574] <TASK> [ 199.490575] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] [ 199.490784] ? __warn+0x81/0x130 [ 199.490792] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] [ 199.491006] ? report_bug+0x171/0x1a0 [ 199.491011] ? handle_bug+0x3c/0x80 [ 199.491014] ? exc_invalid_op+0x17/0x70 [ 199.491015] ? asm_exc_invalid_op+0x1a/0x20 [ 199.491019] ? dmub_psr_get_state+0xc6/0xd0 [amdgpu] [ 199.491213] ? dmub_psr_get_state+0x7a/0xd0 [amdgpu] [ 199.491420] dmub_psr_enable+0xde/0x120 [amdgpu] [ 199.491623] edp_set_psr_allow_active+0x27e/0x3b0 [amdgpu] [ 199.491817] dp_handle_hpd_rx_irq+0x318/0x350 [amdgpu] [ 199.492018] handle_hpd_rx_irq+0xca/0x470 [amdgpu] [ 199.492221] ? blk_mq_requeue_work+0x18b/0x1f0 [ 199.492226] process_one_work+0x1c7/0x3d0 [ 199.492231] worker_thread+0x51/0x390 [ 199.492232] ? __pfx_worker_thread+0x10/0x10 [ 199.492233] kthread+0xe8/0x120 [ 199.492238] ? __pfx_kthread+0x10/0x10 [ 199.492239] ret_from_fork+0x2c/0x50 [ 199.492245] </TASK> [ 199.492245] ---[ end trace 0000000000000000 ]--- [ 201.610469] ------------[ cut here ]------------ [ 201.610472] WARNING: CPU: 3 PID: 305 at drivers/gpu/drm/amd/amdgpu/../display/dc/dce/dmub_psr.c:226 dmub_psr_enable+0x114/0x120 [amdgpu] [ 201.610883] Modules linked in: michael_mic uinput rfcomm snd_seq_dummy snd_hrtimer nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables nfnetlink qrtr_mhi sunrpc bnep binfmt_misc vfat fat squashfs qrtr snd_acp6x_pdm_dma snd_soc_acp6x_mach snd_soc_dmic snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_ctl_led ath11k_pci snd_sof_pci snd_hda_codec_realtek snd_sof_xtensa_dsp ath11k intel_rapl_msr snd_sof intel_rapl_common snd_hda_codec_generic snd_hda_codec_hdmi edac_mce_amd snd_sof_utils qmi_helpers snd_hda_intel kvm_amd snd_intel_dspcfg snd_intel_sdw_acpi mac80211 snd_soc_core snd_hda_codec kvm snd_compress uvcvideo ac97_bus snd_pcm_dmaengine snd_pci_ps snd_rpl_pci_acp6x btusb uvc libarc4 videobuf2_vmalloc videobuf2_memops snd_hda_core btrtl snd_hwdep videobuf2_v4l2 snd_pci_acp6x btbcm btintel irqbypass snd_seq [ 201.610924] videobuf2_common btmtk cfg80211 snd_seq_device bluetooth videodev thinkpad_acpi rapl think_lmi snd_pcm snd_pci_acp5x mc firmware_attributes_class ledtrig_audio platform_profile wmi_bmof snd_rn_pci_acp3x thunderbolt snd_timer rfkill snd_acp_config mhi snd snd_soc_acpi i2c_piix4 snd_pci_acp3x k10temp soundcore amd_pmc acpi_tad joydev loop zram amdgpu i2c_algo_bit drm_ttm_helper ttm drm_suballoc_helper iommu_v2 drm_buddy gpu_sched drm_display_helper nvme hid_logitech_hidpp ccp cec nvme_core crct10dif_pclmul crc32_pclmul crc32c_intel video hid_multitouch polyval_clmulni ucsi_acpi polyval_generic ghash_clmulni_intel typec_ucsi sha512_ssse3 sp5100_tco typec nvme_common wmi i2c_hid_acpi i2c_hid serio_raw hid_logitech_dj ip6_tables ip_tables fuse [ 201.610965] CPU: 3 PID: 305 Comm: kworker/3:1H Tainted: G W 6.4.6-200.fc38.x86_64 #1 [ 201.610967] Hardware name: LENOVO 21CQCTO1WW/21CQCTO1WW, BIOS R22ET61W (1.31 ) 03/21/2023 [ 201.610969] Workqueue: events_highpri dm_irq_work_func [amdgpu] [ 201.611357] RIP: 0010:dmub_psr_enable+0x114/0x120 [amdgpu] [ 201.611766] Code: c5 81 fb e8 03 00 00 74 21 48 8b 44 24 48 65 48 2b 04 25 28 00 00 00 75 15 48 83 c4 50 5b 5d 41 5c 41 5d 41 5e c3 cc cc cc cc <0f> 0b eb db e8 33 21 53 c4 0f 1f 00 90 90 90 90 90 90 90 90 90 90 [ 201.611768] RSP: 0018:ffffb621c0a07ce0 EFLAGS: 00010246 [ 201.611770] RAX: 0000000000000000 RBX: 00000000000003e9 RCX: 000000000002c4b8 [ 201.611771] RDX: 0000000000000000 RSI: 0000000055555554 RDI: ffffb621c0a07c38 [ 201.611772] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000 [ 201.611773] R10: 0000000000000001 R11: 0000000000000100 R12: ffff965350868590 [ 201.611774] R13: 0000000000000000 R14: ffff96534bf1f5c0 R15: 0000000000000000 [ 201.611775] FS: 0000000000000000(0000) GS:ffff965a5eec0000(0000) knlGS:0000000000000000 [ 201.611776] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 201.611778] CR2: 00001d5400858000 CR3: 00000004ec022000 CR4: 0000000000750ee0 [ 201.611779] PKRU: 55555554 [ 201.611780] Call Trace: [ 201.611783] <TASK> [ 201.611785] ? dmub_psr_enable+0x114/0x120 [amdgpu] [ 201.612088] ? __warn+0x81/0x130 [ 201.612093] ? dmub_psr_enable+0x114/0x120 [amdgpu] [ 201.612281] ? report_bug+0x171/0x1a0 [ 201.612285] ? handle_bug+0x3c/0x80 [ 201.612287] ? exc_invalid_op+0x17/0x70 [ 201.612288] ? asm_exc_invalid_op+0x1a/0x20 [ 201.612291] ? dmub_psr_enable+0x114/0x120 [amdgpu] [ 201.612488] edp_set_psr_allow_active+0x27e/0x3b0 [amdgpu] [ 201.612677] dp_handle_hpd_rx_irq+0x318/0x350 [amdgpu] [ 201.612860] handle_hpd_rx_irq+0xca/0x470 [amdgpu] [ 201.613052] ? blk_mq_requeue_work+0x18b/0x1f0 [ 201.613055] process_one_work+0x1c7/0x3d0 [ 201.613058] worker_thread+0x51/0x390 [ 201.613059] ? __pfx_worker_thread+0x10/0x10 [ 201.613060] kthread+0xe8/0x120 [ 201.613063] ? __pfx_kthread+0x10/0x10 [ 201.613064] ret_from_fork+0x2c/0x50 [ 201.613068] </TASK> [ 201.613068] ---[ end trace 0000000000000000 ]---
Edited by devilkin 1
Running the latest firmware from the linux-firmware git, and I'm having the same issues.
While not exactly the same error, I'm also getting this:
[ 197.788547] ------------[ cut here ]------------ [ 197.788553] WARNING: CPU: 5 PID: 3087 at drivers/gpu/drm/amd/amdgpu/../display/dc/dce/dmub_psr.c:226 dmub_psr_enable+0x114/0x120 [amdgpu] [ 197.788910] Modules linked in: ctr(E) ccm(E) rfcomm(E) cmac(E) algif_hash(E) algif_skcipher(E) af_alg(E) snd_seq_dummy(E) snd_hrtimer(E) snd_seq(E) snd_seq_device(E) nvme_fabrics(E) ip6t_REJECT(E) nf_reject_ipv6(E) xt_hl(E) ip6t_rt(E) ipt_REJECT(E) nf_reject_ipv4(E) xt_LOG(E) bnep(E) nf_log_syslog(E) nft_limit(E) btusb(E) btrtl(E) btbcm(E) btintel(E) uvcvideo(E) btmtk(E) videobuf2_vmalloc(E) uvc(E) videobuf2_memops(E) videobuf2_v4l2(E) bluetooth(E) videodev(E) videobuf2_common(E) mc(E) jitterentropy_rng(E) drbg(E) ansi_cprng(E) ecdh_generic(E) xt_limit(E) ecc(E) xt_addrtype(E) snd_ctl_led(E) joydev(E) qrtr(E) xt_tcpudp(E) xt_conntrack(E) nf_conntrack(E) snd_hda_codec_realtek(E) nf_defrag_ipv6(E) iwlmvm(E) nf_defrag_ipv4(E) intel_rapl_msr(E) snd_hda_codec_generic(E) intel_rapl_common(E) nft_compat(E) ledtrig_audio(E) edac_mce_amd(E) mac80211(E) snd_hda_codec_hdmi(E) nf_tables(E) kvm_amd(E) snd_hda_intel(E) hid_sensor_accel_3d(E) snd_intel_dspcfg(E) nfnetlink(E) binfmt_misc(E) libarc4(E) snd_intel_sdw_acpi(E) [ 197.788957] snd_soc_dmic(E) snd_soc_acp6x_mach(E) snd_acp6x_pdm_dma(E) hid_sensor_trigger(E) kvm(E) iwlwifi(E) hid_sensor_iio_common(E) snd_hda_codec(E) industrialio_triggered_buffer(E) snd_soc_core(E) kfifo_buf(E) industrialio(E) irqbypass(E) snd_compress(E) snd_hda_core(E) nls_ascii(E) hid_sensor_custom(E) crc32_pclmul(E) snd_pci_acp6x(E) snd_hwdep(E) cfg80211(E) sdhci_pci(E) snd_pci_acp5x(E) nls_cp437(E) sp5100_tco(E) snd_rn_pci_acp3x(E) snd_pcm(E) cqhci(E) ucsi_acpi(E) ccp(E) wacom(E) snd_acp_config(E) sdhci(E) watchdog(E) typec_ucsi(E) vfat(E) hid_sensor_hub(E) snd_timer(E) ideapad_laptop(E) hid_multitouch(E) xhci_pci(E) ghash_clmulni_intel(E) snd_soc_acpi(E) usbhid(E) hid_generic(E) xhci_hcd(E) snd(E) sha512_ssse3(E) sparse_keymap(E) fat(E) sha512_generic(E) rapl(E) pcspkr(E) wmi_bmof(E) roles(E) platform_profile(E) usbcore(E) thunderbolt(E) snd_pci_acp3x(E) battery(E) mmc_core(E) soundcore(E) k10temp(E) rng_core(E) usb_common(E) rfkill(E) i2c_piix4(E) typec(E) i2c_hid_acpi(E) button(E) i2c_hid(E) hid(E) [ 197.789009] amd_pmc(E) ac(E) acpi_tad(E) msr(E) parport_pc(E) ppdev(E) lp(E) parport(E) loop(E) fuse(E) efi_pstore(E) configfs(E) ip_tables(E) x_tables(E) autofs4(E) ext4(E) crc16(E) mbcache(E) jbd2(E) dm_crypt(E) efivarfs(E) raid10(E) raid456(E) libcrc32c(E) crc32c_generic(E) async_raid6_recov(E) async_memcpy(E) async_pq(E) async_xor(E) xor(E) async_tx(E) raid6_pq(E) raid1(E) raid0(E) multipath(E) linear(E) md_mod(E) dm_mirror(E) dm_region_hash(E) dm_log(E) dm_mod(E) amdgpu(E) i2c_algo_bit(E) drm_ttm_helper(E) ttm(E) drm_suballoc_helper(E) drm_buddy(E) gpu_sched(E) drm_display_helper(E) crc32c_intel(E) nvme(E) drm_kms_helper(E) nvme_core(E) t10_pi(E) aesni_intel(E) crypto_simd(E) drm(E) cryptd(E) evdev(E) serio_raw(E) crc64_rocksoft(E) cec(E) crc64(E) crc_t10dif(E) crct10dif_generic(E) rc_core(E) crct10dif_pclmul(E) crct10dif_common(E) video(E) wmi(E) [ 197.789057] CPU: 5 PID: 3087 Comm: kworker/5:2H Tainted: G E 6.4.0-0.a.test-amd64 #1 Debian 6.4.4-1a~test [ 197.789061] Hardware name: LENOVO 82QF/LNVNB161216, BIOS K5CN40WW 04/17/2023 [ 197.789063] Workqueue: events_highpri dm_irq_work_func [amdgpu] [ 197.789415] RIP: 0010:dmub_psr_enable+0x114/0x120 [amdgpu] [ 197.789965] Code: c5 81 fb e8 03 00 00 74 21 48 8b 44 24 48 65 48 2b 04 25 28 00 00 00 75 15 48 83 c4 50 5b 5d 41 5c 41 5d 41 5e c3 cc cc cc cc <0f> 0b eb db e8 d3 b2 55 fa 0f 1f 00 90 90 90 90 90 90 90 90 90 90 [ 197.789968] RSP: 0018:ffffbb3148887cd8 EFLAGS: 00010246 [ 197.789972] RAX: 0000000000000000 RBX: 00000000000003e9 RCX: 000000000000fffc [ 197.789975] RDX: 0000000000000000 RSI: ffffffffbba3aeaf RDI: ffffbb3148887c30 [ 197.789977] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000 [ 197.789978] R10: 0000000000000001 R11: 0000000000000000 R12: ffff989d40d147c0 [ 197.789980] R13: 0000000000000000 R14: ffff989d40d12240 R15: 0000000000000000 [ 197.789982] FS: 0000000000000000(0000) GS:ffff98a02fd40000(0000) knlGS:0000000000000000 [ 197.789984] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 197.789986] CR2: 0000019807987000 CR3: 0000000031a20000 CR4: 0000000000750ee0 [ 197.789989] PKRU: 55555554 [ 197.789990] Call Trace: [ 197.789996] <TASK> [ 197.789998] ? dmub_psr_enable+0x114/0x120 [amdgpu] [ 197.790533] ? __warn+0x81/0x130 [ 197.790547] ? dmub_psr_enable+0x114/0x120 [amdgpu] [ 197.791118] ? report_bug+0x191/0x1c0 [ 197.791129] ? handle_bug+0x3c/0x80 [ 197.791135] ? exc_invalid_op+0x17/0x70 [ 197.791138] ? asm_exc_invalid_op+0x1a/0x20 [ 197.791145] ? dmub_psr_enable+0x114/0x120 [amdgpu] [ 197.791829] ? dmub_psr_enable+0xc7/0x120 [amdgpu] [ 197.792367] edp_set_psr_allow_active+0x27e/0x3b0 [amdgpu] [ 197.792894] dp_handle_hpd_rx_irq+0x318/0x350 [amdgpu] [ 197.793413] handle_hpd_rx_irq+0xca/0x490 [amdgpu] [ 197.793952] ? _raw_spin_unlock+0x19/0x40 [ 197.793959] ? finish_task_switch.isra.0+0x9a/0x300 [ 197.793966] ? __switch_to+0x10a/0x410 [ 197.793973] process_one_work+0x1c7/0x3d0 [ 197.793980] worker_thread+0x51/0x390 [ 197.793983] ? _raw_spin_lock_irqsave+0x27/0x60 [ 197.793988] ? __pfx_worker_thread+0x10/0x10 [ 197.793992] kthread+0xf7/0x130 [ 197.793998] ? __pfx_kthread+0x10/0x10 [ 197.794003] ret_from_fork+0x2c/0x50 [ 197.794013] </TASK> [ 197.794015] ---[ end trace 0000000000000000 ]--- [ 616.138774] ------------[ cut here ]------------
I can report a separate bug if required, but the calls seemed similar, and I get both on the same system.
I'm also seeing the screen show black for a second randomly, but nothing occurs in
dmesg
when these black screens occur.Running kernel 6.4.4
1Collapse replies - Developer
Can you please try 6.4.5 or later? There is a damage clips fix in 6.4.5.
I'm away from my development machine at the moment, I just have my laptop with me and very bad network. I'll be home in just over a week, so I can compile and test then if it's still required. Just thought I'd add my information while I can, in case it helps :)
- Developer
OK thanks guys!
Back now, do you still need this tested?
- Author
This is my fair share of the problem...
> doas ./psr.py DRI device 1 DMCUB F/W version: 0x0400003c - PSR 2 with Y coordinates (eDP 1.4a) [3] - Sink OUI: Parade - resv_40f: 01 - ID String: 08-03 - PSR Status: 00-00-02
Edited by Patch Adam