[6.3.5] Warnings on suspend with Radeon RX 7900 XTX
This looks similar to #2522 (closed), but I have only one GPU.
Suspend/resume works, but I get these warnings in dmesg:
[ 53.300159] ------------[ cut here ]------------
[ 53.300163] WARNING: CPU: 4 PID: 2610 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.300730] Modules linked in: uinput snd_seq_dummy snd_hrtimer xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat ip6table_filter iptable_mangle iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter bridge stp llc qrtr rfkill sunrpc binfmt_misc vfat fat snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi ledtrig_audio snd_usb_audio intel_rapl_msr intel_rapl_common snd_hda_intel edac_mce_amd snd_intel_dspcfg snd_intel_sdw_acpi snd_usbmidi_lib kvm_amd snd_hda_codec snd_rawmidi snd_hda_core mc snd_hwdep snd_seq kvm snd_seq_device snd_pcm irqbypass snd_timer wmi_bmof rapl snd k10temp i2c_piix4 joydev soundcore apple_mfi_fastcharge acpi_cpufreq squashfs loop zram amdgpu hid_roccat_koneplus hid_roccat hid_roccat_common drm_ttm_helper ttm video crct10dif_pclmul crc32_pclmul iommu_v2 crc32c_intel drm_buddy gpu_sched polyval_clmulni nvme polyval_generic drm_display_helper igb ucsi_ccg typec_ucsi ghash_clmulni_intel typec cec ccp sp5100_tco
[ 53.300789] sha512_ssse3 nvme_core dca i2c_algo_bit wmi nvme_common scsi_dh_rdac scsi_dh_emc scsi_dh_alua ip6_tables ip_tables dm_multipath fuse
[ 53.300799] CPU: 4 PID: 2610 Comm: kworker/u64:20 Not tainted 6.3.5-200.fc38.x86_64 #1
[ 53.300803] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570M Pro4, BIOS P1.90 09/10/2019
[ 53.300805] Workqueue: events_unbound async_run_entry_fn
[ 53.300812] RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.301352] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 3f db 91 de e9 5a fd ff ff <0f> 0b b8 ea ff ff ff e9 2e db 91 de b8 ea ff ff ff e9 24 db 91 de
[ 53.301354] RSP: 0018:ffffb56d8e723d48 EFLAGS: 00010246
[ 53.301357] RAX: ffff902cc13097a0 RBX: ffff902cde860000 RCX: 0000000000000000
[ 53.301359] RDX: 0000000000000000 RSI: ffff902cde874e88 RDI: ffff902cde860000
[ 53.301360] RBP: ffff902cde860000 R08: ffff902cc0402bf0 R09: ffffffffa005d6e0
[ 53.301362] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000001050
[ 53.301363] R13: ffff902cde8789a8 R14: ffffffff9f89654e R15: ffff902d6da8cd48
[ 53.301365] FS: 0000000000000000(0000) GS:ffff9033beb00000(0000) knlGS:0000000000000000
[ 53.301367] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 53.301368] CR2: 000055d89e464448 CR3: 0000000161154000 CR4: 0000000000350ee0
[ 53.301371] Call Trace:
[ 53.301373] <TASK>
[ 53.301375] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.301906] ? __warn+0x81/0x130
[ 53.301912] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.302446] ? report_bug+0x171/0x1a0
[ 53.302453] ? handle_bug+0x3c/0x80
[ 53.302456] ? exc_invalid_op+0x17/0x70
[ 53.302459] ? asm_exc_invalid_op+0x1a/0x20
[ 53.302465] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.302995] jpeg_v4_0_hw_fini+0x6e/0xa0 [amdgpu]
[ 53.303550] jpeg_v4_0_suspend+0x12/0x30 [amdgpu]
[ 53.304092] amdgpu_device_ip_suspend_phase2+0x101/0x1a0 [amdgpu]
[ 53.304596] amdgpu_device_suspend+0x107/0x180 [amdgpu]
[ 53.305095] pci_pm_suspend+0x7f/0x170
[ 53.305101] ? __pfx_pci_pm_suspend+0x10/0x10
[ 53.305105] dpm_run_callback+0x8c/0x1e0
[ 53.305110] __device_suspend+0x10a/0x560
[ 53.305114] async_suspend+0x1e/0x70
[ 53.305117] async_run_entry_fn+0x34/0x130
[ 53.305122] process_one_work+0x1c7/0x3d0
[ 53.305125] worker_thread+0x51/0x390
[ 53.305128] ? __pfx_worker_thread+0x10/0x10
[ 53.305130] kthread+0xde/0x110
[ 53.305134] ? __pfx_kthread+0x10/0x10
[ 53.305137] ret_from_fork+0x2c/0x50
[ 53.305144] </TASK>
[ 53.305145] ---[ end trace 0000000000000000 ]---
[ 53.305167] ------------[ cut here ]------------
[ 53.305168] WARNING: CPU: 4 PID: 2610 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.305703] Modules linked in: uinput snd_seq_dummy snd_hrtimer xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat ip6table_filter iptable_mangle iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter bridge stp llc qrtr rfkill sunrpc binfmt_misc vfat fat snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi ledtrig_audio snd_usb_audio intel_rapl_msr intel_rapl_common snd_hda_intel edac_mce_amd snd_intel_dspcfg snd_intel_sdw_acpi snd_usbmidi_lib kvm_amd snd_hda_codec snd_rawmidi snd_hda_core mc snd_hwdep snd_seq kvm snd_seq_device snd_pcm irqbypass snd_timer wmi_bmof rapl snd k10temp i2c_piix4 joydev soundcore apple_mfi_fastcharge acpi_cpufreq squashfs loop zram amdgpu hid_roccat_koneplus hid_roccat hid_roccat_common drm_ttm_helper ttm video crct10dif_pclmul crc32_pclmul iommu_v2 crc32c_intel drm_buddy gpu_sched polyval_clmulni nvme polyval_generic drm_display_helper igb ucsi_ccg typec_ucsi ghash_clmulni_intel typec cec ccp sp5100_tco
[ 53.305752] sha512_ssse3 nvme_core dca i2c_algo_bit wmi nvme_common scsi_dh_rdac scsi_dh_emc scsi_dh_alua ip6_tables ip_tables dm_multipath fuse
[ 53.305759] CPU: 4 PID: 2610 Comm: kworker/u64:20 Tainted: G W 6.3.5-200.fc38.x86_64 #1
[ 53.305762] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570M Pro4, BIOS P1.90 09/10/2019
[ 53.305763] Workqueue: events_unbound async_run_entry_fn
[ 53.305768] RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.306296] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 3f db 91 de e9 5a fd ff ff <0f> 0b b8 ea ff ff ff e9 2e db 91 de b8 ea ff ff ff e9 24 db 91 de
[ 53.306298] RSP: 0018:ffffb56d8e723d30 EFLAGS: 00010246
[ 53.306300] RAX: ffff902cc1309498 RBX: ffff902cde860000 RCX: 0000000000000000
[ 53.306302] RDX: 0000000000000000 RSI: ffff902cde873a80 RDI: ffff902cde860000
[ 53.306303] RBP: 0000000000000000 R08: ffff902cc0402bf0 R09: ffffffffa005d6e0
[ 53.306304] R10: 0000000000000000 R11: 0000000000000000 R12: ffff902cde873a80
[ 53.306305] R13: 0000000000000001 R14: ffffffff9f89654e R15: ffff902d6da8cd48
[ 53.306307] FS: 0000000000000000(0000) GS:ffff9033beb00000(0000) knlGS:0000000000000000
[ 53.306309] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 53.306311] CR2: 000055d89e464448 CR3: 0000000161154000 CR4: 0000000000350ee0
[ 53.306312] Call Trace:
[ 53.306314] <TASK>
[ 53.306314] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.306844] ? __warn+0x81/0x130
[ 53.306848] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.307378] ? report_bug+0x171/0x1a0
[ 53.307383] ? handle_bug+0x3c/0x80
[ 53.307386] ? exc_invalid_op+0x17/0x70
[ 53.307389] ? asm_exc_invalid_op+0x1a/0x20
[ 53.307395] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.307922] vcn_v4_0_hw_fini+0x4d/0x100 [amdgpu]
[ 53.308468] vcn_v4_0_suspend+0x12/0x30 [amdgpu]
[ 53.309008] amdgpu_device_ip_suspend_phase2+0x101/0x1a0 [amdgpu]
[ 53.309510] amdgpu_device_suspend+0x107/0x180 [amdgpu]
[ 53.310009] pci_pm_suspend+0x7f/0x170
[ 53.310013] ? __pfx_pci_pm_suspend+0x10/0x10
[ 53.310018] dpm_run_callback+0x8c/0x1e0
[ 53.310022] __device_suspend+0x10a/0x560
[ 53.310025] async_suspend+0x1e/0x70
[ 53.310029] async_run_entry_fn+0x34/0x130
[ 53.310033] process_one_work+0x1c7/0x3d0
[ 53.310036] worker_thread+0x51/0x390
[ 53.310039] ? __pfx_worker_thread+0x10/0x10
[ 53.310041] kthread+0xde/0x110
[ 53.310044] ? __pfx_kthread+0x10/0x10
[ 53.310048] ret_from_fork+0x2c/0x50
[ 53.310054] </TASK>
[ 53.310055] ---[ end trace 0000000000000000 ]---
[ 53.310074] ------------[ cut here ]------------
[ 53.310074] WARNING: CPU: 4 PID: 2610 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.310608] Modules linked in: uinput snd_seq_dummy snd_hrtimer xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat ip6table_filter iptable_mangle iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter bridge stp llc qrtr rfkill sunrpc binfmt_misc vfat fat snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi ledtrig_audio snd_usb_audio intel_rapl_msr intel_rapl_common snd_hda_intel edac_mce_amd snd_intel_dspcfg snd_intel_sdw_acpi snd_usbmidi_lib kvm_amd snd_hda_codec snd_rawmidi snd_hda_core mc snd_hwdep snd_seq kvm snd_seq_device snd_pcm irqbypass snd_timer wmi_bmof rapl snd k10temp i2c_piix4 joydev soundcore apple_mfi_fastcharge acpi_cpufreq squashfs loop zram amdgpu hid_roccat_koneplus hid_roccat hid_roccat_common drm_ttm_helper ttm video crct10dif_pclmul crc32_pclmul iommu_v2 crc32c_intel drm_buddy gpu_sched polyval_clmulni nvme polyval_generic drm_display_helper igb ucsi_ccg typec_ucsi ghash_clmulni_intel typec cec ccp sp5100_tco
[ 53.310656] sha512_ssse3 nvme_core dca i2c_algo_bit wmi nvme_common scsi_dh_rdac scsi_dh_emc scsi_dh_alua ip6_tables ip_tables dm_multipath fuse
[ 53.310664] CPU: 4 PID: 2610 Comm: kworker/u64:20 Tainted: G W 6.3.5-200.fc38.x86_64 #1
[ 53.310666] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570M Pro4, BIOS P1.90 09/10/2019
[ 53.310667] Workqueue: events_unbound async_run_entry_fn
[ 53.310672] RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.311199] Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 3f db 91 de e9 5a fd ff ff <0f> 0b b8 ea ff ff ff e9 2e db 91 de b8 ea ff ff ff e9 24 db 91 de
[ 53.311201] RSP: 0018:ffffb56d8e723d30 EFLAGS: 00010246
[ 53.311203] RAX: ffff902cc13090a8 RBX: ffff902cde860000 RCX: 0000000000000000
[ 53.311204] RDX: 0000000000000000 RSI: ffff902cde8749b8 RDI: ffff902cde860000
[ 53.311206] RBP: 0000000000000001 R08: ffff902cc0402bf0 R09: ffffffffa005d6e0
[ 53.311207] R10: 0000000000000000 R11: 0000000000000000 R12: ffff902cde8749b8
[ 53.311208] R13: 0000000000000001 R14: ffffffff9f89654e R15: ffff902d6da8cd48
[ 53.311210] FS: 0000000000000000(0000) GS:ffff9033beb00000(0000) knlGS:0000000000000000
[ 53.311212] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 53.311213] CR2: 000055d89e464448 CR3: 0000000161154000 CR4: 0000000000350ee0
[ 53.311215] Call Trace:
[ 53.311216] <TASK>
[ 53.311217] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.311747] ? __warn+0x81/0x130
[ 53.311751] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.312278] ? report_bug+0x171/0x1a0
[ 53.312282] ? handle_bug+0x3c/0x80
[ 53.312285] ? exc_invalid_op+0x17/0x70
[ 53.312288] ? asm_exc_invalid_op+0x1a/0x20
[ 53.312294] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.312824] vcn_v4_0_hw_fini+0x4d/0x100 [amdgpu]
[ 53.313368] vcn_v4_0_suspend+0x12/0x30 [amdgpu]
[ 53.313907] amdgpu_device_ip_suspend_phase2+0x101/0x1a0 [amdgpu]
[ 53.314409] amdgpu_device_suspend+0x107/0x180 [amdgpu]
[ 53.314908] pci_pm_suspend+0x7f/0x170
[ 53.314912] ? __pfx_pci_pm_suspend+0x10/0x10
[ 53.314916] dpm_run_callback+0x8c/0x1e0
[ 53.314920] __device_suspend+0x10a/0x560
[ 53.314924] async_suspend+0x1e/0x70
[ 53.314928] async_run_entry_fn+0x34/0x130
[ 53.314932] process_one_work+0x1c7/0x3d0
[ 53.314935] worker_thread+0x51/0x390
[ 53.314938] ? __pfx_worker_thread+0x10/0x10
[ 53.314940] kthread+0xde/0x110
[ 53.314943] ? __pfx_kthread+0x10/0x10
[ 53.314946] ret_from_fork+0x2c/0x50
[ 53.314952] </TASK>
[ 53.314953] ---[ end trace 0000000000000000 ]---
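The three traces above are identical except for the function that calls amdgpu_irq_put, so it can help to reduce each one to its first reliable frame (the first frame not prefixed with "?"). Below is a minimal Python sketch of that reduction. The helper name `warning_callers` and the `SAMPLE` excerpt are illustrative assumptions, not part of any kernel tooling, and the regular expressions assume the dmesg line layout seen in this report.

```python
import re
from collections import Counter

# Leading "[ 53.300159] " style timestamp, as printed by dmesg in this report.
TS = re.compile(r"^\[\s*\d+\.\d+\]\s*")
# A stack frame such as "? foo+0x46/0x70 [mod]" or "foo+0x46/0x70".
FRAME = re.compile(r"^(\?\s+)?([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+")


def warning_callers(dmesg_text):
    """Return the first frame without a '?' prefix from each call trace."""
    callers = []
    in_trace = False
    for raw in dmesg_text.splitlines():
        line = TS.sub("", raw.strip())
        if line.startswith("Call Trace:"):
            in_trace = True
            continue
        if not in_trace:
            continue
        if line.startswith("</TASK>"):
            in_trace = False
            continue
        m = FRAME.match(line)
        if m and m.group(1) is None:
            # First reliable frame: this is the caller that hit the warning.
            callers.append(m.group(2))
            in_trace = False
    return callers


# Hypothetical excerpt: a few lines copied from the first trace in this report.
SAMPLE = """\
[ 53.301371] Call Trace:
[ 53.301373] <TASK>
[ 53.301375] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.301906] ? __warn+0x81/0x130
[ 53.302465] ? amdgpu_irq_put+0x46/0x70 [amdgpu]
[ 53.302995] jpeg_v4_0_hw_fini+0x6e/0xa0 [amdgpu]
[ 53.303550] jpeg_v4_0_suspend+0x12/0x30 [amdgpu]
[ 53.305144] </TASK>
"""

if __name__ == "__main__":
    print(Counter(warning_callers(SAMPLE)))
```

Run against the full log above, this should report jpeg_v4_0_hw_fini once and vcn_v4_0_hw_fini twice, which matches the three traces: every warning is raised from a JPEG or VCN hw_fini path during suspend.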
Tested on Fedora 38 with kernel 6.3.5-200.fc38.x86_64.
We probably need to backport
https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/?id=bc3e1d60f933f823599376f830eb99451afb995a
and https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/?id=020c76d983151f6f6c9493a3bbe83c1ec927617a
to the stable branch?
Edited by Jocelyn Falempe