amdgpu warning at amdgpu_irq.c:656 (amdgpu_irq_put) upon resuming after suspend to RAM
Brief summary of the problem:
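The warning below appears in the kernel log after resuming from suspend to RAM. It is emitted three times in a row from amdgpu_irq_put, called from sdma_v4_0_hw_fini, gfx_v9_0_hw_fini and gmc_v9_0_hw_fini respectively, all under amdgpu_device_suspend. It did not happen on 6.1.25. It may be similar to #2547 (closed).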
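For readers unfamiliar with this kind of warning, here is a toy model of a per-type interrupt enable counter. It is an illustration only and not the kernel source: the class `IrqSource` and its methods are hypothetical names. It assumes the warning fires when a "put" arrives while the counter is already zero. That assumption appears consistent with the register dumps in the log, where RCX is 0 at each warning, but it has not been checked against the 6.1.27 source.

```python
import warnings

EINVAL = 22  # standard errno value for "invalid argument"


class IrqSource:
    """Hypothetical stand-in for a per-type interrupt enable counter."""

    def __init__(self, num_types):
        # One reference count per interrupt type, all starting disabled.
        self.enabled = [0] * num_types

    def get(self, irq_type):
        """Take a reference, i.e. enable the interrupt type."""
        self.enabled[irq_type] += 1

    def put(self, irq_type):
        """Drop a reference; warn if there is nothing to drop."""
        if self.enabled[irq_type] == 0:
            warnings.warn("put without a matching get")
            return -EINVAL
        self.enabled[irq_type] -= 1
        return 0


if __name__ == "__main__":
    src = IrqSource(num_types=1)
    src.get(0)
    print(src.put(0))  # balanced get/put
    print(src.put(0))  # unbalanced put: emits a warning
```

In this model, a teardown path that calls `put` for an interrupt that was never enabled with `get`, or that was already released, triggers the warning without otherwise corrupting state.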
Hardware description:
AMD Ryzen 7 2700U with Radeon Vega Mobile Gfx
BIOS Information
- Vendor: LENOVO
- Version: 8PCN58WW
- Release Date: 03/26/2020
System information:
- Distro name and Version: Devuan Chimaera
- Kernel version: 6.1.27
- Custom kernel: No
- AMD official driver version: No
Log files (for system lockups / game freezes / crashes)
- Dmesg extract
May 7 12:44:31 localhost kernel: [ 140.710784] ------------[ cut here ]------------
May 7 12:44:31 localhost kernel: [ 140.710788] WARNING: CPU: 2 PID: 7507 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:656 amdgpu_irq_put+0x45/0x70 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.711103] Modules linked in: rfcomm(E) fuse(E) cmac(E) algif_hash(E) algif_skcipher(E) af_alg(E) bnep(E) efivarfs(E) binfmt_misc(E) nls_ascii(E) nls_cp437(E) vfat(E) fat(E) i2c_dev(E) dm_crypt(E) dm_mod(E) btusb(E) btrtl(E) btbcm(E) btintel(E) bluetooth(E) jitterentropy_rng(E) uvcvideo(E) videobuf2_vmalloc(E) videobuf2_memops(E) videobuf2_v4l2(E) videobuf2_common(E) drbg(E) ansi_cprng(E) videodev(E) ecdh_generic(E) ecc(E) crc16(E) mc(E) amdgpu(E) intel_rapl_msr(E) intel_rapl_common(E) edac_mce_amd(E) ath10k_pci(E) ath10k_core(E) kvm_amd(E) ath(E) joydev(E) kvm(E) snd_hda_codec_realtek(E) irqbypass(E) snd_hda_codec_generic(E) crct10dif_pclmul(E) mac80211(E) ledtrig_audio(E) crc32_pclmul(E) crc32c_intel(E) snd_hda_codec_hdmi(E) snd_hda_intel(E) drm_ttm_helper(E) snd_intel_dspcfg(E) ttm(E) hid_multitouch(E) ghash_clmulni_intel(E) gpu_sched(E) snd_hda_codec(E) sha512_ssse3(E) i2c_algo_bit(E) hid_generic(E) sha512_generic(E) drm_buddy(E) wmi_bmof(E) snd_hwdep(E) i2c_hid_acpi(E)
May 7 12:44:31 localhost kernel: [ 140.711197] drm_display_helper(E) i2c_hid(E) cec(E) snd_hda_core(E) cfg80211(E) drm_kms_helper(E) hid(E) snd_pcm(E) ideapad_laptop(E) evdev(E) aesni_intel(E) sparse_keymap(E) xhci_pci(E) snd_timer(E) crypto_simd(E) cryptd(E) serio_raw(E) efi_pstore(E) xhci_hcd(E) snd(E) platform_profile(E) drm(E) usbcore(E) tpm_crb(E) rfkill(E) k10temp(E) mfd_core(E) ccp(E) libarc4(E) soundcore(E) usb_common(E) video(E) battery(E) tpm_tis(E) tpm_tis_core(E) wmi(E) i2c_designware_platform(E) tpm(E) tiny_power_button(E) iosf_mbi(E) i2c_amd_mp2_plat(E) rng_core(E) i2c_designware_core(E) i2c_amd_mp2_pci(E) ac(E) button(E) acpi_cpufreq(E)
May 7 12:44:31 localhost kernel: [ 140.711248] CPU: 2 PID: 7507 Comm: kworker/u32:7 Tainted: G E 6.1.27 #1
May 7 12:44:31 localhost kernel: [ 140.711253] Hardware name: LENOVO 81H1/LNVNB161216, BIOS 8PCN58WW 03/26/2020
May 7 12:44:31 localhost kernel: [ 140.711255] Workqueue: events_unbound async_run_entry_fn
May 7 12:44:31 localhost kernel: [ 140.711261] RIP: 0010:amdgpu_irq_put+0x45/0x70 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.711562] Code: 48 8b 4e 10 48 83 39 00 74 2c 89 d1 48 8d 04 88 8b 08 85 c9 74 14 f0 ff 08 b8 00 00 00 00 74 05 e9 c0 1c 9b ee e9 6b fd ff ff <0f> 0b b8 ea ff ff ff e9 af 1c 9b ee b8 ea ff ff ff e9 a5 1c 9b ee
May 7 12:44:31 localhost kernel: [ 140.711566] RSP: 0018:ffffac0503763d48 EFLAGS: 00010246
May 7 12:44:31 localhost kernel: [ 140.711569] RAX: ffff8954b2a17ba0 RBX: 0000000000000001 RCX: 0000000000000000
May 7 12:44:31 localhost kernel: [ 140.711571] RDX: 0000000000000000 RSI: ffff8954aa80f3e0 RDI: ffff8954aa800000
May 7 12:44:31 localhost kernel: [ 140.711573] RBP: ffff8954aa800000 R08: 0000000000000000 R09: 000000000008c7e8
May 7 12:44:31 localhost kernel: [ 140.711575] R10: 0000000000000002 R11: 0000000000000100 R12: ffff8954aa80f3e0
May 7 12:44:31 localhost kernel: [ 140.711577] R13: 0000000000000001 R14: 0000000000000002 R15: ffff895480074a05
May 7 12:44:31 localhost kernel: [ 140.711579] FS: 0000000000000000(0000) GS:ffff8955a7280000(0000) knlGS:0000000000000000
May 7 12:44:31 localhost kernel: [ 140.711582] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 7 12:44:31 localhost kernel: [ 140.711584] CR2: 00005610c5694c96 CR3: 0000000132f4c000 CR4: 00000000003506e0
May 7 12:44:31 localhost kernel: [ 140.711587] Call Trace:
May 7 12:44:31 localhost kernel: [ 140.711591] <TASK>
May 7 12:44:31 localhost kernel: [ 140.711593] sdma_v4_0_hw_fini+0x37/0xa0 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.711906] amdgpu_device_ip_suspend_phase2+0x119/0x1b0 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.712195] amdgpu_device_suspend+0xda/0x170 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.712464] pci_pm_suspend+0x94/0x1c0
May 7 12:44:31 localhost kernel: [ 140.712471] ? pci_pm_freeze+0xe0/0xe0
May 7 12:44:31 localhost kernel: [ 140.712475] dpm_run_callback+0x60/0x180
May 7 12:44:31 localhost kernel: [ 140.712480] __device_suspend+0x140/0x4c0
May 7 12:44:31 localhost kernel: [ 140.712484] async_suspend+0x25/0xa0
May 7 12:44:31 localhost kernel: [ 140.712487] async_run_entry_fn+0x30/0x130
May 7 12:44:31 localhost kernel: [ 140.712491] process_one_work+0x1e5/0x3b0
May 7 12:44:31 localhost kernel: [ 140.712497] worker_thread+0x50/0x3b0
May 7 12:44:31 localhost kernel: [ 140.712502] ? rescuer_thread+0x390/0x390
May 7 12:44:31 localhost kernel: [ 140.712506] kthread+0xe7/0x110
May 7 12:44:31 localhost kernel: [ 140.712510] ? kthread_complete_and_exit+0x20/0x20
May 7 12:44:31 localhost kernel: [ 140.712514] ret_from_fork+0x22/0x30
May 7 12:44:31 localhost kernel: [ 140.712521] </TASK>
May 7 12:44:31 localhost kernel: [ 140.712523] ---[ end trace 0000000000000000 ]---
May 7 12:44:31 localhost kernel: [ 140.713081] ------------[ cut here ]------------
May 7 12:44:31 localhost kernel: [ 140.713082] WARNING: CPU: 2 PID: 7507 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:656 amdgpu_irq_put+0x45/0x70 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.713402] Modules linked in: rfcomm(E) fuse(E) cmac(E) algif_hash(E) algif_skcipher(E) af_alg(E) bnep(E) efivarfs(E) binfmt_misc(E) nls_ascii(E) nls_cp437(E) vfat(E) fat(E) i2c_dev(E) dm_crypt(E) dm_mod(E) btusb(E) btrtl(E) btbcm(E) btintel(E) bluetooth(E) jitterentropy_rng(E) uvcvideo(E) videobuf2_vmalloc(E) videobuf2_memops(E) videobuf2_v4l2(E) videobuf2_common(E) drbg(E) ansi_cprng(E) videodev(E) ecdh_generic(E) ecc(E) crc16(E) mc(E) amdgpu(E) intel_rapl_msr(E) intel_rapl_common(E) edac_mce_amd(E) ath10k_pci(E) ath10k_core(E) kvm_amd(E) ath(E) joydev(E) kvm(E) snd_hda_codec_realtek(E) irqbypass(E) snd_hda_codec_generic(E) crct10dif_pclmul(E) mac80211(E) ledtrig_audio(E) crc32_pclmul(E) crc32c_intel(E) snd_hda_codec_hdmi(E) snd_hda_intel(E) drm_ttm_helper(E) snd_intel_dspcfg(E) ttm(E) hid_multitouch(E) ghash_clmulni_intel(E) gpu_sched(E) snd_hda_codec(E) sha512_ssse3(E) i2c_algo_bit(E) hid_generic(E) sha512_generic(E) drm_buddy(E) wmi_bmof(E) snd_hwdep(E) i2c_hid_acpi(E)
May 7 12:44:31 localhost kernel: [ 140.713461] drm_display_helper(E) i2c_hid(E) cec(E) snd_hda_core(E) cfg80211(E) drm_kms_helper(E) hid(E) snd_pcm(E) ideapad_laptop(E) evdev(E) aesni_intel(E) sparse_keymap(E) xhci_pci(E) snd_timer(E) crypto_simd(E) cryptd(E) serio_raw(E) efi_pstore(E) xhci_hcd(E) snd(E) platform_profile(E) drm(E) usbcore(E) tpm_crb(E) rfkill(E) k10temp(E) mfd_core(E) ccp(E) libarc4(E) soundcore(E) usb_common(E) video(E) battery(E) tpm_tis(E) tpm_tis_core(E) wmi(E) i2c_designware_platform(E) tpm(E) tiny_power_button(E) iosf_mbi(E) i2c_amd_mp2_plat(E) rng_core(E) i2c_designware_core(E) i2c_amd_mp2_pci(E) ac(E) button(E) acpi_cpufreq(E)
May 7 12:44:31 localhost kernel: [ 140.713500] CPU: 2 PID: 7507 Comm: kworker/u32:7 Tainted: G W E 6.1.27 #1
May 7 12:44:31 localhost kernel: [ 140.713503] Hardware name: LENOVO 81H1/LNVNB161216, BIOS 8PCN58WW 03/26/2020
May 7 12:44:31 localhost kernel: [ 140.713505] Workqueue: events_unbound async_run_entry_fn
May 7 12:44:31 localhost kernel: [ 140.713508] RIP: 0010:amdgpu_irq_put+0x45/0x70 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.713806] Code: 48 8b 4e 10 48 83 39 00 74 2c 89 d1 48 8d 04 88 8b 08 85 c9 74 14 f0 ff 08 b8 00 00 00 00 74 05 e9 c0 1c 9b ee e9 6b fd ff ff <0f> 0b b8 ea ff ff ff e9 af 1c 9b ee b8 ea ff ff ff e9 a5 1c 9b ee
May 7 12:44:31 localhost kernel: [ 140.713809] RSP: 0018:ffffac0503763d40 EFLAGS: 00010246
May 7 12:44:31 localhost kernel: [ 140.713812] RAX: ffff89548920c200 RBX: ffff8954aa817460 RCX: 0000000000000000
May 7 12:44:31 localhost kernel: [ 140.713813] RDX: 0000000000000000 RSI: ffff8954aa80bb80 RDI: ffff8954aa800000
May 7 12:44:31 localhost kernel: [ 140.713815] RBP: ffff8954aa800000 R08: ffff8954aa805e28 R09: 000000000008c7e8
May 7 12:44:31 localhost kernel: [ 140.713817] R10: 0000000000000002 R11: 0000000000000100 R12: ffff8954aa8173f0
May 7 12:44:31 localhost kernel: [ 140.713818] R13: 0000000000000001 R14: 0000000000000002 R15: ffff895480074a05
May 7 12:44:31 localhost kernel: [ 140.713820] FS: 0000000000000000(0000) GS:ffff8955a7280000(0000) knlGS:0000000000000000
May 7 12:44:31 localhost kernel: [ 140.713823] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 7 12:44:31 localhost kernel: [ 140.713825] CR2: 00005610c5694c96 CR3: 0000000132f4c000 CR4: 00000000003506e0
May 7 12:44:31 localhost kernel: [ 140.713827] Call Trace:
May 7 12:44:31 localhost kernel: [ 140.713829] <TASK>
May 7 12:44:31 localhost kernel: [ 140.713830] gfx_v9_0_hw_fini+0x1c/0x760 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.714156] amdgpu_device_ip_suspend_phase2+0x119/0x1b0 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.714424] amdgpu_device_suspend+0xda/0x170 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.714692] pci_pm_suspend+0x94/0x1c0
May 7 12:44:31 localhost kernel: [ 140.714695] ? pci_pm_freeze+0xe0/0xe0
May 7 12:44:31 localhost kernel: [ 140.714698] dpm_run_callback+0x60/0x180
May 7 12:44:31 localhost kernel: [ 140.714702] __device_suspend+0x140/0x4c0
May 7 12:44:31 localhost kernel: [ 140.714706] async_suspend+0x25/0xa0
May 7 12:44:31 localhost kernel: [ 140.714709] async_run_entry_fn+0x30/0x130
May 7 12:44:31 localhost kernel: [ 140.714712] process_one_work+0x1e5/0x3b0
May 7 12:44:31 localhost kernel: [ 140.714716] worker_thread+0x50/0x3b0
May 7 12:44:31 localhost kernel: [ 140.714721] ? rescuer_thread+0x390/0x390
May 7 12:44:31 localhost kernel: [ 140.714725] kthread+0xe7/0x110
May 7 12:44:31 localhost kernel: [ 140.714728] ? kthread_complete_and_exit+0x20/0x20
May 7 12:44:31 localhost kernel: [ 140.714731] ret_from_fork+0x22/0x30
May 7 12:44:31 localhost kernel: [ 140.714737] </TASK>
May 7 12:44:31 localhost kernel: [ 140.714738] ---[ end trace 0000000000000000 ]---
May 7 12:44:31 localhost kernel: [ 140.732140] amdgpu 0000:03:00.0: amdgpu: free PSP TMR buffer
May 7 12:44:31 localhost kernel: [ 140.783643] ------------[ cut here ]------------
May 7 12:44:31 localhost kernel: [ 140.783644] WARNING: CPU: 2 PID: 7507 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:656 amdgpu_irq_put+0x45/0x70 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.783944] Modules linked in: rfcomm(E) fuse(E) cmac(E) algif_hash(E) algif_skcipher(E) af_alg(E) bnep(E) efivarfs(E) binfmt_misc(E) nls_ascii(E) nls_cp437(E) vfat(E) fat(E) i2c_dev(E) dm_crypt(E) dm_mod(E) btusb(E) btrtl(E) btbcm(E) btintel(E) bluetooth(E) jitterentropy_rng(E) uvcvideo(E) videobuf2_vmalloc(E) videobuf2_memops(E) videobuf2_v4l2(E) videobuf2_common(E) drbg(E) ansi_cprng(E) videodev(E) ecdh_generic(E) ecc(E) crc16(E) mc(E) amdgpu(E) intel_rapl_msr(E) intel_rapl_common(E) edac_mce_amd(E) ath10k_pci(E) ath10k_core(E) kvm_amd(E) ath(E) joydev(E) kvm(E) snd_hda_codec_realtek(E) irqbypass(E) snd_hda_codec_generic(E) crct10dif_pclmul(E) mac80211(E) ledtrig_audio(E) crc32_pclmul(E) crc32c_intel(E) snd_hda_codec_hdmi(E) snd_hda_intel(E) drm_ttm_helper(E) snd_intel_dspcfg(E) ttm(E) hid_multitouch(E) ghash_clmulni_intel(E) gpu_sched(E) snd_hda_codec(E) sha512_ssse3(E) i2c_algo_bit(E) hid_generic(E) sha512_generic(E) drm_buddy(E) wmi_bmof(E) snd_hwdep(E) i2c_hid_acpi(E)
May 7 12:44:31 localhost kernel: [ 140.784002] drm_display_helper(E) i2c_hid(E) cec(E) snd_hda_core(E) cfg80211(E) drm_kms_helper(E) hid(E) snd_pcm(E) ideapad_laptop(E) evdev(E) aesni_intel(E) sparse_keymap(E) xhci_pci(E) snd_timer(E) crypto_simd(E) cryptd(E) serio_raw(E) efi_pstore(E) xhci_hcd(E) snd(E) platform_profile(E) drm(E) usbcore(E) tpm_crb(E) rfkill(E) k10temp(E) mfd_core(E) ccp(E) libarc4(E) soundcore(E) usb_common(E) video(E) battery(E) tpm_tis(E) tpm_tis_core(E) wmi(E) i2c_designware_platform(E) tpm(E) tiny_power_button(E) iosf_mbi(E) i2c_amd_mp2_plat(E) rng_core(E) i2c_designware_core(E) i2c_amd_mp2_pci(E) ac(E) button(E) acpi_cpufreq(E)
May 7 12:44:31 localhost kernel: [ 140.784042] CPU: 2 PID: 7507 Comm: kworker/u32:7 Tainted: G W E 6.1.27 #1
May 7 12:44:31 localhost kernel: [ 140.784045] Hardware name: LENOVO 81H1/LNVNB161216, BIOS 8PCN58WW 03/26/2020
May 7 12:44:31 localhost kernel: [ 140.784046] Workqueue: events_unbound async_run_entry_fn
May 7 12:44:31 localhost kernel: [ 140.784050] RIP: 0010:amdgpu_irq_put+0x45/0x70 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.784368] Code: 48 8b 4e 10 48 83 39 00 74 2c 89 d1 48 8d 04 88 8b 08 85 c9 74 14 f0 ff 08 b8 00 00 00 00 74 05 e9 c0 1c 9b ee e9 6b fd ff ff <0f> 0b b8 ea ff ff ff e9 af 1c 9b ee b8 ea ff ff ff e9 a5 1c 9b ee
May 7 12:44:31 localhost kernel: [ 140.784371] RSP: 0018:ffffac0503763d58 EFLAGS: 00010246
May 7 12:44:31 localhost kernel: [ 140.784374] RAX: ffff89548920c740 RBX: ffff8954aa817410 RCX: 0000000000000000
May 7 12:44:31 localhost kernel: [ 140.784376] RDX: 0000000000000000 RSI: ffff8954aa8024d8 RDI: ffff8954aa800000
May 7 12:44:31 localhost kernel: [ 140.784377] RBP: ffff8954aa800000 R08: 0000000000000000 R09: ffff8955a72a9340
May 7 12:44:31 localhost kernel: [ 140.784379] R10: ffff8955a72a8940 R11: ffff8955a72a7624 R12: ffff8954aa8173f0
May 7 12:44:31 localhost kernel: [ 140.784380] R13: 0000000000000001 R14: 0000000000000002 R15: ffff895480074a05
May 7 12:44:31 localhost kernel: [ 140.784382] FS: 0000000000000000(0000) GS:ffff8955a7280000(0000) knlGS:0000000000000000
May 7 12:44:31 localhost kernel: [ 140.784385] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 7 12:44:31 localhost kernel: [ 140.784387] CR2: 00005610c5694c96 CR3: 0000000132f4c000 CR4: 00000000003506e0
May 7 12:44:31 localhost kernel: [ 140.784389] Call Trace:
May 7 12:44:31 localhost kernel: [ 140.784391] <TASK>
May 7 12:44:31 localhost kernel: [ 140.784392] gmc_v9_0_hw_fini+0x60/0xa0 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.784688] amdgpu_device_ip_suspend_phase2+0x119/0x1b0 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.784956] amdgpu_device_suspend+0xda/0x170 [amdgpu]
May 7 12:44:31 localhost kernel: [ 140.785244] pci_pm_suspend+0x94/0x1c0
May 7 12:44:31 localhost kernel: [ 140.785248] ? pci_pm_freeze+0xe0/0xe0
May 7 12:44:31 localhost kernel: [ 140.785251] dpm_run_callback+0x60/0x180
May 7 12:44:31 localhost kernel: [ 140.785255] __device_suspend+0x140/0x4c0
May 7 12:44:31 localhost kernel: [ 140.785259] async_suspend+0x25/0xa0
May 7 12:44:31 localhost kernel: [ 140.785262] async_run_entry_fn+0x30/0x130
May 7 12:44:31 localhost kernel: [ 140.785265] process_one_work+0x1e5/0x3b0
May 7 12:44:31 localhost kernel: [ 140.785270] worker_thread+0x50/0x3b0
May 7 12:44:31 localhost kernel: [ 140.785274] ? rescuer_thread+0x390/0x390
May 7 12:44:31 localhost kernel: [ 140.785278] kthread+0xe7/0x110
May 7 12:44:31 localhost kernel: [ 140.785281] ? kthread_complete_and_exit+0x20/0x20
May 7 12:44:31 localhost kernel: [ 140.785284] ret_from_fork+0x22/0x30
May 7 12:44:31 localhost kernel: [ 140.785290] </TASK>
May 7 12:44:31 localhost kernel: [ 140.785291] ---[ end trace 0000000000000000 ]---
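The three traces above differ mainly in their first call-trace frame. As a convenience for triage, the following sketch pulls out, for each WARNING block, the source location, the warning function, and the first non-speculative caller (frames prefixed with `?` are skipped). The function name `summarize_warnings` and the regular expressions are assumptions written for this log layout and may need adjusting for other kernels or log formats.

```python
import re

# "WARNING: CPU: 2 PID: 7507 at <file:line> <func>+0x45/0x70 [module]"
WARN_RE = re.compile(
    r"WARNING: CPU: \d+ PID: \d+ at (\S+) (\S+)\+0x[0-9a-f]+/0x[0-9a-f]+"
)
# A call-trace frame after the "[ timestamp]" field, optionally prefixed by "? "
FRAME_RE = re.compile(
    r"\]\s+(\??\s*)([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+"
)


def summarize_warnings(lines):
    """Return (location, warning function, first caller) per WARNING block."""
    results = []
    current = None      # [location, function, first_caller] being built
    in_trace = False    # True between <TASK> and </TASK>
    for line in lines:
        warn = WARN_RE.search(line)
        if warn:
            current = [warn.group(1), warn.group(2), None]
            in_trace = False
            continue
        if current is None:
            continue
        if "</TASK>" in line:
            results.append(tuple(current))
            current = None
            in_trace = False
        elif "<TASK>" in line:
            in_trace = True
        elif in_trace and current[2] is None:
            frame = FRAME_RE.search(line)
            # Skip "? func" frames, which the kernel marks as unreliable.
            if frame and not frame.group(1):
                current[2] = frame.group(2)
    return results


if __name__ == "__main__":
    sample = [
        "kernel: [ 140.710788] WARNING: CPU: 2 PID: 7507 at "
        "drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:656 "
        "amdgpu_irq_put+0x45/0x70 [amdgpu]",
        "kernel: [ 140.711261] RIP: 0010:amdgpu_irq_put+0x45/0x70 [amdgpu]",
        "kernel: [ 140.711591] <TASK>",
        "kernel: [ 140.711593] sdma_v4_0_hw_fini+0x37/0xa0 [amdgpu]",
        "kernel: [ 140.711906] amdgpu_device_ip_suspend_phase2+0x119/0x1b0 [amdgpu]",
        "kernel: [ 140.712521] </TASK>",
    ]
    for location, func, caller in summarize_warnings(sample):
        print(f"{location} {func} <- {caller}")
```

Run against the full extract (for example with `summarize_warnings(open(path))`, where `path` is wherever the log was saved), this should list `sdma_v4_0_hw_fini`, `gfx_v9_0_hw_fini` and `gmc_v9_0_hw_fini` as the three callers.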
Edited by Mario Limonciello