Occasional failure to resume on Renoir (Asus ZenBook 14 UM425IA)
On my ZenBook running the Fedora 5.12 kernel with the resume patches applied, the machine sometimes fails to resume. This happens especially often when power is plugged in or unplugged during s0ix sleep. Using the magic SysRq key, it is possible to obtain the logs:
Jun 19 14:44:35 snowball3 kernel: PM: suspend entry (s2idle)
Jun 19 14:44:35 snowball3 kernel: Filesystems sync: 0.037 seconds
Jun 19 14:50:11 snowball3 kernel: Freezing user space processes ... (elapsed 0.002 seconds) done.
Jun 19 14:50:11 snowball3 kernel: OOM killer disabled.
Jun 19 14:50:11 snowball3 kernel: Freezing remaining freezable tasks ... (elapsed 0.000 seconds) done.
Jun 19 14:50:11 snowball3 kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Jun 19 14:50:11 snowball3 kernel: PM: suspend devices took 0.116 seconds
Jun 19 14:50:11 snowball3 kernel: ACPI: EC: interrupt blocked
Jun 19 14:50:11 snowball3 kernel: ccp 0000:04:00.2: refused to change power state from D0 to D3hot
Jun 19 14:50:11 snowball3 kernel: xhci_hcd 0000:04:00.3: refused to change power state from D0 to D3hot
Jun 19 14:50:11 snowball3 kernel: ACPI: EC: interrupt unblocked
Jun 19 14:50:11 snowball3 kernel: pci 0000:00:00.2: can't derive routing for PCI INT A
Jun 19 14:50:11 snowball3 kernel: pci 0000:00:00.2: PCI INT A: no GSI
Jun 19 14:50:11 snowball3 kernel: nvme nvme0: 8/0/0 default/read/poll queues
Jun 19 14:50:11 snowball3 kernel: ata2: SATA link down (SStatus 0 SControl 300)
Jun 19 14:50:11 snowball3 kernel: ata1: SATA link down (SStatus 0 SControl 300)
Jun 19 14:50:11 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6
Jun 19 14:50:11 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: failed to write reg 1a6f4 wait reg 1a706
Jun 19 14:50:11 snowball3 kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000F400900000).
Jun 19 14:50:11 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resuming...
Jun 19 14:50:11 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: message: SetDriverDramAddrHigh (26) param: 0x000000f4 is timeout (no response)
Jun 19 14:50:11 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: Failed to SetDriverDramAddr!
Jun 19 14:50:11 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: Failed to setup smc hw!
Jun 19 14:50:11 snowball3 kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <smu> failed -62
Jun 19 14:50:11 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: amdgpu_device_ip_resume failed (-62).
Jun 19 14:50:11 snowball3 kernel: PM: dpm_run_callback(): pci_pm_resume+0x0/0xe0 returns -62
Jun 19 14:50:11 snowball3 kernel: amdgpu 0000:04:00.0: PM: failed to resume async: error -62
Jun 19 14:50:11 snowball3 kernel: ACPI Error: Aborting method \_SB.A011 due to previous error (AE_AML_LOOP_TIMEOUT) (20210105/psparse-529)
Jun 19 14:50:11 snowball3 kernel: ACPI Error: Aborting method \_SB.A004 due to previous error (AE_AML_LOOP_TIMEOUT) (20210105/psparse-529)
Jun 19 14:50:11 snowball3 kernel: ACPI Error: Aborting method \_SB.A019 due to previous error (AE_AML_LOOP_TIMEOUT) (20210105/psparse-529)
Jun 19 14:50:11 snowball3 kernel: ACPI Error: Aborting method \_SB.ALIB due to previous error (AE_AML_LOOP_TIMEOUT) (20210105/psparse-529)
Jun 19 14:50:11 snowball3 kernel: ACPI Error: Aborting method \_SB.PCI0.SBRG.EC0.ACAD._PSR due to previous error (AE_AML_LOOP_TIMEOUT) (20210105/psparse-529)
Jun 19 14:50:11 snowball3 kernel: ACPI: \_SB_.PCI0.SBRG.EC0_.ACAD: Error reading AC Adapter state: AE_AML_LOOP_TIMEOUT
Jun 19 14:50:11 snowball3 kernel: ACPI: button: The lid device is not compliant to SW_LID.
Jun 19 14:50:11 snowball3 kernel: ACPI Error: Aborting method \_SB.A011 due to previous error (AE_AML_LOOP_TIMEOUT) (20210105/psparse-529)
Jun 19 14:50:11 snowball3 kernel: ACPI Error: Aborting method \_SB.A025 due to previous error (AE_AML_LOOP_TIMEOUT) (20210105/psparse-529)
Jun 19 14:50:11 snowball3 kernel: ACPI Error: Aborting method \_SB.ALIB due to previous error (AE_AML_LOOP_TIMEOUT) (20210105/psparse-529)
Jun 19 14:50:11 snowball3 kernel: ACPI Error: Aborting method \_SB.PCI0.SBRG.EC0.DPTC due to previous error (AE_AML_LOOP_TIMEOUT) (20210105/psparse-529)
Jun 19 14:50:11 snowball3 kernel: ACPI Error: Aborting method \_SB.PCI0.SBRG.EC0._QF2 due to previous error (AE_AML_LOOP_TIMEOUT) (20210105/psparse-529)
Jun 19 14:50:11 snowball3 kernel: PM: resume devices took 60.037 seconds
Jun 19 14:50:11 snowball3 kernel: ------------[ cut here ]------------
Jun 19 14:50:11 snowball3 kernel: Component: resume devices, time: 60037
Jun 19 14:50:11 snowball3 kernel: WARNING: CPU: 2 PID: 36074 at kernel/power/suspend_test.c:53 suspend_test_finish+0x71/0x80
Jun 19 14:50:11 snowball3 kernel: Modules linked in: dm_crypt trusted uinput rfcomm snd_seq_dummy snd_hrtimer xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_nat_tftp nf_conntrack_tftp bridge stp llc ccm cmac nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_mangle iptable_raw iptable_security ip_set nf_tables nfnetlink ip6table_filter ip6_tables iptable_filter bnep sunrpc vfat fat snd_acp3x_rn snd_soc_dmic snd_acp3x_pdm_dma intel_rapl_msr snd_soc_core intel_rapl_common snd_compress edac_mce_amd snd_pcm_dmaengine ac97_bus kvm_amd kvm snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio iwlmvm snd_hda_codec_hdmi irqbypass rapl snd_hda_intel snd_intel_dspcfg asus_nb_wmi asus_wmi snd_intel_sdw_acpi sparse_keymap wmi_bmof pcspkr mac80211
Jun 19 14:50:11 snowball3 kernel: snd_hda_codec joydev libarc4 uvcvideo snd_hda_core videobuf2_vmalloc snd_hwdep videobuf2_memops videobuf2_v4l2 iwlwifi snd_seq btusb videobuf2_common snd_seq_device btrtl btbcm videodev btintel k10temp bluetooth snd_pcm cfg80211 mc snd_timer snd ecdh_generic ucsi_acpi ecc snd_rn_pci_acp3x typec_ucsi soundcore snd_pci_acp3x rfkill i2c_piix4 typec cm32181 industrialio acpi_cpufreq amd_pmc binfmt_misc zram ip_tables amdgpu rtsx_pci_sdmmc mmc_core drm_ttm_helper ttm crct10dif_pclmul hid_multitouch crc32_pclmul crc32c_intel iommu_v2 gpu_sched i2c_algo_bit ghash_clmulni_intel drm_kms_helper nvme serio_raw cec drm wmi ccp rtsx_pci nvme_core sp5100_tco video i2c_hid_acpi i2c_hid fuse
Jun 19 14:50:11 snowball3 kernel: CPU: 2 PID: 36074 Comm: systemd-sleep Not tainted 5.12.12-300.s0ix01.fc34.x86_64 #1
Jun 19 14:50:11 snowball3 kernel: Hardware name: ASUSTeK COMPUTER INC. ZenBook UX425IA_UM425IA/UX425IA, BIOS UX425IA.309 01/29/2021
Jun 19 14:50:11 snowball3 kernel: RIP: 0010:suspend_test_finish+0x71/0x80
Jun 19 14:50:11 snowball3 kernel: Code: e8 03 00 00 29 c1 e8 38 14 a5 00 41 81 fc 10 27 00 00 77 04 5d 41 5c c3 44 89 e2 48 89 ee 48 c7 c7 c8 fe 3a a0 e8 cc b7 a4 00 <0f> 0b 5d 41 5c c3 cc cc cc cc cc cc cc cc cc 0f 1f 44 00 00 0f b6
Jun 19 14:50:11 snowball3 kernel: RSP: 0018:ffffc1df01f53d68 EFLAGS: 00010296
Jun 19 14:50:11 snowball3 kernel: RAX: 0000000000000026 RBX: 0000000000000000 RCX: 0000000000000027
Jun 19 14:50:11 snowball3 kernel: RDX: ffff9c337f6985c8 RSI: 0000000000000001 RDI: ffff9c337f6985c0
Jun 19 14:50:11 snowball3 kernel: RBP: ffffffffa03afe11 R08: ffffffffa0a65f20 R09: 0000000000000004
Jun 19 14:50:11 snowball3 kernel: R10: ffffffffffffffff R11: ffffffffa1549e06 R12: 000000000000ea85
Jun 19 14:50:11 snowball3 kernel: R13: 0000000000000000 R14: ffffc1df01f53d88 R15: 0000000000000000
Jun 19 14:50:11 snowball3 kernel: FS: 00007f069960eb40(0000) GS:ffff9c337f680000(0000) knlGS:0000000000000000
Jun 19 14:50:11 snowball3 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 19 14:50:11 snowball3 kernel: CR2: 00007f24700050f6 CR3: 000000029b10a000 CR4: 0000000000350ee0
Jun 19 14:50:11 snowball3 kernel: Call Trace:
Jun 19 14:50:11 snowball3 kernel: suspend_devices_and_enter+0x1a1/0x750
Jun 19 14:50:11 snowball3 kernel: pm_suspend.cold+0x329/0x374
Jun 19 14:50:11 snowball3 kernel: state_store+0x71/0xd0
Jun 19 14:50:11 snowball3 kernel: kernfs_fop_write_iter+0x11c/0x1b0
Jun 19 14:50:11 snowball3 kernel: new_sync_write+0x108/0x180
Jun 19 14:50:11 snowball3 kernel: vfs_write+0x1b8/0x270
Jun 19 14:50:11 snowball3 kernel: ksys_write+0x4f/0xc0
Jun 19 14:50:11 snowball3 kernel: do_syscall_64+0x33/0x40
Jun 19 14:50:11 snowball3 kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae
Jun 19 14:50:11 snowball3 kernel: RIP: 0033:0x7f069a2b9387
Jun 19 14:50:11 snowball3 kernel: Code: 0d 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 48 89 54 24 18 48 89 74 24
Jun 19 14:50:11 snowball3 kernel: RSP: 002b:00007ffc68784068 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
Jun 19 14:50:11 snowball3 kernel: RAX: ffffffffffffffda RBX: 0000000000000004 RCX: 00007f069a2b9387
Jun 19 14:50:11 snowball3 kernel: RDX: 0000000000000004 RSI: 00007ffc68784150 RDI: 0000000000000004
Jun 19 14:50:11 snowball3 kernel: RBP: 00007ffc68784150 R08: 000056065d1f04d0 R09: 00007f069a34f4e0
Jun 19 14:50:11 snowball3 kernel: R10: 00007f069a34f3e0 R11: 0000000000000246 R12: 0000000000000004
Jun 19 14:50:11 snowball3 kernel: R13: 000056065d1ec650 R14: 0000000000000004 R15: 00007f069a38c700
Jun 19 14:50:11 snowball3 kernel: ---[ end trace 5c36a66c04f52418 ]---
Jun 19 14:50:11 snowball3 kernel: OOM killer enabled.
Jun 19 14:50:11 snowball3 kernel: Restarting tasks ... done.
Jun 19 14:50:21 snowball3 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=9704, emitted seq=9705
Jun 19 14:50:21 snowball3 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0
Jun 19 14:50:21 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset begin!
Jun 19 14:50:21 snowball3 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=88178, emitted seq=88181
Jun 19 14:50:21 snowball3 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process firefox pid 11018 thread firefox:cs0 pid 11088
Jun 19 14:50:21 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset begin!
Jun 19 14:50:21 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: Bailing on TDR for s_job:15343, as another already in progress
Jun 19 14:50:22 snowball3 kernel: amdgpu 0000:04:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
Jun 19 14:50:24 snowball3 kernel: amdgpu 0000:04:00.0: amdgpu: Msg issuing pre-check failed and SMU may be not in the right state!
Jun 19 14:50:24 snowball3 kernel: [drm] free PSP TMR buffer
Jun 19 14:50:26 snowball3 kernel: [drm] psp command (0x7) failed and response status is (0x0)
The full kernel log is attached.
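For anyone triaging a similar log, here is a rough sketch (not part of the original report) of how the failure signatures above could be pulled out of a long kernel log. The function name `summarize_resume_log` and the `LINUX_ERRNO` table are made up for illustration, and the regular expressions are assumptions tuned only to the message formats visible in this excerpt. The errno names are the Linux values: -62 is ETIME and -110 is ETIMEDOUT.

```python
import re
from collections import Counter

# Linux errno values seen in the log above. Hard-coded on purpose so the
# sketch gives the same names on any OS (other platforms number them differently).
LINUX_ERRNO = {62: "ETIME", 110: "ETIMEDOUT"}

# Matches "returns -62", "error -62", "failed -62" and "failed (-110)".
# Assumption: only these phrasings are of interest; other drivers may differ.
ERRNO_RE = re.compile(r"(?:returns|error|failed)\s*\(?-(\d+)\)?")

# Matches "Aborting method \_SB.ALIB due to previous error (AE_AML_LOOP_TIMEOUT)".
ACPI_RE = re.compile(r"Aborting method (\S+) due to previous error \((\w+)\)")


def summarize_resume_log(lines):
    """Count negative error codes and list aborted ACPI methods in log lines."""
    errnos = Counter()
    acpi_methods = []
    for line in lines:
        # A line may carry an error code in one of the phrasings above.
        for code in ERRNO_RE.findall(line):
            number = int(code)
            errnos[LINUX_ERRNO.get(number, f"errno {number}")] += 1
        # Record each aborted ACPI method together with its status string.
        match = ACPI_RE.search(line)
        if match:
            acpi_methods.append((match.group(1), match.group(2)))
    return errnos, acpi_methods
```

The function accepts any iterable of lines, so an open file handle on a saved copy of the log works directly. On the excerpt above it would count the amdgpu and PM lines that carry -62 under ETIME, the kiq ring test under ETIMEDOUT, and list the aborted `\_SB.*` methods in the order they appear.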
Edited by Julian Sikorski