SMU fails to resume from runtime suspend
Brief summary of the problem:
After several rounds of 30 suspend/resume cycles, each followed by a reboot, the SMU fails to resume with the following errors:
Sep 26 18:20:01 ubuntu kernel: amdgpu 0000:03:00.0: amdgpu: SMU is resuming...
Sep 26 18:20:01 ubuntu kernel: amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000000d, smu fw if version = 0x0000000f, smu fw program = 0, version = 0x00491a00 (73.26.0)
Sep 26 18:20:01 ubuntu kernel: amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched
Sep 26 18:20:01 ubuntu kernel: amdgpu 0000:03:00.0: amdgpu: use vbios provided pptable
Sep 26 18:20:01 ubuntu kernel: ata5: SATA link down (SStatus 4 SControl 300)
Sep 26 18:20:01 ubuntu kernel: amdgpu 0000:03:00.0: amdgpu: SMU: I'm not done with your previous command: SMN_C2PMSG_66:0x00000036 SMN_C2PMSG_82:0x00000000
Sep 26 18:20:01 ubuntu kernel: amdgpu 0000:03:00.0: amdgpu: RunDcBtc failed!
Sep 26 18:20:01 ubuntu kernel: amdgpu 0000:03:00.0: amdgpu: Failed to setup smc hw!
Sep 26 18:20:01 ubuntu kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] ERROR resume of IP block <smu> failed -62
Hardware description:
- CPU: RPL-S
- GPU: Advanced Micro Devices, Inc. [AMD/ATI] [1002:743F]
System information:
- Distro name and Version: Ubuntu Jammy
- Kernel version: drm-tip
How to reproduce the issue:
- After installing any kernel, cold boot the machine.
- Use checkbox-cli to run the suspend/resume stress test:
checkbox-cli run com.canonical.certification::power-management/suspend-30-cycles-log-check-with-reboots
- The test reboots four times, so check the kernel log of each of the four boots with journalctl:
for i in 0 1 2 3; do journalctl -k -b -$i | grep "resume of IP block"; done
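
To see at a glance which of the four boots hit the failure, the check above could be wrapped in a small script. This is only a sketch: the function names `smu_failure_lines` and `check_boots` are made up for illustration, and the match patterns are copied from the log excerpt in this report, so they may need adjusting on other kernels.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical, patterns come from the log above.

# Filter kernel log text on stdin down to the SMU resume failure lines.
smu_failure_lines() {
    grep -E 'resume of IP block <smu> failed|RunDcBtc failed|Failed to setup smc hw'
}

# Count failure lines for the current boot (offset 0) and the three boots before it.
check_boots() {
    for offset in 0 1 2 3; do
        count=$(journalctl -k -b "-$offset" 2>/dev/null | smu_failure_lines | wc -l)
        echo "boot -$offset: $count SMU failure line(s)"
    done
}
```

After sourcing the script, running `check_boots` prints one summary line per boot. A saved log can be checked the same way by piping it through the filter, for example `smu_failure_lines < saved-dmesg.txt` (the file name is a placeholder).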