Due to an influx of spam, we have had to impose restrictions on new accounts. Please see this wiki page for instructions on how to get full permissions. Sorry for the inconvenience.
Admin message
The migration is almost done, at least the rest should happen in the background. There are still a few technical difference between the old cluster and the new ones, and they are summarized in this issue. Please pay attention to the TL:DR at the end of the comment.
CPU: AMD Ryzen 5 4500U with Radeon Graphics (6) @ 2.375GHzGPU: AMD ATI 03:00.0 RenoirModel: Acer Swift 3 (SF314-42-R2UX)OS: Ubuntu 21.04
I tried all patches mentioned in #1230 (closed) on Kernel 5.14 but with little success: Some times the machine will wake after first suspend but not always and at least I usually can SSH into my machine.
I was hoping for Kernel 5.15 and tried v5.15-rc2 but unfortunately the situation worsened: My machine wont wake up at all after suspend.
Trying to get on top off recent developments I found it quite hard to grasp, which patches on which kernel version I should try to solve the issue.
Could somebody please summarize for me, what to try?
On your kernel command line for the following checks set up amd_pmc.dyndbg='+p' pm_debug_messages.
Check 5.14.7 kernel. If it works here, can you please provide a dmesg of your full suspend and resume so we can compare 5.15-rcX.
Check 5.15-rc3. The most notable change is https://github.com/torvalds/linux/commit/3c3c8e88c8712bfe06cd10d7ca77a94a33610cd6 but it may or may not help your system. If it doesn't help, please share more about how the failure occurs. Does it seem that it enters suspend but can't wake? Is the power LED "breathing"? Have you tried all the wake sources? Keyboard, touchpad, lid, power button?
If all of those wake sources don't work, can you please try to plug in a USB keyboard before suspend and try to press a key to wake it? If this works, please provide a dmesg log.
System seems to go in some kind of suspend on lid close or manual suspend, as all ssh sessions freeze. The power LED however does not indicate suspend mode, as it does not »breath« but is constantly on.
Opening of the lid or power button do not have any effect. All keyboard buttons (of the internal keyboard) however kind of wake the machine up:
Keyboard back light lights up.
Some times I can ssh into the machine but not always.
Here is the dmesg output (with amd_pmc.dyndbg='+p' pm_debug_messages): dmesg.linux14.7.txt
With Linux 5.14.0 and also with Linux 5.12.12 (the actual Ubuntu Kernel):
-System seems to go in some kind of suspend on lid close or manual suspend, as all ssh sessions freeze. The power LED however does not indicate suspend mode, as it does not »breath« but is constantly on.
Opening of the lid or power button do not have any effect. All keyboard buttons (of the internal keyboard) however kind of wake the machine up:
Keyboard back light lights up.
Some times I can ssh into the machine but not always.
Generally waking up the system works for the first time.
Trying to wake the system for the second time after few key strokes it will reboot.
Some times after suspend I can switch to console getting those messages:
Mi Sep 29 22:26:14 2021] amdgpu 0000:03:00.0: amdgpu: Msg issuing pre-check failed(0xff) and SMU may be not in the right state![Mi Sep 29 22:26:14 2021] amdgpu 0000:03:00.0: amdgpu: Attempt to get max GX frequency from SMC Failed ![Mi Sep 29 22:26:14 2021] amdgpu 0000:03:00.0: amdgpu: Msg issuing pre-check failed(0xff) and SMU may be not in the right state![Mi Sep 29 22:26:20 2021] [drm:smu_v12_0_gfx_off_control [amdgpu]] *ERROR* disable gfxoff timeout and failed![Mi Sep 29 22:26:20 2021] amdgpu 0000:03:00.0: amdgpu: Failed to disable gfxoff![Mi Sep 29 22:26:20 2021] amdgpu 0000:03:00.0: amdgpu: Msg issuing pre-check failed(0xff) and SMU may be not in the right state![Mi Sep 29 22:26:20 2021] amdgpu 0000:03:00.0: amdgpu: Attempt to get max GX frequency from SMC Failed !
Switching back to display will may show login screen but is frozen and may reboot after a while
So this bug report looks like it's the same as #1693 (closed) - it's got a different PEP ID than we support in the amd-pmc driver. That was fixed by a BIOS upgrade.
Can you please check with Acer for an updated BIOS?
I see updates for your model available here: https://www.acer.com/ac/en/US/content/support-product/8263?b=1
If the issue persists after BIOS upgrade, please provide a new acpidump.
FWIW there are some patches that went out to the mailing list to add this ID. The BIOS upgrade should be your primary way to solve this, but if that doesn't help please give them a try.
The resume freezes XOrg, after being in sleep (printf mem | sudo tee /sys/power/state) for more than 30s actually randomly. By freeze I mean the mouse moves, but nothing on the screen reacts, CPU seem to get hot.
I could recover by quitting the WM with keybind and re-entering the Xsession with startx. Attaching the dmesg.log.
I feel like the problem is due to rather an amdgpu fault (a short red part from the log):
+1 I'm on same laptop with VoidLinux base image, I'd be happy to help with any information required.
For me I have different scenarios:
I have latest BIOS
How much time does building a kernel take (I'm just new to it, I guess I didn't specified multi-core use with -j, it took 1.5hr still building on net/ethernet directory). Is it near or should I restart with -j6?
I have a similar issue: When putting my Slim 7 Pro (14ARE05) to sleep and waking up again, my external monitor (connected via USB-C Dock / DP-alt-mode) will remain dark. This only happens when the refresh rate is set to 144hz. When setting it to 60hz, the issue does not occur. When plugging out the dock and reconnecting it, the monitor gets a signal again.
I'm running Fedora 34 (Gnome / Wayland) on 5.14.13-250.vanilla.1.fc34.x86_64 with all the latest drivers available from the official repositories installed.
Any clues on how to fix that? No issues with sleep (also deep state).
Plus I forgot to mention at time of boot there's two more error messages up (seen at tty login as well as in dmesg logs below 100lines approx):
[ 3.254671] acp_pdm_mach acp_pdm_mach.0: snd_soc_register_card(acp) failed: -517
[ 71.887691] amdgpu 0000:03:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x005C0071@superm1 Can I understand what those error status indicate, so I can also help in fixing that?
Can you try with iommu=off to see that this helps?
Any clues on how to fix that? No issues with sleep (also deep state).
@qdrop we need more details, and more importantly we need to be sure that you have the same issue as is reported. Marking a duplicate (after we confirm it) is cheap, but tracking 5 issues in a single thread is expensive. Please open your own issue and attach your dmesg and acpidump to the issue.
confirmed your acpi dump for sure also requires the AMD0005. One more debugging idea is to try to update to latest linux-firmware.git if you haven't already. If it still happens there to figure out what was happening differently that caused the crash I think we'll need to look at an STB dump comparing your good and failed result. Please apply https://patchwork.kernel.org/project/platform-driver-x86/patch/20211005155641.2308-1-Sanket.Goswami@amd.com/
You might have a few dependencies to use it from linux-next, so look for those if it doesn't apply cleanly.
Capture the stb from debugfs (/sys/kernel/debug/amd_pmc/stb_read) both from a successful resume and a failed one (and label them accordingly). Attach them to this bug and one of us will need to compare them for anything notably different.
Hmm, few hunks failed to register from the patch, which version I have to test the patch with, I'm on linux5.14-5.14.13_1 currently. Is that patch extracted somewhere from the kernel v5.15?
I was using 20210818, I'll try with latest 20210919 shortly. I'll include patch in a bit as well. I'll notify when I'm done with both :)
Just a question should I start with the topmost all defaults or those kernel-parameters and kernel patches you gave before applied (along with updated firmware with current patches)?