AMDGPU crash: "Error queuing DMUB command: status=2", "Error waiting for DMUB idle: status=3"
Brief summary of the problem:
I have a laptop with switchable graphics: an AMD iGPU and an NVIDIA dGPU. The problem occurs when the system is in hybrid or integrated GPU mode (I am currently testing nvidia mode, but it is unlikely that the crash is caused by the dGPU). A secondary monitor is connected to the laptop and is driven by the dGPU; the laptop screen is driven by the iGPU. The iGPU is the most likely cause, since the system has never crashed when only the secondary monitor was in use.

The crashes occur at random, with no clear pattern. The only thing I have noticed is that they happened only while the mouse pointer was at the top of the laptop screen. The system does not freeze entirely, but it becomes unusably slow: the screen updates only once every 10 seconds, and switching to a TTY takes 10 minutes. Killing the desktop environment (KDE in my case) has no effect. Once in the TTY, the screen is flooded with kernel error messages like those in the attached dmesg logs. After the last crash, rebooting did not solve the problem; it happened again immediately after I logged in. Only a full shutdown solves the problem temporarily.
Hardware description:
- Laptop: HP Victus 16-s0xxx
- CPU: AMD Ryzen 5 7640HS w/ Radeon 760M Graphics
- GPU: NVIDIA GeForce RTX 3050 6GB Laptop GPU
*-display
description: VGA compatible controller
product: NVIDIA Corporation [10DE:25AC]
vendor: NVIDIA Corporation [10DE]
physical id: 0
bus info: pci@0000:01:00.0
logical name: /dev/fb0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller bus_master cap_list rom fb
configuration: depth=32 driver=nvidia latency=0 mode=1920x1080 visual=truecolor xres=1920 yres=1080
resources: iomemory:7a0-79f iomemory:7c0-7bf irq:70 memory:60000000-60ffffff memory:7a00000000-7bffffffff memory:7c00000000-7c01ffffff ioport:3000(size=128) memory:61080000-610fffff
*-display
description: VGA compatible controller
product: Advanced Micro Devices, Inc. [AMD/ATI] [1002:15BF]
vendor: Advanced Micro Devices, Inc. [AMD/ATI] [1002]
physical id: 0
bus info: pci@0000:05:00.0
logical name: /dev/fb0
version: c3
width: 64 bits
clock: 33MHz
capabilities: pm pciexpress msi msix vga_controller bus_master cap_list fb
configuration: depth=32 driver=amdgpu latency=0 resolution=1920,1080
resources: iomemory:7c0-7bf irq:46 memory:7c10000000-7c1fffffff memory:61200000-613fffff ioport:1000(size=256) memory:61700000-6177ffff
- System Memory: 14.8GiB RAM
- Display(s):
- Laptop screen (1920x1080) at 144Hz (the crash also happened at 60Hz)
- Secondary monitor LG HDR 4K (3840x2160)
- Type of Display Connection:
- Laptop screen: builtin
- Secondary monitor: HDMI
System information:
- Distro name and Version: Pop!_OS 22.04
- Kernel version: Linux pop-os 6.4.6-76060406-generic #202307241739~1694621917~22.04~ac5e1a8 SMP PREEMPT_DYNAMIC Wed S x86_64 x86_64 x86_64 GNU/Linux
- AMD official driver version: 22.0.0-1ubuntu0.1pop1~1677198820~22.04~2e82f80
- KDE Plasma version: 5.24.7
- NVIDIA Driver version: 535.104.05
How to reproduce the issue:
No pattern. Sometimes moving the mouse to the top of the laptop screen crashes the system.
Attached files:
Log files (for system lockups / game freezes / crashes)
- Dmesg log (full log)
- crash before last crash: 2023.09.20.16.00.dmesg
- last crash: 2023.09.20.16.07.dmesg
- journalctl
- crash before last crash: 2023.09.20.16.00.crash
- last crash: 2023.09.20.16.07.crash
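For anyone triaging the attached logs, the following is a minimal sketch (not part of the original report) that counts how often the two messages quoted in the title appear in a saved dmesg file. The function name `count_dmub_errors` and the grouping by status code are illustrative assumptions; the only strings taken from the report are the two error messages themselves.

```python
import re
from collections import Counter

# Patterns built from the two messages quoted in the report title.
# The numeric status is captured so that different codes are counted separately.
DMUB_PATTERNS = {
    "queue": re.compile(r"Error queuing DMUB command: status=(\d+)"),
    "idle": re.compile(r"Error waiting for DMUB idle: status=(\d+)"),
}


def count_dmub_errors(lines):
    """Return a Counter keyed by (kind, status) for DMUB error lines.

    `lines` is any iterable of strings, e.g. an open log file.
    `kind` is "queue" or "idle" (hypothetical labels chosen for this sketch).
    """
    counts = Counter()
    for line in lines:
        for kind, pattern in DMUB_PATTERNS.items():
            match = pattern.search(line)
            if match:
                counts[(kind, int(match.group(1)))] += 1
    return counts
```

It could be run against one of the attachments with something like `count_dmub_errors(open("2023.09.20.16.07.dmesg", errors="replace"))`; comparing the counts between the two dmesg files might show whether both crashes produced the same mix of status codes.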