[drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
Brief summary of the problem:
It's a laptop, the lenovo z16. I only use an external screen. usb4 > usb to hdmi adapter > hdmi.
The external screen flashes, then freeze. When I unplug the HDMI adapter, the video goes back to the laptop but is frozen on a brown(ish) screen. I then have to reboot.
It usually happens when I am on video call. (Zoom in the browser, firefox). But it did happen randomly as well. It also doesn't always happen on zoom calls.
I cannot confirm if this happens when not using the external screen. but I will try that and update when I have a feedback.
I did try other adapter (random docking station) it had the same behavior.
Kernel log:
Oct 26 08:59:50.785409 sylvain-work systemd-journald[1031]: Time jumped backwards, rotating.
Oct 26 08:59:52.692614 sylvain-work kernel: [drm] DP Alt mode state on HPD: 1
Oct 26 08:59:53.957620 sylvain-work kernel: [drm] DP Alt mode state on HPD: 1
Oct 26 09:20:11.685546 sylvain-work kernel: [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
Oct 26 09:20:11.702559 sylvain-work kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=612269, emitted seq=612270
Oct 26 09:20:11.702652 sylvain-work kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process gnome-shell pid 2497 thread gnome-shel:cs0 pid 2518
Oct 26 09:20:11.702699 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: GPU reset begin!
Oct 26 09:20:12.378517 sylvain-work kernel: amdgpu 0000:64:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
Oct 26 09:20:12.378860 sylvain-work kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed
Oct 26 09:20:12.577512 sylvain-work kernel: [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* failed to halt cp gfx
Oct 26 09:20:12.587505 sylvain-work kernel: [drm] free PSP TMR buffer
Oct 26 09:20:12.619513 sylvain-work kernel: CPU: 6 PID: 11 Comm: kworker/u32:1 Not tainted 5.19.16-200.fc36.x86_64 #1
Oct 26 09:20:12.619654 sylvain-work kernel: Hardware name: LENOVO 21D4CTO1WW/21D4CTO1WW, BIOS N3GET42W (1.22 ) 09/06/2022
Oct 26 09:20:12.619698 sylvain-work kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Oct 26 09:20:12.619741 sylvain-work kernel: Call Trace:
Oct 26 09:20:12.619785 sylvain-work kernel: <TASK>
Oct 26 09:20:12.619825 sylvain-work kernel: dump_stack_lvl+0x44/0x5c
Oct 26 09:20:12.619866 sylvain-work kernel: amdgpu_do_asic_reset+0x26/0x459 [amdgpu]
Oct 26 09:20:12.619909 sylvain-work kernel: amdgpu_device_gpu_recover_imp.cold+0x59d/0x8cb [amdgpu]
Oct 26 09:20:12.619948 sylvain-work kernel: amdgpu_job_timedout+0x156/0x190 [amdgpu]
Oct 26 09:20:12.619982 sylvain-work kernel: ? default_send_IPI_single_phys+0x32/0x50
Oct 26 09:20:12.620021 sylvain-work kernel: ? _raw_spin_unlock_irqrestore+0x23/0x40
Oct 26 09:20:12.620059 sylvain-work kernel: drm_sched_job_timedout+0x76/0x110 [gpu_sched]
Oct 26 09:20:12.620101 sylvain-work kernel: process_one_work+0x1c7/0x380
Oct 26 09:20:12.620140 sylvain-work kernel: worker_thread+0x4d/0x380
Oct 26 09:20:12.620180 sylvain-work kernel: ? process_one_work+0x380/0x380
Oct 26 09:20:12.620218 sylvain-work kernel: kthread+0xe9/0x110
Oct 26 09:20:12.620262 sylvain-work kernel: ? kthread_complete_and_exit+0x20/0x20
Oct 26 09:20:12.620303 sylvain-work kernel: ret_from_fork+0x22/0x30
Oct 26 09:20:12.620343 sylvain-work kernel: </TASK>
Oct 26 09:20:12.620381 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: MODE2 reset
Oct 26 09:20:12.628516 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: GPU reset succeeded, trying to resume
Oct 26 09:20:12.628683 sylvain-work kernel: [drm] PCIE GART of 512M enabled (table at 0x000000F4008C9000).
Oct 26 09:20:12.628715 sylvain-work kernel: [drm] PSP is resuming...
Oct 26 09:20:12.650504 sylvain-work kernel: [drm] reserve 0xa00000 from 0xf41f400000 for PSP TMR
Oct 26 09:20:12.948525 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: RAS: optional ras ta ucode is not available
Oct 26 09:20:12.957502 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: RAP: optional rap ta ucode is not available
Oct 26 09:20:12.957857 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Oct 26 09:20:12.958187 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: SMU is resuming...
Oct 26 09:20:12.958473 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: SMU is resumed successfully!
Oct 26 09:20:12.960506 sylvain-work kernel: [drm] DMUB hardware initialized: version=0x0400002A
Oct 26 09:20:13.074546 sylvain-work kernel: [drm] kiq ring mec 2 pipe 1 q 0
Oct 26 09:20:13.078546 sylvain-work kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
Oct 26 09:20:13.079549 sylvain-work kernel: [drm] JPEG decode initialized successfully.
Oct 26 09:20:13.079665 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Oct 26 09:20:13.080055 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
Oct 26 09:20:13.080339 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
Oct 26 09:20:13.080634 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
Oct 26 09:20:13.080908 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
Oct 26 09:20:13.081191 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
Oct 26 09:20:13.081452 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
Oct 26 09:20:13.081845 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
Oct 26 09:20:13.082189 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
Oct 26 09:20:13.082566 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
Oct 26 09:20:13.083078 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
Oct 26 09:20:13.083568 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
Oct 26 09:20:13.083938 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
Oct 26 09:20:13.084166 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
Oct 26 09:20:13.084262 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: ring jpeg_dec uses VM inv eng 5 on hub 1
Oct 26 09:20:13.104581 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: recover vram bo from shadow start
Oct 26 09:20:13.104941 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: recover vram bo from shadow done
Oct 26 09:20:13.105104 sylvain-work kernel: [drm] Skip scheduling IBs!
Oct 26 09:20:13.105144 sylvain-work kernel: amdgpu 0000:64:00.0: amdgpu: GPU reset(2) succeeded!
Oct 26 09:20:13.108509 sylvain-work kernel: amdgpu_cs_ioctl: 149 callbacks suppressed
Oct 26 09:20:13.108585 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:13.129520 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:13.149511 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:19.929016 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:19.961628 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:19.981526 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:20.001598 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:20.021587 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:20.041521 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:20.062511 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:20.081635 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:20.102512 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:20.122511 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:23.759541 sylvain-work kernel: usb 3-1: USB disconnect, device number 2
Oct 26 09:20:23.760031 sylvain-work kernel: usb 3-1.1: USB disconnect, device number 4
Oct 26 09:20:23.760317 sylvain-work kernel: usb 3-1.1.1: USB disconnect, device number 7
Oct 26 09:20:23.760608 sylvain-work kernel: uvcvideo 3-1.1.2:1.1: Failed to resubmit video URB (-19).
Oct 26 09:20:23.760904 sylvain-work kernel: usb 3-1.1.2: cannot submit urb (err = -19)
Oct 26 09:20:23.763530 sylvain-work kernel: uvcvideo 3-1.1.2:1.1: Failed to resubmit video URB (-19).
Oct 26 09:20:23.767511 sylvain-work kernel: uvcvideo 3-1.1.2:1.1: Failed to resubmit video URB (-19).
Oct 26 09:20:23.771555 sylvain-work kernel: usb 3-1.1.2: cannot submit urb 0, error -19: no device
Oct 26 09:20:23.771793 sylvain-work kernel: uvcvideo 3-1.1.2:1.1: Failed to resubmit video URB (-19).
Oct 26 09:20:23.775505 sylvain-work kernel: uvcvideo 3-1.1.2:1.1: Failed to resubmit video URB (-19).
Oct 26 09:20:24.064520 sylvain-work kernel: usb 3-1.1.2: USB disconnect, device number 9
Oct 26 09:20:24.096513 sylvain-work kernel: usb 3-1.1.3: USB disconnect, device number 10
Oct 26 09:20:24.128507 sylvain-work kernel: usb 3-1.1.4: USB disconnect, device number 11
Oct 26 09:20:24.192524 sylvain-work kernel: usb 3-1.2: USB disconnect, device number 6
Oct 26 09:20:24.345035 sylvain-work kernel: xhci_hcd 0000:64:00.4: Refused to change power state from D0 to D3hot
Oct 26 09:20:24.475538 sylvain-work kernel: usb 4-1: USB disconnect, device number 2
Oct 26 09:20:24.475928 sylvain-work kernel: usb 4-1.1: USB disconnect, device number 3
Oct 26 09:20:24.943515 sylvain-work kernel: amdgpu_cs_ioctl: 173 callbacks suppressed
Oct 26 09:20:24.943606 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Oct 26 09:20:24.960528 sylvain-work kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Hardware description:
- Laptop: Lenovo z16
- CPU: AMD Ryzen 7 PRO 6850H with Radeon Graphics
- GPU: 64:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Rembrandt [Radeon 680M] [1002:1681] (rev 12)
- System Memory: 32gb
- Display(s): 16" WUXGA (1920 x 1200) IPS (non touch) and external screen Samsung 34" (3440x1440)
- Type of Display Connection: usb4 > usb to HDMI adapter (apple) > hdmi
System information:
- Distro name and Version: fedora 36 (up to date)
- Kernel version: Linux sylvain-work 5.19.16-200.fc36.x86_64 #1 (closed) SMP PREEMPT_DYNAMIC Sun Oct 16 22:50:04 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
- Custom kernel: n/a
- AMD official driver version: n/a
How to reproduce the issue:
craches are random and don't always happen. But most of the time, it happens when I Join a zoom meeting in the browser (firefox)
Attached files:
full dmsg logs: journalctl.log