SI hangs when using DC with DPM
Brief summary of the problem:
So.. I was just checking #1776 when I realized that DC broke some time ago.
Since somewhere between 5.16 and 6.1 (and then counting to this day) it also started to simultaneously drop displaying anything.
System information:
System:
Host: test Kernel: 6.4.6-1-MANJARO arch: x86_64 bits: 64
compiler: gcc v: 13.1.1 Desktop: Xfce v: 4.18.1 tk: Gtk v: 3.24.36 wm: xfwm
dm: LightDM Distro: Manjaro Linux base: Arch Linux
CPU:
Info: quad core model: Intel Core2 Quad Q9300 bits: 64 type: MCP
arch: Penryn rev: 7 cache: L1: 256 KiB L2: 6 MiB
Speed (MHz): avg: 2003 min/max: 2003/2499 cores: 1: 2003 2: 2003 3: 2003
4: 2003 bogomips: 20075
Flags: ht lm nx pae sse sse2 sse3 sse4_1 ssse3 vmx
Graphics:
Device-1: AMD Cape Verde PRO [Radeon HD 7750/8740 / R7 250E]
vendor: PC Partner / Sapphire driver: amdgpu v: kernel arch: GCN-1 pcie:
speed: 2.5 GT/s lanes: 16 ports: active: HDMI-A-1 empty: DVI-D-1
bus-ID: 01:00.0 chip-ID: 1002:683f
Display: x11 server: X.Org v: 21.1.8 compositor: xfwm v: 4.18.0 driver: X:
loaded: amdgpu unloaded: modesetting alternate: fbdev,vesa dri: radeonsi
gpu: amdgpu display-ID: :0.0 screens: 1
Screen-1: 0 s-res: 1920x1080 s-dpi: 96 monitors: <missing: xrandr>
Monitor-1: HDMI-A-1 model: LG (GoldStar) 27GL850 res: 2560x1440 dpi: 93
diag: 800mm (31.5")
API: OpenGL v: 4.6 Mesa 23.0.4 renderer: AMD Radeon HD 7700 Series (verde
LLVM 15.0.7 DRM 3.52 6.4.6-1-MANJARO) direct-render: Yes
How to reproduce the issue:
- Enable AMDGPU (
radeon.si_support=0 amdgpu.si_support=1
) - Enable DC (
amdgpu.dc=1
) - Enjoy blackness (unless using
amdgpu.dpm=0
)
Attached files:
Log files (for system lockups / game freezes / crashes)
The only remarkable thing I can see is these lines
kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=31, emitted seq=33
kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0
On kernel 5.15 (if something so old can still help) after waiting for a lot of time I can see:
[drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:47:crtc-0] flip_done timed out
[drm:drm_crtc_commit_wait] *ERROR* flip_done timed out
[drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CRTC:47:crtc-0] commit wait timed out
[drm:drm_crtc_commit_wait] *ERROR* flip_done timed out
[drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [CONNECTOR:59:HDMI-A-1] commit wait timed out
[drm:drm_crtc_commit_wait] *ERROR* flip_done timed out
[drm:drm_atomic_helper_wait_for_dependencies] *ERROR* [PLANE:45:plane-5] commit wait timed out
------------[ cut here ]------------
WARNING: CPU: 0 PID: 185 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:9205 amdgpu_dm_atomic_commit_tail+0x2699/0x27a0 [amdgp>
Modules linked in: cmac algif_hash ecb algif_skcipher af_alg bnep btusb btrtl btbcm btintel bluetooth ecdh_generic joydev mousedev amdgpu rfki>
CPU: 0 PID: 185 Comm: kworker/0:2 Tainted: G W 5.15.122-1-MANJARO #1 168fb641dbb2b7df0309a9898046e4bb26860e14
Hardware name: System manufacturer System Product Name/P5QPL-AM, BIOS 0416 06/22/2010
Workqueue: events console_callback
RIP: 0010:amdgpu_dm_atomic_commit_tail+0x2699/0x27a0 [amdgpu]
Code: 89 95 48 fd ff ff 89 85 40 fd ff ff 89 8d 50 fd ff ff c7 85 3c fd ff ff 04 00 00 00 e9 95 e4 ff ff 0f 0b e9 ca f9 ff ff 0f 0b <0f> 0b e9>
RSP: 0018:ffffbced010477e0 EFLAGS: 00010002
RAX: 0000000000000002 RBX: 0000000000000246 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
RBP: ffffbced01047b80 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: ffff9c09c20d1118
R13: 0000000000000000 R14: ffff9c09c273c400 R15: ffff9c09c20d1000
FS: 0000000000000000(0000) GS:ffff9c0ad7c00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fca78f9f4b8 CR3: 00000001009b0000 CR4: 00000000000006f0
Call Trace:
<TASK>
? __warn+0x81/0x100
? amdgpu_dm_atomic_commit_tail+0x2699/0x27a0 [amdgpu 2f9972f9d6d0c9a906bdd63dc4126cde446d2a04]
? report_bug+0xa7/0xd0
? handle_bug+0x34/0x90
? exc_invalid_op+0x13/0x70
? asm_exc_invalid_op+0x16/0x20
? amdgpu_dm_atomic_commit_tail+0x2699/0x27a0 [amdgpu 2f9972f9d6d0c9a906bdd63dc4126cde446d2a04]
commit_tail+0x94/0x130
drm_atomic_helper_commit+0x122/0x150
drm_client_modeset_commit_atomic+0x204/0x240
drm_client_modeset_commit_locked+0x56/0x170
drm_fb_helper_pan_display+0xc5/0x1f0
fb_pan_display+0x82/0x140
bit_update_start+0x1a/0x40
fbcon_switch+0x3de/0x580
redraw_screen+0xfa/0x290
complete_change_console+0x3a/0x120
console_callback+0x16f/0x180
process_one_work+0x1c9/0x3a0
worker_thread+0x4d/0x3a0
? process_one_work+0x3a0/0x3a0
kthread+0x123/0x150
? set_kthread_struct+0x60/0x60
ret_from_fork+0x22/0x30
</TASK>
---[ end trace 268f985ea7df4bc5 ]---
------------[ cut here ]------------
WARNING: CPU: 0 PID: 185 at drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:8781 amdgpu_dm_atomic_commit_tail+0x26a0/0x27a0 [amdgp>
Modules linked in: cmac algif_hash ecb algif_skcipher af_alg bnep btusb btrtl btbcm btintel bluetooth ecdh_generic joydev mousedev amdgpu rfki>
CPU: 0 PID: 185 Comm: kworker/0:2 Tainted: G W 5.15.122-1-MANJARO #1 168fb641dbb2b7df0309a9898046e4bb26860e14
Hardware name: System manufacturer System Product Name/P5QPL-AM, BIOS 0416 06/22/2010
Workqueue: events console_callback
RIP: 0010:amdgpu_dm_atomic_commit_tail+0x26a0/0x27a0 [amdgpu]
Code: 85 40 fd ff ff 89 8d 50 fd ff ff c7 85 3c fd ff ff 04 00 00 00 e9 95 e4 ff ff 0f 0b e9 ca f9 ff ff 0f 0b 0f 0b e9 2a fa ff ff <0f> 0b e9>
RSP: 0018:ffffbced010477e0 EFLAGS: 00010082
RAX: 0000000000000001 RBX: 0000000000000246 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
RBP: ffffbced01047b80 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: ffff9c09c20d1118
R13: 0000000000000000 R14: ffff9c09c273c400 R15: ffff9c09c20d1000
FS: 0000000000000000(0000) GS:ffff9c0ad7c00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fca78f9f4b8 CR3: 00000001009b0000 CR4: 00000000000006f0
Call Trace:
<TASK>
? __warn+0x81/0x100
? amdgpu_dm_atomic_commit_tail+0x26a0/0x27a0 [amdgpu 2f9972f9d6d0c9a906bdd63dc4126cde446d2a04]
? report_bug+0xa7/0xd0
? handle_bug+0x34/0x90
? exc_invalid_op+0x13/0x70
? asm_exc_invalid_op+0x16/0x20
? amdgpu_dm_atomic_commit_tail+0x26a0/0x27a0 [amdgpu 2f9972f9d6d0c9a906bdd63dc4126cde446d2a04]
commit_tail+0x94/0x130
drm_atomic_helper_commit+0x122/0x150
drm_client_modeset_commit_atomic+0x204/0x240
drm_client_modeset_commit_locked+0x56/0x170
drm_fb_helper_pan_display+0xc5/0x1f0
fb_pan_display+0x82/0x140
bit_update_start+0x1a/0x40
fbcon_switch+0x3de/0x580
redraw_screen+0xfa/0x290
complete_change_console+0x3a/0x120
console_callback+0x16f/0x180
process_one_work+0x1c9/0x3a0
worker_thread+0x4d/0x3a0
? process_one_work+0x3a0/0x3a0
kthread+0x123/0x150
? set_kthread_struct+0x60/0x60
ret_from_fork+0x22/0x30
</TASK>
---[ end trace 268f985ea7df4bc6 ]---
[drm:drm_atomic_helper_wait_for_flip_done] *ERROR* [CRTC:47:crtc-0] flip_done timed out
Edited by mirh