Radeon X700 PRO [RV410] crashes at boot: "Failed to wait GUI idle while programming pipes. Bad things might happen." (kernel 6.6.13 + kernel 6.8-rc3)
At first I thought my Radeon X700 PRO had become defective due to its age, but then I got the same dmesg with another R400 card, a Radeon X800 GTO.
An X1050, an X1550 and an HD 6570 all work perfectly well on the same machine, so I think this might be a problem that only shows up on R400 cards.
When the radeon module gets loaded I get "Failed to wait GUI idle while programming pipes. Bad things might happen." and shortly afterwards an RCU stall at "RIP: 0010:r100_irq_process+0x160/0x202 [radeon]", followed by a soft lockup and a kernel panic. See dmesg:
[drm] radeon kernel modesetting enabled.
radeon 0000:04:00.0: enabling device (0000 -> 0003)
[drm] initializing kernel modesetting (RV410 0x1002:0x5E4B 0x148C:0x2103 0x00).
hid-generic 0003:0D8C:000C.0001: input,hidraw0: USB HID v1.00 Device [C-Media USB Headphone Set ] on usb-0000:0a:00.3-2.1/input3
input: HDA ATI HDMI HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:03.1/0000:06:00.0/0000:07:00.0/0000:08:00.1/sound/card0/input4
input: HID 046a:010d as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/usb1/1-6/1-6.2/1-6.2:1.0/0003:046A:010D.0002/input/input10
hid-generic 0003:26CE:01A2.0004: input,hidraw1: USB HID v1.10 Device [ASRock LED Controller] on usb-0000:02:00.0-8/input0
input: HDA ATI HDMI HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:03.1/0000:06:00.0/0000:07:00.0/0000:08:00.1/sound/card0/input5
ATOM BIOS: RV410P
[drm] GPU not posted. posting now...
usb 1-6.3: new full-speed USB device number 6 using xhci_hcd
hid-generic 0003:046A:010D.0002: input,hidraw2: USB HID v1.11 Keyboard [HID 046a:010d] on usb-0000:02:00.0-6.2/input0
input: HDA ATI HDMI HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:03.1/0000:06:00.0/0000:07:00.0/0000:08:00.1/sound/card0/input6
input: HDA ATI HDMI HDMI/DP,pcm=10 as /devices/pci0000:00/0000:00:03.1/0000:06:00.0/0000:07:00.0/0000:08:00.1/sound/card0/input7
input: HDA ATI HDMI HDMI/DP,pcm=11 as /devices/pci0000:00/0000:00:03.1/0000:06:00.0/0000:07:00.0/0000:08:00.1/sound/card0/input8
input: HID 046a:010d as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/usb1/1-6/1-6.2/1-6.2:1.1/0003:046A:010D.0003/input/input11
Adding 33558524k swap on /dev/nvme0n1p5. Priority:-2 extents:1 across:33558524k SS
[drm] Generation 2 PCI interface, using max accessible memory
radeon 0000:04:00.0: limiting VRAM to PCI aperture size
radeon 0000:04:00.0: VRAM: 128M 0x00000000E8000000 - 0x00000000EFFFFFFF (128M used)
radeon 0000:04:00.0: GTT: 512M 0x00000000C8000000 - 0x00000000E7FFFFFF
[drm] Detected VRAM RAM=128M, BAR=128M
[drm] RAM width 128bits DDR
[drm] radeon: 128M of VRAM memory ready
[drm] radeon: 512M of GTT memory ready.
[drm] GART: num cpu pages 131072, num gpu pages 131072
hid-generic 0003:046A:010D.0003: input,hidraw3: USB HID v1.11 Device [HID 046a:010d] on usb-0000:02:00.0-6.2/input1
[drm] PCIE GART of 512M enabled (table at 0x00000000E8040000).
usb 1-6.3: New USB device found, idVendor=1e7d, idProduct=2c38, bcdDevice= 0.78
usb 1-6.3: New USB device strings: Mfr=1, Product=2, SerialNumber=0
usb 1-6.3: Product: ROCCAT Kiro Mouse
usb 1-6.3: Manufacturer: ROCCAT
input: ROCCAT ROCCAT Kiro Mouse as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/usb1/1-6/1-6.3/1-6.3:1.0/0003:1E7D:2C38.0005/input/input12
input: ROCCAT ROCCAT Kiro Mouse Consumer Control as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/usb1/1-6/1-6.3/1-6.3:1.0/0003:1E7D:2C38.0005/input/input13
Failed to wait GUI idle while programming pipes. Bad things might happen.
input: ROCCAT ROCCAT Kiro Mouse as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/usb1/1-6/1-6.3/1-6.3:1.0/0003:1E7D:2C38.0005/input/input14
hid-generic 0003:1E7D:2C38.0005: input,hiddev96,hidraw4: USB HID v1.11 Mouse [ROCCAT ROCCAT Kiro Mouse] on usb-0000:02:00.0-6.3/input0
input: ROCCAT ROCCAT Kiro Mouse as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/usb1/1-6/1-6.3/1-6.3:1.1/0003:1E7D:2C38.0006/input/input15
hid-generic 0003:1E7D:2C38.0006: input,hidraw5: USB HID v1.00 Keyboard [ROCCAT ROCCAT Kiro Mouse] on usb-0000:02:00.0-6.3/input1
Failed to wait GUI idle while programming pipes. Bad things might happen.
Failed to wait GUI idle while programming pipes. Bad things might happen.
[drm] radeon: 4 quad pipes, 1 z pipes initialized.
radeon 0000:04:00.0: WB enabled
radeon 0000:04:00.0: fence driver on ring 0 use gpu addr 0x00000000c8000000
radeon 0000:04:00.0: radeon: MSI limited to 32-bit
radeon 0000:04:00.0: radeon: using MSI.
usb 1-6: USB disconnect, device number 2
usb 1-6.2: USB disconnect, device number 4
rcu: INFO: rcu_sched self-detected stall on CPU
rcu: 13-....: (6301 ticks this GP) idle=135c/1/0x4000000000000000 softirq=567/567 fqs=1667
rcu: hardirqs softirqs csw/system
rcu: number: 0 195 0
rcu: cputime: 0 0 10493 ==> 10510(ms)
rcu: (t=6307 jiffies g=-287 q=2261 ncpus=32)
CPU: 13 PID: 986 Comm: (udev-worker) Not tainted 6.8.0-rc3-Zen3 #1
Hardware name: To Be Filled By O.E.M. B550M Pro4/B550M Pro4, BIOS P3.20 09/27/2023
RIP: 0010:r100_irq_process+0x160/0x202 [radeon]
Code: 9e c0 31 f6 31 ff 41 b4 01 e8 6c 71 d3 e4 0f ba e5 0a 73 13 48 c7 c2 92 23 9e c0 31 f6 31 ff 41 b4 01 e8 53 71 d3 e4 48 89 df <e8> 5f fe ff ff 89 c5 85 c0 0f 85 dc fe ff ff 45 84 e4 74 1a 48 8b
RSP: 0018:ffffb2b5c0a6b9c8 EFLAGS: 00000246
RAX: 0000000000000000 RBX: ffff8f2ff2668000 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff8f2ff2668000
RBP: 0000000002000611 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
R13: 0000000000000000 R14: ffff8f2ff266a0f8 R15: 0000000000000006
FS: 00007fd756341c80(0000) GS:ffff8f365ad40000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fac54603320 CR3: 00000001b075e000 CR4: 0000000000b50ef0
Call Trace:
<IRQ>
? rcu_dump_cpu_stacks+0xe7/0x113
? rcu_sched_clock_irq+0x26b/0x6ff
? tick_sched_do_timer+0x7a/0x7a
? update_process_times+0x73/0x96
? tick_nohz_highres_handler+0x3c/0x72
? __hrtimer_run_queues+0x90/0xfd
? ktime_get_update_offsets_now+0x41/0xaf
? hrtimer_interrupt+0x91/0x183
? __sysvec_apic_timer_interrupt+0x5f/0x74
? sysvec_apic_timer_interrupt+0x5b/0x75
</IRQ>
<TASK>
? asm_sysvec_apic_timer_interrupt+0x1a/0x20
? r100_irq_process+0x160/0x202 [radeon]
? r100_irq_process+0x15d/0x202 [radeon]
radeon_irq_kms_init+0x309/0x39d [radeon]
r420_startup.constprop.0+0xea/0x184 [radeon]
r420_init+0x194/0x20d [radeon]
radeon_device_init+0x946/0xb9a [radeon]
radeon_driver_load_kms+0xbf/0x18b [radeon]
drm_dev_register+0x11e/0x231
? __device_attach_driver+0xeb/0xeb
radeon_pci_probe+0xc8/0xff [radeon]
pci_device_probe+0x8c/0x105
really_probe+0x12f/0x289
__driver_probe_device+0xd4/0x10e
driver_probe_device+0x1a/0x93
__driver_attach+0xd8/0xf7
bus_for_each_dev+0x8d/0xd7
bus_add_driver+0xe4/0x208
driver_register+0x9e/0xe2
? 0xffffffffc0a1d000
do_one_initcall+0xa3/0x210
do_init_module+0x7d/0x232
init_module_from_file+0x8b/0xc2
__do_sys_finit_module+0x177/0x24d
do_syscall_64+0x84/0xee
entry_SYSCALL_64_after_hwframe+0x4b/0x53
RIP: 0033:0x7fd755f2f7a9
Code: ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 57 66 0c 00 f7 d8 64 89 01 48
RSP: 002b:00007ffd1487c998 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
RAX: ffffffffffffffda RBX: 000056107ec41a30 RCX: 00007fd755f2f7a9
RDX: 0000000000000000 RSI: 000056107ec4bb20 RDI: 0000000000000014
RBP: 0000000000000000 R08: 0000000000000001 R09: 000056107ebf16f0
R10: 0000000000000050 R11: 0000000000000246 R12: 000056107ec4bb20
R13: 0000000000020000 R14: 000056107ec41b20 R15: 0000000000000028
</TASK>
rcu: INFO: rcu_sched detected expedited stalls on CPUs/tasks: { 13-.... } 6462 jiffies s: 205 root: 0x1/.
rcu: blocking rcu_node structures (internal RCU debug): l=1:0-15:0x2000/.
Sending NMI from CPU 21 to CPUs 13:
NMI backtrace for cpu 13
CPU: 13 PID: 986 Comm: (udev-worker) Not tainted 6.8.0-rc3-Zen3 #1
Hardware name: To Be Filled By O.E.M. B550M Pro4/B550M Pro4, BIOS P3.20 09/27/2023
RIP: 0010:r100_mm_rreg.constprop.0+0x1e/0x27 [radeon]
Code: 5d 41 5c 31 f6 31 ff e9 23 13 23 e5 89 f0 48 3b 87 d8 01 00 00 72 0a 81 fe ff ff 00 00 76 02 eb a2 48 03 87 00 03 00 00 8b 00 <31> f6 31 ff e9 fc 12 23 e5 f3 0f 1e fa 55 53 48 89 fb 48 8b af 90
RSP: 0018:ffffb2b5c0a6b9a8 EFLAGS: 00000286
RAX: 00000000ffffffff RBX: ffff8f2ff2668000 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000044 RDI: ffff8f2ff2668000
RBP: 0000000002000611 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
R13: 0000000000000000 R14: ffff8f2ff266a0f8 R15: 0000000000000006
FS: 00007fd756341c80(0000) GS:ffff8f365ad40000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fac54603320 CR3: 00000001b075e000 CR4: 0000000000b50ef0
Call Trace:
<NMI>
? nmi_cpu_backtrace+0xc0/0x102
? nmi_cpu_backtrace_handler+0xc/0x18
? nmi_handle+0x5e/0xe8
? default_do_nmi+0x3f/0x211
? exc_nmi+0xf6/0x193
? end_repeat_nmi+0xf/0x4e
? r100_mm_rreg.constprop.0+0x1e/0x27 [radeon]
? r100_mm_rreg.constprop.0+0x1e/0x27 [radeon]
? r100_mm_rreg.constprop.0+0x1e/0x27 [radeon]
</NMI>
<TASK>
r100_irq_ack+0x10/0x3c [radeon]
r100_irq_process+0x165/0x202 [radeon]
radeon_irq_kms_init+0x309/0x39d [radeon]
r420_startup.constprop.0+0xea/0x184 [radeon]
r420_init+0x194/0x20d [radeon]
radeon_device_init+0x946/0xb9a [radeon]
radeon_driver_load_kms+0xbf/0x18b [radeon]
drm_dev_register+0x11e/0x231
? __device_attach_driver+0xeb/0xeb
radeon_pci_probe+0xc8/0xff [radeon]
pci_device_probe+0x8c/0x105
really_probe+0x12f/0x289
__driver_probe_device+0xd4/0x10e
driver_probe_device+0x1a/0x93
__driver_attach+0xd8/0xf7
bus_for_each_dev+0x8d/0xd7
bus_add_driver+0xe4/0x208
driver_register+0x9e/0xe2
? 0xffffffffc0a1d000
do_one_initcall+0xa3/0x210
do_init_module+0x7d/0x232
init_module_from_file+0x8b/0xc2
__do_sys_finit_module+0x177/0x24d
do_syscall_64+0x84/0xee
entry_SYSCALL_64_after_hwframe+0x4b/0x53
RIP: 0033:0x7fd755f2f7a9
Code: ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 57 66 0c 00 f7 d8 64 89 01 48
RSP: 002b:00007ffd1487c998 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
RAX: ffffffffffffffda RBX: 000056107ec41a30 RCX: 00007fd755f2f7a9
RDX: 0000000000000000 RSI: 000056107ec4bb20 RDI: 0000000000000014
RBP: 0000000000000000 R08: 0000000000000001 R09: 000056107ebf16f0
R10: 0000000000000050 R11: 0000000000000246 R12: 000056107ec4bb20
R13: 0000000000020000 R14: 000056107ec41b20 R15: 0000000000000028
</TASK>
INFO: NMI handler (nmi_cpu_backtrace_handler) took too long to run: 1.788 msecs
watchdog: BUG: soft lockup - CPU#13 stuck for 45s! [(udev-worker):986]
Modules linked in: input_leds led_class joydev radeon(+) amd64_edac edac_mce_amd video backlight drm_suballoc_helper i2c_algo_bit drm_ttm_helper ttm snd_hda_codec_hdmi drm_display_helper kvm_amd hid_generic snd_hda_intel snd_intel_dspcfg snd_hda_codec snd_hwdep snd_hda_core wmi_bmof snd_pcm snd_timer evdev snd soundcore rapl wmi k10temp hwmon gpio_amdpt gpio_generic button usbhid hid lz4 lz4_compress lz4_decompress zram loop configfs dmi_sysfs sha512_ssse3 sha512_generic sha256_ssse3 sha1_ssse3 sha1_generic aesni_intel libaes xhci_pci crypto_simd cryptd xhci_hcd ccp usbcore usb_common sunrpc dm_mod pkcs8_key_parser efivarfs
CPU: 13 PID: 986 Comm: (udev-worker) Not tainted 6.8.0-rc3-Zen3 #1
Hardware name: To Be Filled By O.E.M. B550M Pro4/B550M Pro4, BIOS P3.20 09/27/2023
RIP: 0010:r100_mm_rreg.constprop.0+0x1e/0x27 [radeon]
Code: 5d 41 5c 31 f6 31 ff e9 23 13 23 e5 89 f0 48 3b 87 d8 01 00 00 72 0a 81 fe ff ff 00 00 76 02 eb a2 48 03 87 00 03 00 00 8b 00 <31> f6 31 ff e9 fc 12 23 e5 f3 0f 1e fa 55 53 48 89 fb 48 8b af 90
RSP: 0018:ffffb2b5c0a6b9a8 EFLAGS: 00000286
RAX: 00000000ffffffff RBX: ffff8f2ff2668000 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000044 RDI: ffff8f2ff2668000
RBP: 0000000002000611 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
R13: 0000000000000000 R14: ffff8f2ff266a0f8 R15: 0000000000000006
FS: 00007fd756341c80(0000) GS:ffff8f365ad40000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007fac54603320 CR3: 00000001b075e000 CR4: 0000000000b50ef0
Call Trace:
<IRQ>
? watchdog_timer_fn+0x1b5/0x23a
? __hrtimer_run_queues+0x90/0xfd
? ktime_get_update_offsets_now+0x41/0xaf
? hrtimer_interrupt+0x91/0x183
? __sysvec_apic_timer_interrupt+0x5f/0x74
? sysvec_apic_timer_interrupt+0x5b/0x75
</IRQ>
<TASK>
? asm_sysvec_apic_timer_interrupt+0x1a/0x20
? r100_mm_rreg.constprop.0+0x1e/0x27 [radeon]
r100_irq_ack+0x10/0x3c [radeon]
r100_irq_process+0x165/0x202 [radeon]
radeon_irq_kms_init+0x309/0x39d [radeon]
r420_startup.constprop.0+0xea/0x184 [radeon]
r420_init+0x194/0x20d [radeon]
radeon_device_init+0x946/0xb9a [radeon]
radeon_driver_load_kms+0xbf/0x18b [radeon]
drm_dev_register+0x11e/0x231
? __device_attach_driver+0xeb/0xeb
radeon_pci_probe+0xc8/0xff [radeon]
pci_device_probe+0x8c/0x105
really_probe+0x12f/0x289
__driver_probe_device+0xd4/0x10e
driver_probe_device+0x1a/0x93
__driver_attach+0xd8/0xf7
bus_for_each_dev+0x8d/0xd7
bus_add_driver+0xe4/0x208
driver_register+0x9e/0xe2
? 0xffffffffc0a1d000
do_one_initcall+0xa3/0x210
do_init_module+0x7d/0x232
init_module_from_file+0x8b/0xc2
__do_sys_finit_module+0x177/0x24d
do_syscall_64+0x84/0xee
entry_SYSCALL_64_after_hwframe+0x4b/0x53
RIP: 0033:0x7fd755f2f7a9
Code: ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 57 66 0c 00 f7 d8 64 89 01 48
RSP: 002b:00007ffd1487c998 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
RAX: ffffffffffffffda RBX: 000056107ec41a30 RCX: 00007fd755f2f7a9
RDX: 0000000000000000 RSI: 000056107ec4bb20 RDI: 0000000000000014
RBP: 0000000000000000 R08: 0000000000000001 R09: 000056107ebf16f0
R10: 0000000000000050 R11: 0000000000000246 R12: 000056107ec4bb20
R13: 0000000000020000 R14: 000056107ec41b20 R15: 0000000000000028
</TASK>
Kernel panic - not syncing: softlockup: hung tasks
CPU: 13 PID: 986 Comm: (udev-worker) Tainted: G L 6.8.0-rc3-Zen3 #1
Hardware name: To Be Filled By O.E.M. B550M Pro4/B550M Pro4, BIOS P3.20 09/27/2023
Call Trace:
<IRQ>
dump_stack_lvl+0x44/0x6c
panic+0x168/0x2eb
? softlockup_fn+0x37/0x37
watchdog_timer_fn+0x21f/0x23a
__hrtimer_run_queues+0x90/0xfd
? ktime_get_update_offsets_now+0x41/0xaf
hrtimer_interrupt+0x91/0x183
__sysvec_apic_timer_interrupt+0x5f/0x74
sysvec_apic_timer_interrupt+0x5b/0x75
</IRQ>
<TASK>
asm_sysvec_apic_timer_interrupt+0x1a/0x20
RIP: 0010:r100_mm_rreg.constprop.0+0x1e/0x27 [radeon]
Code: 5d 41 5c 31 f6 31 ff e9 23 13 23 e5 89 f0 48 3b 87 d8 01 00 00 72 0a 81 fe ff ff 00 00 76 02 eb a2 48 03 87 00 03 00 00 8b 00 <31> f6 31 ff e9 fc 12 23 e5 f3 0f 1e fa 55 53 48 89 fb 48 8b af 90
RSP: 0018:ffffb2b5c0a6b9a8 EFLAGS: 00000286
RAX: 00000000ffffffff RBX: ffff8f2ff2668000 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000044 RDI: ffff8f2ff2668000
RBP: 0000000002000611 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
R13: 0000000000000000 R14: ffff8f2ff266a0f8 R15: 0000000000000006
r100_irq_ack+0x10/0x3c [radeon]
r100_irq_process+0x165/0x202 [radeon]
radeon_irq_kms_init+0x309/0x39d [radeon]
r420_startup.constprop.0+0xea/0x184 [radeon]
r420_init+0x194/0x20d [radeon]
radeon_device_init+0x946/0xb9a [radeon]
radeon_driver_load_kms+0xbf/0x18b [radeon]
drm_dev_register+0x11e/0x231
? __device_attach_driver+0xeb/0xeb
radeon_pci_probe+0xc8/0xff [radeon]
pci_device_probe+0x8c/0x105
really_probe+0x12f/0x289
__driver_probe_device+0xd4/0x10e
driver_probe_device+0x1a/0x93
__driver_attach+0xd8/0xf7
bus_for_each_dev+0x8d/0xd7
bus_add_driver+0xe4/0x208
driver_register+0x9e/0xe2
? 0xffffffffc0a1d000
do_one_initcall+0xa3/0x210
do_init_module+0x7d/0x232
init_module_from_file+0x8b/0xc2
__do_sys_finit_module+0x177/0x24d
do_syscall_64+0x84/0xee
entry_SYSCALL_64_after_hwframe+0x4b/0x53
RIP: 0033:0x7fd755f2f7a9
Code: ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 57 66 0c 00 f7 d8 64 89 01 48
RSP: 002b:00007ffd1487c998 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
RAX: ffffffffffffffda RBX: 000056107ec41a30 RCX: 00007fd755f2f7a9
RDX: 0000000000000000 RSI: 000056107ec4bb20 RDI: 0000000000000014
RBP: 0000000000000000 R08: 0000000000000001 R09: 000056107ebf16f0
R10: 0000000000000050 R11: 0000000000000246 R12: 000056107ec4bb20
R13: 0000000000020000 R14: 000056107ec41b20 R15: 0000000000000028
</TASK>
Kernel Offset: 0x24000000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
Rebooting in 40 seconds..
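For anyone going through the full dmesg attachments, a small filter like the one below may help to pull out the relevant events. This is only a sketch: the function name `summarize_dmesg` is made up for illustration, and the two patterns are copied from the log above, so they may need adjusting for other kernels.

```python
import re
from collections import Counter

# Warning text exactly as it appears in the dmesg above.
GUI_IDLE_MSG = "Failed to wait GUI idle while programming pipes"

# Matches kernel-mode RIP lines such as
#   RIP: 0010:r100_irq_process+0x160/0x202 [radeon]
# and captures the symbol name and the module name.
RIP_RE = re.compile(
    r"RIP: 0010:([A-Za-z0-9_.]+)\+0x[0-9a-f]+/0x[0-9a-f]+ \[(\w+)\]"
)


def summarize_dmesg(text):
    """Count GUI-idle warnings and kernel RIP symbols per (module, symbol)."""
    gui_idle = 0
    rips = Counter()
    for line in text.splitlines():
        if GUI_IDLE_MSG in line:
            gui_idle += 1
        match = RIP_RE.search(line)
        if match:
            symbol, module = match.groups()
            rips[(module, symbol)] += 1
    return gui_idle, rips


if __name__ == "__main__":
    import sys

    # Usage: dmesg | python3 this_script.py
    count, rips = summarize_dmesg(sys.stdin.read())
    print("GUI idle warnings:", count)
    for (module, symbol), n in rips.most_common():
        print(f"{module}: {symbol} x{n}")
```

User-space RIP lines (the ones starting with "RIP: 0033:") are deliberately ignored, since they only show the finit_module syscall in udev-worker.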
There is no crash when I blacklist the radeon module and use the machine via ssh, via VNC, or from the second card (RX 6700).
Some data about the machine:
# inxi -bz
System:
Kernel: 6.7.4-gentoo-Zen3 arch: x86_64 bits: 64 Desktop: Xfce Distro: Gentoo
Base System release 2.14
Machine:
Type: Desktop Mobo: ASRock model: B550M Pro4 serial: <filter> UEFI: American
Megatrends LLC. v: P3.20 date: 09/27/2023
CPU:
Info: 16-core AMD Ryzen 9 5950X [MT MCP] speed (MHz): avg: 550
min/max: 550/5084
Graphics:
Device-1: AMD RV410 [Radeon X700 PRO] driver: N/A
Device-2: AMD Navi 22 [Radeon RX 6700/6700 XT/6750 XT / 6800M/6850M XT]
driver: amdgpu v: kernel
Display: x11 server: X.org v: 1.21.1.11 driver: X: loaded: radeon
dri: r300 gpu: amdgpu resolution: <missing: xdpyinfo/xrandr>
resolution: 3840x2160
API: EGL v: N/A drivers: N/A platforms: N/A
Network:
Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
driver: r8169
Drives:
Local Storage: total: 931.51 GiB used: 32.06 GiB (3.4%)
Info:
Processes: 560 Uptime: 3m Memory: total: 32 GiB available: 31.28 GiB
used: 1.34 GiB (4.3%) Shell: Bash inxi: 3.3.30
# lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Root Complex
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Starship/Matisse IOMMU
00:01.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:01.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse GPP Bridge
00:01.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse GPP Bridge
00:02.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:03.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:03.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse GPP Bridge
00:04.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:05.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:07.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:07.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Internal PCIe GPP Bridge 0 to bus[E:B]
00:08.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:08.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Internal PCIe GPP Bridge 0 to bus[E:B]
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 61)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse/Vermeer Data Fabric: Device 18h; Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse/Vermeer Data Fabric: Device 18h; Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse/Vermeer Data Fabric: Device 18h; Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse/Vermeer Data Fabric: Device 18h; Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse/Vermeer Data Fabric: Device 18h; Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse/Vermeer Data Fabric: Device 18h; Function 5
00:18.6 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse/Vermeer Data Fabric: Device 18h; Function 6
00:18.7 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse/Vermeer Data Fabric: Device 18h; Function 7
01:00.0 Non-Volatile memory controller: Kingston Technology Company, Inc. A2000 NVMe SSD (rev 03)
02:00.0 USB controller: Advanced Micro Devices, Inc. [AMD] 500 Series Chipset USB 3.1 XHCI Controller
02:00.1 SATA controller: Advanced Micro Devices, Inc. [AMD] 500 Series Chipset SATA Controller
02:00.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] 500 Series Chipset Switch Upstream Port
03:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 43ea
03:08.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 43ea
04:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] RV410 [Radeon X700 PRO]
04:00.1 Display controller: Advanced Micro Devices, Inc. [AMD/ATI] RV410 [Radeon X700 PRO] (Secondary)
05:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 15)
06:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch (rev c5)
07:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch
08:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 22 [Radeon RX 6700/6700 XT/6750 XT / 6800M/6850M XT] (rev c5)
08:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Navi 21/23 HDMI/DP Audio Controller
09:00.0 Non-Essential Instrumentation [1300]: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Function
0a:00.0 Non-Essential Instrumentation [1300]: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Reserved SPP
0a:00.1 Encryption controller: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Cryptographic Coprocessor PSPCPP
0a:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Matisse USB 3.0 Host Controller
Some data about the card:
# lspci -vv -s 04:00.0
04:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] RV410 [Radeon X700 PRO] (prog-if 00 [VGA controller])
Subsystem: Tul Corporation / PowerColor RV410 [Radeon X700 PRO]
Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
Interrupt: pin A routed to IRQ 10
IOMMU group: 0
Region 0: Memory at e8000000 (64-bit, prefetchable) [disabled] [size=128M]
Region 2: Memory at fcd30000 (64-bit, non-prefetchable) [disabled] [size=64K]
Region 4: I/O ports at e000 [disabled] [size=256]
Expansion ROM at fcd00000 [disabled] [size=128K]
Capabilities: [50] Power Management version 2
Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [58] Express (v1) Endpoint, MSI 00
DevCap: MaxPayload 128 bytes, PhantFunc 0, Latency L0s <256ns, L1 <4us
ExtTag+ AttnBtn- AttnInd- PwrInd- RBE- FLReset- SlotPowerLimit 26W
DevCtl: CorrErr+ NonFatalErr+ FatalErr+ UnsupReq+
RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
MaxPayload 128 bytes, MaxReadReq 128 bytes
DevSta: CorrErr- NonFatalErr- FatalErr- UnsupReq- AuxPwr- TransPend-
LnkCap: Port #0, Speed 2.5GT/s, Width x16, ASPM L0s L1, Exit Latency L0s <256ns, L1 <2us
ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp-
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk+
ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
LnkSta: Speed 2.5GT/s, Width x4 (downgraded)
TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
Capabilities: [80] MSI: Enable- Count=1/1 Maskable- 64bit+
Address: 0000000000000000 Data: 0000
Capabilities: [100 v1] Advanced Error Reporting
UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
UESvrt: DLP+ SDES- TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr-
CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr-
AERCap: First Error Pointer: 00, ECRCGenCap- ECRCGenEn- ECRCChkCap- ECRCChkEn-
MultHdrRecCap- MultHdrRecEn- TLPPfxPres- HdrLogCap-
HeaderLog: 04000001 0000200f 04070000 3d7d8e8a
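The lspci output above reports the link as "Width x4 (downgraded)" while the card is capable of x16. That may simply reflect how the slot is wired and may be unrelated to the crash, but in case it helps when comparing against the working cards, here is a sketch that extracts capability and negotiated link values from `lspci -vv` text. The helper names `link_summary` and `is_downgraded` are hypothetical.

```python
import re

WIDTH_RE = re.compile(r"Width x(\d+)")
SPEED_RE = re.compile(r"Speed ([\d.]+)GT/s")


def link_summary(lspci_text):
    """Return {"LnkCap": (speed, width), "LnkSta": (speed, width)} where found."""
    result = {}
    for raw in lspci_text.splitlines():
        line = raw.strip()
        for key in ("LnkCap", "LnkSta"):
            # The trailing colon keeps LnkCap2/LnkSta2 lines out.
            if line.startswith(key + ":"):
                speed = SPEED_RE.search(line)
                width = WIDTH_RE.search(line)
                if speed and width:
                    result[key] = (float(speed.group(1)), int(width.group(1)))
    return result


def is_downgraded(summary):
    """True if the negotiated link is slower or narrower than the capability."""
    cap = summary.get("LnkCap")
    sta = summary.get("LnkSta")
    if cap is None or sta is None:
        return None  # not enough information
    return sta[0] < cap[0] or sta[1] < cap[1]


if __name__ == "__main__":
    import sys

    # Usage: lspci -vv -s 04:00.0 | python3 this_script.py
    summary = link_summary(sys.stdin.read())
    print(summary, "downgraded:", is_downgraded(summary))
```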
Full kernel dmesg of 6.6.13 (Gentoo stock kernel) and of 6.8-rc3 are attached, together with the 6.8-rc3 kernel .config.
Attachments: dmesg_6613_zen3.txt, dmesg_68-rc3_zen3.txt, config_68-rc3_zen3