[regression 6.4.13->6.5.3] Multiple amdgpu crashes (events_long drm_dp_check_and_send_link_address [drm_display_helper])
I'm using Gentoo Linux on a Raptor CS Talos 2 ppc64le (4K page size), GPU is an AMD RX 570. I'm using DisplayPort MST: the GPU is attached to a Dell UltraSharp U2515H which is chained towards an identical display. Everything seem to be working fine except for the multiple crashes.
[ 9.123152] WARNING: CPU: 0 PID: 7 at drivers/gpu/drm/drm_mode_object.c:45 __drm_mode_object_add+0x64/0x130
[ 9.123167] Modules linked in: overlay binfmt_misc amdgpu rt2800usb rt2x00usb rt2800lib rt2x00lib mac80211 libarc4 snd_hda_codec_hdmi cfg80211 amdxcp uvcvideo at24(+) snd_hda_intel snd_usb_audio joydev uvc snd_intel_dspcfg gpu_sched videobuf2_vmalloc videobuf2_memops drm_suballoc_helper i2c_algo_bit snd_usbmidi_lib videobuf2_v4l2 rfkill snd_hda_codec videobuf2_common snd_ump drm_ttm_helper ttm snd_rawmidi snd_hda_core videodev snd_seq_device crct10dif_vpmsum snd_hwdep drm_display_helper mc ipmi_powernv ofpart snd_pcm cec ipmi_devintf powernv_flash ipmi_msghandler snd_timer rtc_opal mtd opal_prd drm_kms_helper onboard_usb_hub i2c_opal snd drm_buddy mfd_core soundcore lz4 lz4_compress zram loop fuse vmx_crypto crc32c_vpmsum nvme tg3 nvme_core nvme_common
[ 9.123275] CPU: 0 PID: 7 Comm: kworker/0:0 Not tainted 6.5.3-gentoo-dist #1
[ 9.123282] Hardware name: T2P9S01 REV 1.01 POWER9 0x4e1202 opal:skiboot-9858186 PowerNV
[ 9.123287] Workqueue: events_long drm_dp_check_and_send_link_address [drm_display_helper]
[ 9.123302] NIP: c000000000cbe424 LR: c000000000ccaca0 CTR: 0000000000000000
[ 9.123307] REGS: c000000003503450 TRAP: 0700 Not tainted (6.5.3-gentoo-dist)
[ 9.123313] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE> CR: 44002420 XER: 00000034
[ 9.123332] CFAR: c000000000cbe4e4 IRQMASK: 0
GPR00: c000000000ccaca0 c0000000035036f0 c000000001965200 c000000027480010
GPR04: c000000036817110 00000000b0b0b0b0 0000000000000001 0000000000000000
GPR08: 0000000000000c56 0000000000000001 0000000000000001 0000000000002000
GPR12: c000000000c9c9d0 c000000002ab4000 0000000000000003 c00000003d04e000
GPR16: 0000000000000001 0000000000000000 0000000000000000 0000000000000001
GPR20: 0000000000000001 c000000027480010 0000000000000000 c00000003b6b0000
GPR24: c000000003503a98 c00800000dd21684 00000000b0b0b0b0 c0000000017ee400
GPR28: 0000000000000001 0000000000000000 c000000036817110 c000000027480010
[ 9.123408] NIP [c000000000cbe424] __drm_mode_object_add+0x64/0x130
[ 9.123417] LR [c000000000ccaca0] drm_property_create+0xf0/0x250
[ 9.123425] Call Trace:
[ 9.123429] [c0000000035036f0] [c00000000046c0a8] __kmalloc+0x68/0x230 (unreliable)
[ 9.123442] [c000000003503740] [c000000000ccaca0] drm_property_create+0xf0/0x250
[ 9.123452] [c000000003503790] [c000000000ccb188] drm_property_create_enum+0x48/0xd0
[ 9.123461] [c0000000035037d0] [c000000000c9c950] drm_mode_create_colorspace_property+0x120/0x170
[ 9.123471] [c000000003503960] [c00800000fb59c6c] amdgpu_dm_connector_init_helper+0x374/0x3c0 [amdgpu]
[ 9.123591] [c0000000035039c0] [c00800000fb64180] set_dsc_configs_from_fairness_vars+0x3f8/0x550 [amdgpu]
[ 9.123712] [c000000003503a30] [c00800000d8685cc] drm_dp_mst_port_add_connector+0x2a4/0x3e0 [drm_display_helper]
[ 9.123726] [c000000003503bf0] [c00800000d870e64] drm_dp_send_link_address+0x76c/0xcb0 [drm_display_helper]
[ 9.123738] [c000000003503d70] [c00800000d871494] drm_dp_check_and_send_link_address+0xec/0x3a0 [drm_display_helper]
[ 9.123751] [c000000003503dc0] [c00800000d871668] drm_dp_check_and_send_link_address+0x2c0/0x3a0 [drm_display_helper]
[ 9.123764] [c000000003503e60] [c000000000174f64] process_one_work+0x2d4/0x570
[ 9.123775] [c000000003503f00] [c0000000001752a8] worker_thread+0xa8/0x5f0
[ 9.123785] [c000000003503f90] [c000000000183118] kthread+0x138/0x140
[ 9.123794] [c000000003503fe0] [c00000000000ded8] start_kernel_thread+0x14/0x18
[ 9.123802] Code: 91610008 7c7f1b78 7c9e2378 7cba2b78 7cdc3378 7cfd3b78 f8010010 f821ffb1 e9430030 e94a0000 2c2a0000 418200b0 <0b090000> 3b7f01d0 7f63db78 48688659
[ 9.123829] ---[ end trace 0000000000000000 ]---
[ 9.174420] bridge: filtering via arp/ip/ip6tables is no longer available by default. Update your scripts to load br_netfilter if you need this.
Might be a duplicate of #2824 (closed)