igt@i915_selftest@live@workarounds - incomplete - scrub_guc_desc_for_outstanding_g2h:\d+ GEM_BUG_ON(!do_put && !destroyed)
<0> [175.504312] kworker/-5380 7..... 161680920us : __intel_context_retire: 0000:00:02.0 vecs0: context:122b retire runtime: { total:0ns, avg:0ns }
<0> [175.504375] i915_sel-5819 3d.... 161681496us : intel_guc_submission_reset_prepare.cold.154: scrub_guc_desc_for_outstanding_g2h:1043 GEM_BUG_ON(!do_put && !destroyed)
<0> [175.504380] ---------------------------------
<4> [175.504393] ------------[ cut here ]------------
<2> [175.504394] kernel BUG at drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c:1043!
<4> [175.504398] invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
<4> [175.504400] CPU: 3 PID: 5819 Comm: i915_selftest Tainted: G U W 5.15.0-CI-CI_DRM_10820+ #1
<4> [175.504401] Hardware name: ASUS System Product Name/TUF GAMING Z590-PLUS WIFI, BIOS 0811 04/06/2021
<4> [175.504402] RIP: 0010:intel_guc_submission_reset_prepare.cold.154+0x5b/0x5d [i915]
<4> [175.504510] Code: 00 48 c7 c2 00 b8 57 a0 48 c7 c7 6e 6d 55 a0 e8 1d 65 c5 e0 bf 01 00 00 00 e8 13 34 c5 e0 31 f6 bf 09 00 00 00 e8 27 8d b5 e0 <0f> 0b 48 c7 c1 f4 e7 5e a0 ba 62 05 00 00 48 c7 c6 60 b7 57 a0 48
<4> [175.504512] RSP: 0018:ffffc90000ff3958 EFLAGS: 00010046
<4> [175.504513] RAX: 0000000000000240 RBX: ffff888134a9b8b8 RCX: 000000000000000c
<4> [175.504514] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000009
<4> [175.504515] RBP: ffff888130a03f68 R08: 0000000000000000 R09: c0000001000384cf
<4> [175.504516] R10: 0000000001ce7b60 R11: ffffc90000ff3740 R12: ffff888134a9c1d8
<4> [175.504517] R13: 0000000000000040 R14: 0000000000000000 R15: ffff888130a03c40
<4> [175.504518] FS: 00007f792bf78c00(0000) GS:ffff88844ef80000(0000) knlGS:0000000000000000
<4> [175.504519] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [175.504520] CR2: 0000559547e0e140 CR3: 00000001276de001 CR4: 0000000000770ee0
<4> [175.504521] PKRU: 55555554
<4> [175.504522] Call Trace:
<4> [175.504524] ? _raw_spin_unlock_irqrestore+0x50/0x60
<4> [175.504527] intel_uc_reset_prepare+0x44/0x50 [i915]
<4> [175.504618] reset_prepare+0x5c/0x80 [i915]
<4> [175.504695] intel_gt_reset+0x146/0x350 [i915]
<4> [175.504765] ? verify_wa_lists+0x80/0xa1 [i915]
<4> [175.504867] live_gpu_reset_workarounds.cold.75+0x8c/0xdf [i915]
<4> [175.504962] __i915_subtests.cold.7+0x3f/0x92 [i915]
<4> [175.505067] ? __i915_live_teardown+0x50/0x50 [i915]
<4> [175.505158] ? __intel_gt_live_setup+0x30/0x30 [i915]
<4> [175.505245] __run_selftests.part.3+0x10a/0x172 [i915]
<4> [175.505350] i915_live_selftests.cold.5+0x1f/0x47 [i915]
<4> [175.505454] i915_pci_probe+0x93/0x1d0 [i915]
<4> [175.505521] ? _raw_spin_unlock_irqrestore+0x3d/0x60
<4> [175.505523] pci_device_probe+0x9b/0x110
<4> [175.505525] really_probe+0x1b0/0x3b0
<4> [175.505528] __driver_probe_device+0xf6/0x170
<4> [175.505530] driver_probe_device+0x1a/0x90
<4> [175.505531] __driver_attach+0x93/0x160
<4> [175.505532] ? __device_attach_driver+0xd0/0xd0
<4> [175.505533] ? __device_attach_driver+0xd0/0xd0
<4> [175.505534] bus_for_each_dev+0x72/0xc0
<4> [175.505536] bus_add_driver+0x14b/0x1f0
<4> [175.505538] driver_register+0x66/0xb0
<4> [175.505540] i915_init+0x1f/0x91 [i915]
<4> [175.505603] ? 0xffffffffa0755000
<4> [175.505604] do_one_initcall+0x53/0x2e0
<4> [175.505606] ? kmem_cache_alloc_trace+0x489/0x5a0
<4> [175.505608] do_init_module+0x55/0x200
<4> [175.505611] load_module+0x2700/0x2980
<4> [175.505614] ? __do_sys_finit_module+0xaa/0x110
<4> [175.505616] __do_sys_finit_module+0xaa/0x110
<4> [175.505619] do_syscall_64+0x37/0xb0
<4> [175.505620] entry_SYSCALL_64_after_hwframe+0x44/0xae
<4> [175.505622] RIP: 0033:0x7f792ed4989d
<4> [175.505623] Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d c3 f5 0c 00 f7 d8 64 89 01 48
<4> [175.505625] RSP: 002b:00007ffceb873628 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
<4> [175.505626] RAX: ffffffffffffffda RBX: 000055aca52dc7b0 RCX: 00007f792ed4989d
<4> [175.505627] RDX: 0000000000000000 RSI: 000055aca52d40d0 RDI: 0000000000000006
<4> [175.505628] RBP: 0000000000000020 R08: 00007ffceb872400 R09: 000055aca52d2270
<4> [175.505629] R10: 00007ffceb873770 R11: 0000000000000246 R12: 000055aca52d40d0
<4> [175.505630] R13: 0000000000000000 R14: 000055aca52cdb00 R15: 000055aca52dc7b0
<4> [175.505632] Modules linked in: i915(+) vgem drm_shmem_helper fuse snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio btusb btrtl btbcm btintel bluetooth ecdh_generic ecc mei_hdcp x86_pkg_temp_thermal coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_intel_dspcfg igc snd_hda_codec snd_hwdep ptp snd_hda_core pps_core snd_pcm mei_me i2c_i801 ttm i2c_smbus mei prime_numbers intel_lpss_pci [last unloaded: i915]
<4> [175.505649] ---[ end trace f3ecb7f5e4d00041 ]---
We had a similar bug on DG1 #4217 (closed)