igt@i915_selftest@live@gt_mocs - incomplete - watchdog: Watchdog detected hard LOCKUP on cpu
https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_9683/bat-dg2-11/igt@i915_selftest@live@gt_mocs.html https://intel-gfx-ci.01.org/tree/drm-tip/IGTPW_9683/bat-dg2-11/pstore0-2861934275_Panic_1.txt
<6> [318.867242] [IGT] i915_selftest: starting dynamic subtest gt_mocs
<5> [318.934754] Setting dangerous option live_selftests - tainting kernel
<7> [318.962314] i915 0000:00:02.0: [drm:intel_gt_common_init_early [i915]] WOPCM: 2048K
<7> [318.962480] i915 0000:00:02.0: [drm:intel_uc_init_early [i915]] GT0: enable_guc=3 (guc:yes submission:yes huc:yes slpc:yes)
<7> [318.962648] i915 0000:00:02.0: [drm:intel_pch_type [i915]] Found Alder Lake PCH
<7> [318.962773] i915 0000:00:02.0: [drm:intel_gt_probe_all [i915]] GT0: Setting up Primary GT
<6>[ 320.336720] i915 0000:00:02.0: [drm] GT0: GUC: submission enabled
<6>[ 320.336723] i915 0000:00:02.0: [drm] GT0: GUC: SLPC enabled
<0>[ 329.992690] watchdog: Watchdog detected hard LOCKUP on cpu 3
<4>[ 329.992693] Modules linked in: i915(+) mei_pxp mei_hdcp mei_gsc vgem drm_shmem_helper snd_hda_codec_hdmi snd_intel_dspcfg snd_hda_codec snd_hwdep snd_hda_core snd_pcm prime_numbers i2c_algo_bit ttm drm_buddy drm_display_helper fuse r8153_ecm cdc_ether usbnet x86_pkg_temp_thermal coretemp kvm_intel kvm e1000e r8152 irqbypass video crct10dif_pclmul wmi_bmof mii crc32_pclmul ghash_clmulni_intel mei_me ptp i2c_i801 pps_core mei i2c_smbus intel_lpss_pci wmi [last unloaded: i915]
<4>[ 329.992711] irq event stamp: 5964783
<4>[ 329.992711] hardirqs last enabled at (5964782): [<ffffffff81cc1273>] _raw_spin_unlock_irq+0x23/0x50
<4>[ 329.992717] hardirqs last disabled at (5964783): [<ffffffff81cc1006>] _raw_spin_lock_irqsave+0x56/0x60
<4>[ 329.992718] softirqs last enabled at (5964760): [<ffffffff81cc243d>] __do_softirq+0x2bd/0x3a9
<4>[ 329.992720] softirqs last disabled at (5964779): [<ffffffff810d803e>] irq_exit_rcu+0x8e/0xd0
<4>[ 329.992724] CPU: 3 PID: 0 Comm: swapper/3 Tainted: G U 6.5.0-CI_DRM_13577-gbb585492db95+ #1
<4>[ 329.992725] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS ADLPFWI1.R00.2374.A01.2109160719 09/16/2021
<4>[ 329.992726] RIP: 0010:check_chain_key+0xed/0x1d0
<4>[ 329.992729] Code: 8d 1c c6 4c 39 2b 0f 85 ce 00 00 00 0f b7 53 20 8b 3d 0f d5 f8 02 66 81 e2 ff 1f 85 ff 75 16 0f b7 d2 48 0f a3 15 83 7b 22 02 <73> 37 0f b7 53 20 66 81 e2 ff 1f 48 85 c9 74 0f 0f b6 41 21 32 43
<4>[ 329.992730] RSP: 0018:ffffc900004bcbd8 EFLAGS: 00000047
<4>[ 329.992731] RAX: 0000000000000005 RBX: ffff8881010e5cc8 RCX: ffff8881010e5ca0
<4>[ 329.992732] RDX: 000000000000005f RSI: ffff8881010e5ca0 RDI: 0000000000000000
<4>[ 329.992733] RBP: ffff8881010e52c0 R08: 00000000ffffffff R09: ffff8881010e52c0
<4>[ 329.992733] R10: 0000000000000001 R11: ffff8881010e52c0 R12: 0000000000000001
<4>[ 329.992734] R13: 36224cf48d186805 R14: 0000000000005e04 R15: 0000000000000000
<4>[ 329.992734] FS: 0000000000000000(0000) GS:ffff88849ef80000(0000) knlGS:0000000000000000
<4>[ 329.992735] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[ 329.992736] CR2: 00005602e70fb1d0 CR3: 0000000006638000 CR4: 0000000000f50ee0
<4>[ 329.992736] PKRU: 55555554
<4>[ 329.992737] Call Trace:
<4>[ 329.992738] <NMI>
<4>[ 329.992738] ? watchdog_hardlockup_check+0xf4/0x1b0
<4>[ 329.992742] ? __perf_event_overflow+0xfa/0x1e0
<4>[ 329.992745] ? handle_pmi_common+0x18b/0x390
<4>[ 329.992749] ? intel_pmu_handle_irq+0x12c/0x5c0
<4>[ 329.992750] ? rcu_is_watching+0x11/0x50
<4>[ 329.992753] ? perf_event_nmi_handler+0x27/0x50
<4>[ 329.992755] ? nmi_handle+0xc8/0x260
<4>[ 329.992756] ? check_chain_key+0xed/0x1d0
<4>[ 329.992758] ? default_do_nmi+0x6b/0x180
<4>[ 329.992760] ? exc_nmi+0x106/0x130
<4>[ 329.992761] ? end_repeat_nmi+0x16/0x67
<4>[ 329.992764] ? check_chain_key+0xed/0x1d0
<4>[ 329.992765] ? check_chain_key+0xed/0x1d0
<4>[ 329.992766] ? check_chain_key+0xed/0x1d0
<4>[ 329.992767] </NMI>
<4>[ 329.992768] <IRQ>
<4>[ 329.992768] __lock_acquire+0xa9a/0x2300
<4>[ 329.992771] lock_acquire+0xd8/0x2d0
<4>[ 329.992773] ? qi_submit_sync+0x304/0x7b0
<4>[ 329.992776] _raw_spin_lock+0x2e/0x40
<4>[ 329.992778] ? qi_submit_sync+0x304/0x7b0
<4>[ 329.992779] qi_submit_sync+0x304/0x7b0
<4>[ 329.992782] ? __pfx_fq_flush_timeout+0x10/0x10
<4>[ 329.992784] qi_flush_iotlb+0x82/0xa0
<4>[ 329.992786] intel_flush_iotlb_all+0x78/0x110
<4>[ 329.992787] ? __pfx_fq_flush_timeout+0x10/0x10
<4>[ 329.992788] fq_flush_iotlb+0x1d/0x30
<4>[ 329.992790] fq_flush_timeout+0x27/0xc0
<4>[ 329.992792] ? __pfx_fq_flush_timeout+0x10/0x10
<4>[ 329.992793] ? __pfx_fq_flush_timeout+0x10/0x10
<4>[ 329.992794] call_timer_fn+0xa1/0x220
<4>[ 329.992797] run_timer_softirq+0x4b5/0x570
<4>[ 329.992800] ? ktime_get+0x58/0x140
<4>[ 329.992802] ? ktime_get+0x8c/0x140
<4>[ 329.992804] __do_softirq+0xc3/0x3a9
<4>[ 329.992805] irq_exit_rcu+0x8e/0xd0
<4>[ 329.992806] sysvec_apic_timer_interrupt+0xa6/0xd0
<4>[ 329.992808] </IRQ>
<4>[ 329.992808] <TASK>
<4>[ 329.992809] asm_sysvec_apic_timer_interrupt+0x1a/0x20
<4>[ 329.992810] RIP: 0010:cpuidle_enter_state+0xf3/0x4e0
<4>[ 329.992811] Code: 7e 89 c0 48 0f a3 05 7c 54 bd 00 0f 82 ed 02 00 00 31 ff e8 7f 37 48 ff 45 84 ff 0f 85 bd 02 00 00 e8 11 63 55 ff fb 45 85 f6 <0f> 88 de 01 00 00 49 63 c6 4c 2b 2c 24 48 8d 14 40 48 8d 14 90 49
<4>[ 329.992812] RSP: 0018:ffffc900001a3e78 EFLAGS: 00000202
<4>[ 329.992813] RAX: 0000000000000003 RBX: 0000000000000004 RCX: 000000000000001f
<4>[ 329.992813] RDX: 0000000000000000 RSI: ffffffff82418fb8 RDI: ffffffff823f4a12
<4>[ 329.992813] RBP: ffffe8ffff5c8f40 R08: 0000000000000001 R09: 0000000000000001
<4>[ 329.992814] R10: 0000000000000001 R11: ffff8881010e52c0 R12: ffffffff827b15c0
<4>[ 329.992814] R13: 0000004a9729a6cf R14: 0000000000000004 R15: 0000000000000000
<4>[ 329.992816] ? cpuidle_enter_state+0xef/0x4e0
<4>[ 329.992817] cpuidle_enter+0x28/0x40
<4>[ 329.992819] do_idle+0x1eb/0x250
<4>[ 329.992821] cpu_startup_entry+0x18/0x20
<4>[ 329.992822] start_secondary+0x115/0x140
<4>[ 329.992824] secondary_startup_64_no_verify+0x167/0x16b
<4>[ 329.992827] </TASK>
<0>[ 329.992827] Kernel panic - not syncing: Hard LOCKUP
<0>[ 331.014292] Shutting down cpus with NMI
<0>[ 331.014298] Kernel Offset: disabled
<4>[ 331.014298] CPU: 3 PID: 0 Comm: swapper/3 Tainted: G U 6.5.0-CI_DRM_13577-gbb585492db95+ #1
<4>[ 331.014299] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS ADLPFWI1.R00.2374.A01.2109160719 09/16/2021
<4>[ 331.014300] Call Trace:
<4>[ 331.014300] <NMI>
<4>[ 331.014301] dump_stack_lvl+0x64/0xb0
<4>[ 331.014304] panic+0x2e3/0x2f0
<4>[ 331.014307] nmi_panic+0x32/0x40
<4>[ 331.014308] watchdog_hardlockup_check+0x129/0x1b0
<4>[ 331.014310] __perf_event_overflow+0xfa/0x1e0
<4>[ 331.014312] handle_pmi_common+0x18b/0x390
<4>[ 331.014316] intel_pmu_handle_irq+0x12c/0x5c0
<4>[ 331.014317] ? rcu_is_watching+0x11/0x50
<4>[ 331.014318] perf_event_nmi_handler+0x27/0x50
<4>[ 331.014320] nmi_handle+0xc8/0x260
<4>[ 331.014321] ? check_chain_key+0xed/0x1d0
<4>[ 331.014322] default_do_nmi+0x6b/0x180
<4>[ 331.014324] exc_nmi+0x106/0x130
<4>[ 331.014325] end_repeat_nmi+0x16/0x67
<4>[ 331.014326] RIP: 0010:check_chain_key+0xed/0x1d0
<4>[ 331.014328] Code: 8d 1c c6 4c 39 2b 0f 85 ce 00 00 00 0f b7 53 20 8b 3d 0f d5 f8 02 66 81 e2 ff 1f 85 ff 75 16 0f b7 d2 48 0f a3 15 83 7b 22 02 <73> 37 0f b7 53 20 66 81 e2 ff 1f 48 85 c9 74 0f 0f b6 41 21 32 43
<4>[ 331.014328] RSP: 0018:ffffc900004bcbd8 EFLAGS: 00000047
<4>[ 331.014329] RAX: 0000000000000005 RBX: ffff8881010e5cc8 RCX: ffff8881010e5ca0
<4>[ 331.014329] RDX: 000000000000005f RSI: ffff8881010e5ca0 RDI: 0000000000000000
<4>[ 331.014330] RBP: ffff8881010e52c0 R08: 00000000ffffffff R09: ffff8881010e52c0
<4>[ 331.014330] R10: 0000000000000001 R11: ffff8881010e52c0 R12: 0000000000000001
<4>[ 331.014331] R13: 36224cf48d186805 R14: 0000000000005e04 R15: 0000000000000000
<4>[ 331.014332] ? check_chain_key+0xed/0x1d0
<4>[ 331.014334] ? check_chain_key+0xed/0x1d0
<4>[ 331.014335] </NMI>
<4>[ 331.014335] <IRQ>
<4>[ 331.014335] __lock_acquire+0xa9a/0x2300
Edited by SAI NANDAN