Few tests - dmesg-fail/abort/incomplete - GuC RC enable failed: -ENODEV
Stdout
Using IGT_SRANDOM=1712866431 for randomisation
Opened device: /dev/dri/card1
Starting subtest: mixed-binds-3145728
Stack trace:
#0 ../../../usr/src/igt-gpu-tools/lib/igt_core.c:1989 __igt_fail_assert()
#1 ../../../usr/src/igt-gpu-tools/tests/intel/xe_vm.c:1097 test_large_binds.constprop.0()
#2 [_fini+0x92c]
Subtest mixed-binds-3145728: FAIL (10.477s)
Stderr
Starting subtest: mixed-binds-3145728
(xe_vm:1806) CRITICAL: Test assertion failure function test_large_binds, file ../../../usr/src/igt-gpu-tools/tests/intel/xe_vm.c:1076:
(xe_vm:1806) CRITICAL: Failed assertion: data[i].data == 0xc0ffee
(xe_vm:1806) CRITICAL: error: 0 != 12648430
Subtest mixed-binds-3145728 failed.
**** DEBUG ****
(xe_vm:1806) DEBUG: Test requirement passed: !(xe_visible_vram_size(fd, 0) && bo_size > xe_visible_vram_size(fd, 0))
(xe_vm:1806) CRITICAL: Test assertion failure function test_large_binds, file ../../../usr/src/igt-gpu-tools/tests/intel/xe_vm.c:1076:
(xe_vm:1806) CRITICAL: Failed assertion: data[i].data == 0xc0ffee
(xe_vm:1806) CRITICAL: error: 0 != 12648430
(xe_vm:1806) igt_core-INFO: Stack trace:
(xe_vm:1806) igt_core-INFO: #0 ../../../usr/src/igt-gpu-tools/lib/igt_core.c:1989 __igt_fail_assert()
(xe_vm:1806) igt_core-INFO: #1 ../../../usr/src/igt-gpu-tools/tests/intel/xe_vm.c:1097 test_large_binds.constprop.0()
(xe_vm:1806) igt_core-INFO: #2 [_fini+0x92c]
**** END ****
Subtest mixed-binds-3145728: FAIL (10.477s)
Dmesg
<6> [964.722989] Console: switching to colour dummy device 80x25
<6> [964.723171] [IGT] xe_vm: executing
<6> [964.727215] [IGT] xe_vm: starting subtest mixed-binds-3145728
<7> [964.840023] xe 0000:00:02.0: [drm:intel_power_well_disable [xe]] disabling DC_off
<7> [964.840425] xe 0000:00:02.0: [drm:skl_enable_dc6 [xe]] Enabling DC6
<7> [964.840744] xe 0000:00:02.0: [drm:gen9_set_dc_state.part.0 [xe]] Setting DC state from 00 to 02
<4> [969.292008] xe 0000:00:02.0: [drm] Pending enable failed to respond
<6> [969.292047] xe 0000:00:02.0: [drm] GT0: trying reset
<6> [969.292060] xe 0000:00:02.0: [drm] GT0: reset queued
<4> [969.292076] xe 0000:00:02.0: [drm] Schedule disable failed to respond
<6> [969.292084] xe 0000:00:02.0: [drm] GT0: trying reset
<6> [969.292121] xe 0000:00:02.0: [drm] GT0: reset started
<3> [969.292150] xe 0000:00:02.0: [drm] *ERROR* GuC RC enable failed: -ENODEV
<4> [969.292314] ------------[ cut here ]------------
<4> [969.292320] WARNING: CPU: 19 PID: 143 at drivers/gpu/drm/xe/xe_uc.c:220 xe_uc_gucrc_disable+0x1f/0x30 [xe]
<4> [969.292603] Modules linked in: hid_sensor_custom_intel_hinge hid_sensor_gyro_3d hid_sensor_magn_3d hid_sensor_incl_3d hid_sensor_rotation hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer hid_sensor_iio_common kfifo_buf industrialio hid_sensor_custom hid_sensor_hub hid_generic intel_ishtp_hid hid snd_sof_pci_intel_tgl snd_sof_intel_hda_common snd_soc_hdac_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof_intel_hda snd_sof intel_uncore_frequency intel_uncore_frequency_common snd_sof_utils x86_pkg_temp_thermal snd_soc_acpi_intel_match intel_powerclamp snd_soc_acpi snd_intel_dspcfg coretemp snd_hda_codec snd_hwdep snd_soc_core kvm_intel cdc_ncm snd_compress cdc_ether snd_sof_intel_hda_mlink usbnet snd_hda_ext_core kvm nls_iso8859_1 crct10dif_pclmul snd_hda_core xe crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel snd_pcm sha512_ssse3 drm_gpuvm sha256_ssse3 cmdlinepart gpu_sched sha1_ssse3 drm_ttm_helper snd_seq mei_pxp mei_hdcp spi_nor ttm aesni_intel crypto_simd mtd cryptd
<4> [969.292886] snd_seq_device i2c_algo_bit processor_thermal_device_pci snd_timer spi_pxa2xx_platform drm_suballoc_helper r8152 processor_thermal_device dw_dmac rapl processor_thermal_wt_hint drm_exec mii intel_cstate libphy intel_rapl_msr efi_pstore drm_display_helper snd dw_dmac_core processor_thermal_rfim idma64 drm_kunit_helpers mei_me i2c_i801 processor_thermal_rapl spi_intel_pci e1000e soundcore drm_kms_helper spi_intel i2c_smbus mei intel_rapl_common processor_thermal_wt_req kunit processor_thermal_power_floor processor_thermal_mbox intel_ish_ipc intel_ishtp int340x_thermal_zone thunderbolt drm_buddy igen6_edac wmi_bmof mac_hid intel_skl_int3472_tps68470 video tps68470_regulator ov13858 clk_tps68470 v4l2_fwnode v4l2_async intel_pmc_core videodev mc wmi intel_vsec intel_skl_int3472_discrete pmt_telemetry pinctrl_tigerlake int3400_thermal pmt_class intel_hid acpi_thermal_rel sparse_keymap acpi_pad acpi_tad sch_fq_codel msr parport_pc ppdev lp parport drm ip_tables x_tables autofs4
<4> [969.293212] CPU: 19 PID: 143 Comm: kworker/u80:2 Tainted: G U W 6.9.0-rc3-xe #1
<4> [969.293223] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS RPLPFWI1.R00.4035.A00.2301200723 01/20/2023
<4> [969.293231] Workqueue: gt-ordered-wq gt_reset_worker [xe]
<4> [969.293408] RIP: 0010:xe_uc_gucrc_disable+0x1f/0x30 [xe]
<4> [969.293623] Code: 90 90 90 90 90 90 90 90 90 90 90 0f 1f 44 00 00 55 48 81 c7 10 0b 00 00 48 89 e5 e8 1b b5 fc ff 85 c0 75 06 5d c3 cc cc cc cc <0f> 0b 5d c3 cc cc cc cc 66 0f 1f 84 00 00 00 00 00 90 90 90 90 90
<4> [969.293631] RSP: 0018:ffffc900006a3db8 EFLAGS: 00010286
<4> [969.293643] RAX: 00000000ffffffed RBX: ffff888148ef82a8 RCX: 0000000000000000
<4> [969.293651] RDX: 0000000000000027 RSI: 00000000ffffb316 RDI: ffff88849f7b1a48
<4> [969.293658] RBP: ffffc900006a3db8 R08: 0000000000000000 R09: 00000000ffffb316
<4> [969.293665] R10: ffffc900006a3a98 R11: ffff8884afa8aa10 R12: 0000000000000000
<4> [969.293671] R13: ffff888148ef8028 R14: ffff888148ef8058 R15: ffff888148ef9920
<4> [969.293678] FS: 0000000000000000(0000) GS:ffff88849f780000(0000) knlGS:0000000000000000
<4> [969.293687] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4> [969.293694] CR2: 00007fcde8f33910 CR3: 000000000aa54000 CR4: 0000000000750ef0
<4> [969.293701] PKRU: 55555554
<4> [969.293707] Call Trace:
<4> [969.293714] <TASK>
<4> [969.293724] ? show_regs+0x67/0x70
<4> [969.293739] ? xe_uc_gucrc_disable+0x1f/0x30 [xe]
<4> [969.293938] ? __warn+0x8e/0x1b0
<4> [969.293955] ? xe_uc_gucrc_disable+0x1f/0x30 [xe]
<4> [969.294151] ? report_bug+0x1b7/0x1d0
<4> [969.294175] ? handle_bug+0x46/0x80
<4> [969.294190] ? exc_invalid_op+0x19/0x70
<4> [969.294205] ? asm_exc_invalid_op+0x1b/0x20
<4> [969.294238] ? xe_uc_gucrc_disable+0x1f/0x30 [xe]
<4> [969.294430] ? xe_uc_gucrc_disable+0x15/0x30 [xe]
<4> [969.294618] gt_reset_worker+0xf4/0x250 [xe]
<4> [969.294796] process_scheduled_works+0x389/0x710
<4> [969.294837] worker_thread+0x159/0x300
<4> [969.294854] ? __pfx_worker_thread+0x10/0x10
<4> [969.294865] kthread+0x105/0x140
<4> [969.294875] ? __pfx_kthread+0x10/0x10
<4> [969.294889] ret_from_fork+0x39/0x60
<4> [969.294898] ? __pfx_kthread+0x10/0x10
<4> [969.294909] ret_from_fork_asm+0x1a/0x30
<4> [969.294949] </TASK>
<4> [969.294954] irq event stamp: 213171
<4> [969.294961] hardirqs last enabled at (213177): [<ffffffff811b29ea>] console_unlock+0x13a/0x150
<4> [969.294975] hardirqs last disabled at (213182): [<ffffffff811b29cf>] console_unlock+0x11f/0x150
<4> [969.294984] softirqs last enabled at (211908): [<ffffffff81ffda97>] neigh_managed_work+0xa7/0xc0
<4> [969.294996] softirqs last disabled at (211904): [<ffffffff81ffda19>] neigh_managed_work+0x29/0xc0
<4> [969.295005] ---[ end trace 0000000000000000 ]---
<7> [969.295684] xe 0000:00:02.0: [drm:xe_reg_sr_apply_mmio [xe]] GT0: Applying GT save-restore MMIOs
<7> [969.295992] xe 0000:00:02.0: [drm:xe_reg_sr_apply_mmio [xe]] GT0: REG[0x9424] = 0xfffffffc
<7> [969.296203] xe 0000:00:02.0: [drm:xe_reg_sr_apply_mmio [xe]] GT0: REG[0x9550] = 0x000003ff
<7> [969.296413] xe 0000:00:02.0: [drm:xe_wopcm_init [xe]] WOPCM: 2048K
<7> [969.296643] xe 0000:00:02.0: [drm:xe_wopcm_init [xe]] GuC WOPCM is already locked [592K, 1420K)
<6> [969.503976] xe 0000:00:02.0: [drm] GT0: GuC load failed: status = 0x400000A0
<6> [969.503997] xe 0000:00:02.0: [drm] GT0: GuC status: Reset = 0, BootROM = 0X50, UKernel = 0X0, MIA = 0X0, Auth = 0X1
<6> [969.504009] xe 0000:00:02.0: [drm] GT0: GuC firmware signature verification failed
<3> [969.562159] xe 0000:00:02.0: [drm] *ERROR* GT0: GuC mmio request 0x508: no reply 0x508
<3> [969.562277] xe 0000:00:02.0: [drm] *ERROR* GT0: Failed to enable GuC CT (-ETIMEDOUT)
<3> [969.562408] xe 0000:00:02.0: [drm] *ERROR* GuC PC reset: -ENODEV
<4> [969.562561] ------------[ cut here ]------------
<4> [969.562569] WARNING: CPU: 16 PID: 143 at drivers/gpu/drm/xe/xe_guc.c:912 xe_guc_start+0x2e/0x40 [xe]
<4> [969.562801] Modules linked in: hid_sensor_custom_intel_hinge hid_sensor_gyro_3d hid_sensor_magn_3d hid_sensor_incl_3d hid_sensor_rotation hid_sensor_accel_3d hid_sensor_als hid_sensor_trigger industrialio_triggered_buffer hid_sensor_iio_common kfifo_buf industrialio hid_sensor_custom hid_sensor_hub hid_generic intel_ishtp_hid hid snd_sof_pci_intel_tgl snd_sof_intel_hda_common snd_soc_hdac_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof_intel_hda snd_sof intel_uncore_frequency intel_uncore_frequency_common snd_sof_utils x86_pkg_temp_thermal snd_soc_acpi_intel_match intel_powerclamp snd_soc_acpi snd_intel_dspcfg coretemp snd_hda_codec snd_hwdep snd_soc_core kvm_intel cdc_ncm snd_compress cdc_ether snd_sof_intel_hda_mlink usbnet snd_hda_ext_core kvm nls_iso8859_1 crct10dif_pclmul snd_hda_core xe crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel snd_pcm sha512_ssse3 drm_gpuvm sha256_ssse3 cmdlinepart gpu_sched sha1_ssse3 drm_ttm_helper snd_seq mei_pxp mei_hdcp spi_nor ttm aesni_intel crypto_simd mtd cryptd
<4> [969.563107] snd_seq_device i2c_algo_bit processor_thermal_device_pci snd_timer spi_pxa2xx_platform drm_suballoc_helper r8152 processor_thermal_device dw_dmac rapl processor_thermal_wt_hint drm_exec mii intel_cstate libphy intel_rapl_msr efi_pstore drm_display_helper snd dw_dmac_core processor_thermal_rfim idma64 drm_kunit_helpers mei_me i2c_i801 processor_thermal_rapl spi_intel_pci e1000e soundcore drm_kms_helper spi_intel i2c_smbus mei intel_rapl_common processor_thermal_wt_req kunit processor_thermal_power_floor processor_thermal_mbox intel_ish_ipc intel_ishtp int340x_thermal_zone thunderbolt drm_buddy igen6_edac wmi_bmof mac_hid intel_skl_int3472_tps68470 video tps68470_regulator ov13858 clk_tps68470 v4l2_fwnode v4l2_async intel_pmc_core videodev mc wmi intel_vsec intel_skl_int3472_discrete pmt_telemetry pinctrl_tigerlake int3400_thermal pmt_class intel_hid acpi_thermal_rel sparse_keymap acpi_pad acpi_tad sch_fq_codel msr parport_pc ppdev lp parport drm ip_tables x_tables autofs4
<4> [969.563458] CPU: 16 PID: 143 Comm: kworker/u80:2 Tainted: G U W 6.9.0-rc3-xe #1
<4> [969.563469] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-P DDR5 RVP, BIOS RPLPFWI1.R00.4035.A00.2301200723 01/20/2023
<4> [969.563478] Workqueue: gt-ordered-wq gt_reset_worker [xe]
<4> [969.563669] RIP: 0010:xe_guc_start+0x2e/0x40 [xe]
<4> [969.563915] Code: 00 55 48 89 e5 41 54 49 89 fc 48 81 c7 10 0b 00 00 e8 56 c1 00 00 85 c0 75 10 4c 89 e7 e8 2a 20 01 00 41 5c 5d c3 cc cc cc cc <0f> 0b 4c 89 e7 e8 18 20 01 00 41 5c 5d c3 cc cc cc cc 90 90 90 90
<4> [969.563927] RSP: 0018:ffffc900006a3da0 EFLAGS: 00010286
<4> [969.563940] RAX: 00000000ffffffed RBX: ffff888148ef82a8 RCX: 0000000000000006
<4> [969.563950] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffffff823810ec