Few tests - abort - is trying to acquire lock:, at: i915_hwmon_power_max_disable.*, but task is already holding lock:, at: intel_gt_reset.*
<6> [113.999959] [IGT] gem_ctx_exec: starting subtest basic-norecovery
<7> [114.000335] [drm:eb_lookup_vmas [i915]] EINVAL at eb_validate_vma:511
<6> [114.181845] i915 0000:03:00.0: [drm] GPU HANG: ecode 12:0:00000000
<6> [114.181848] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
<6> [114.181850] Please file a _new_ bug report at https://gitlab.freedesktop.org/drm/intel/issues/new.
<6> [114.181851] Please see https://drm.pages.freedesktop.org/intel-docs/how-to-file-i915-bugs.html for details.
<6> [114.181852] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
<6> [114.181853] The GPU crash dump is required to analyze GPU hangs, so please always attach it.
<6> [114.181855] GPU crash dump saved to /sys/class/drm/card0/error
<5> [114.183453] i915 0000:03:00.0: [drm] GT0: Resetting chip for Manually reset engine mask to ffffffffffffffff
<5> [114.185016] i915 0000:03:00.0: [drm] gem_ctx_exec[910] context reset due to GPU hang
<7> [114.185050] i915 0000:03:00.0: [drm:__i915_request_reset [i915]] client gem_ctx_exec[910]: gained 1 ban score, now 1
<6> [114.185584] i915 0000:03:00.0: [drm] GT0: GuC firmware i915/dg2_guc_70.bin version 70.20.0
<6> [114.185587] i915 0000:03:00.0: [drm] GT0: HuC firmware i915/dg2_huc_gsc.bin version 7.10.3
<4> [114.185615]
<4> [114.185618] ======================================================
<4> [114.185621] WARNING: possible circular locking dependency detected
<4> [114.185623] 6.8.0-rc7-CI_DRM_14400-gc9b9b8d44492+ #1 Not tainted
<4> [114.185626] ------------------------------------------------------
<4> [114.185629] gem_ctx_exec/910 is trying to acquire lock:
<4> [114.185632] ffff88811b274240 (&hwmon->hwmon_lock){+.+.}-{3:3}, at: i915_hwmon_power_max_disable+0x43/0xb0 [i915]
<4> [114.185799]
but task is already holding lock:
<4> [114.185801] ffff88810d8418e8 (&gt->reset.mutex){+.+.}-{3:3}, at: intel_gt_reset+0x181/0x480 [i915]
<4> [114.185930]
which lock already depends on the new lock.
<4> [114.185933]
the existing dependency chain (in reverse order) is:
<4> [114.185936]
-> #2 (&gt->reset.mutex){+.+.}-{3:3}:
<4> [114.185940] lock_acquire+0xd8/0x2d0
<4> [114.185945] i915_gem_shrinker_taints_mutex+0x31/0x50 [i915]
<4> [114.186085] intel_gt_init_reset+0x65/0x80 [i915]
<4> [114.186255] intel_gt_common_init_early+0xd9/0x120 [i915]
<4> [114.186400] intel_root_gt_init_early+0x5e/0x70 [i915]
<4> [114.186512] i915_driver_probe+0x1e6/0xd40 [i915]
<4> [114.186607] i915_pci_probe+0xd5/0x200 [i915]
<4> [114.186703] pci_device_probe+0x95/0x120
<4> [114.186707] really_probe+0x164/0x3c0
<4> [114.186711] __driver_probe_device+0x73/0x160
<4> [114.186714] driver_probe_device+0x19/0xa0
<4> [114.186718] __driver_attach+0xb6/0x180
<4> [114.186721] bus_for_each_dev+0x77/0xd0
<4> [114.186724] bus_add_driver+0x114/0x210
<4> [114.186727] driver_register+0x5b/0x110
<4> [114.186730] 0xffffffffa008f033
<4> [114.186736] do_one_initcall+0x57/0x270
<4> [114.186740] do_init_module+0x5f/0x210
<4> [114.186743] load_module+0x1d1a/0x1f80
<4> [114.186745] init_module_from_file+0x86/0xd0
<4> [114.186748] idempotent_init_module+0x17c/0x230
<4> [114.186751] __x64_sys_finit_module+0x56/0xb0
<4> [114.186754] do_syscall_64+0x6f/0x140
<4> [114.186757] entry_SYSCALL_64_after_hwframe+0x6e/0x76
<4> [114.186761]
-> #1 (fs_reclaim){+.+.}-{0:0}:
<4> [114.186766] lock_acquire+0xd8/0x2d0
<4> [114.186769] fs_reclaim_acquire+0xa1/0xd0
<4> [114.186773] __kmalloc+0x9a/0x350
<4> [114.186776] acpi_ns_internalize_name.part.0+0x4a/0xb0
<4> [114.186780] acpi_ns_get_node_unlocked+0x60/0xf0
<4> [114.186783] acpi_ns_get_node+0x3b/0x60
<4> [114.186785] acpi_get_handle+0x57/0xb0
<4> [114.186788] acpi_has_method+0x20/0x50
<4> [114.186792] acpi_pci_set_power_state+0x43/0x120
<4> [114.186795] pci_power_up+0x24/0x1c0
<4> [114.186798] pci_pm_default_resume_early+0x9/0x30
<4> [114.186802] pci_pm_runtime_resume+0x2d/0x90
<4> [114.186805] __rpm_callback+0x3c/0x110
<4> [114.186809] rpm_callback+0x58/0x70
<4> [114.186812] rpm_resume+0x51e/0x730
<4> [114.186815] rpm_resume+0x267/0x730
<4> [114.186818] rpm_resume+0x267/0x730
<4> [114.186821] rpm_resume+0x267/0x730
<4> [114.186824] __pm_runtime_resume+0x49/0x90
<4> [114.186827] __intel_runtime_pm_get+0x19/0xa0 [i915]
<4> [114.186928] hwm_energy+0x55/0x100 [i915]
<4> [114.187132] hwm_read+0x9a/0x310 [i915]
<4> [114.187375] hwmon_attr_show+0x36/0x120
<4> [114.187380] dev_attr_show+0x15/0x60
<4> [114.187383] sysfs_kf_seq_show+0xb5/0x100
<4> [114.187387] seq_read_iter+0x111/0x450
<4> [114.187391] vfs_read+0x206/0x340
<4> [114.187394] ksys_read+0x64/0xe0
<4> [114.187396] do_syscall_64+0x6f/0x140
<4> [114.187399] entry_SYSCALL_64_after_hwframe+0x6e/0x76