laptop GPU nondeterministic hang(?) stuck in powered state
My laptop has an NVIDIA Corporation TU117GLM [T600 Laptop GPU] [10de:1fb6] as a second GPU. It is usually unused but periodically powers up, typically when starting some software (such as some games), probably because that software tries to enumerate all GPUs. Sometimes (nondeterministically: anywhere from 10 minutes after boot to a few days later) the system gets into a state where the GPU cannot power down (/sys/bus/pci/devices/0000:01:00.0/power/runtime_status contains error), which significantly reduces battery life.
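For anyone trying to confirm the same symptom, here is a minimal sketch of how the stuck state could be checked from a script. It only reads the sysfs file mentioned above; the helper names `read_runtime_status` and `is_stuck` and the `sysfs_root` parameter are made up for illustration, and the PCI address is the one from this machine and will likely differ elsewhere.

```python
from pathlib import Path


def read_runtime_status(pci_address, sysfs_root="/sys/bus/pci/devices"):
    """Return the runtime PM status string of a PCI device, or None if unreadable.

    sysfs_root is a hypothetical parameter so the function can be pointed
    at a different directory (for example in a test).
    """
    status_file = Path(sysfs_root) / pci_address / "power" / "runtime_status"
    try:
        return status_file.read_text().strip()
    except OSError:
        # Device absent, or sysfs not available on this system.
        return None


def is_stuck(status):
    """The problem described here shows up as the literal status 'error'."""
    return status == "error"


if __name__ == "__main__":
    # 0000:01:00.0 is the address of the GPU on the laptop in this report.
    status = read_runtime_status("0000:01:00.0")
    print(f"runtime_status: {status}")
    if is_stuck(status):
        print("runtime PM is in the error state; the GPU will not power down")
```

On a healthy system the file should normally read `suspended` while the GPU is idle and `active` while it is in use.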
Here is the nouveau-related part of the kernel log from the previous boot that had this problem:
Kernel log extract
[ 10.587830] kernel: VGA switcheroo: detected Optimus DSM method \_SB_.PC00.PEG1.PEGP handle
[ 10.587842] kernel: nouveau: detected PR support, will not use DSM
[ 10.587850] kernel: nouveau 0000:01:00.0: enabling device (0006 -> 0007)
[..]
[ 10.588014] kernel: nouveau 0000:01:00.0: NVIDIA TU117 (167000a1)
[ 10.588071] kernel: Loading firmware: nvidia/tu117/gsp/gsp-535.113.01.bin
[..]
[ 10.588253] kernel: Loading firmware: nvidia/tu117/gsp/booter_load-535.113.01.bin
[ 10.588260] kernel: Loading firmware: nvidia/tu117/gsp/booter_unload-535.113.01.bin
[ 10.588267] kernel: Loading firmware: nvidia/tu117/gsp/bootloader-535.113.01.bin
[..]
[ 10.588354] kernel: nouveau 0000:01:00.0: bios: version 90.17.94.00.65
[ 10.588412] kernel: Loading firmware: nvidia/tu117/acr/bl.bin
[..]
[ 10.590389] kernel: nouveau 0000:01:00.0: drm: VRAM: 4096 MiB
[ 10.590440] kernel: nouveau 0000:01:00.0: drm: GART: 536870912 MiB
[ 10.590489] kernel: nouveau 0000:01:00.0: drm: MM: using COPY for buffer copies
[ 10.590538] kernel: [drm] Initialized nouveau 1.4.0 for 0000:01:00.0 on minor 1
[ 10.590545] kernel: nouveau 0000:01:00.0: [drm] No compatible format found
[ 10.590593] kernel: nouveau 0000:01:00.0: [drm] Cannot find any crtc or sizes
[.. this line repeats a number of times at different timestamps] kernel: Loading firmware: nvidia/tu117/acr/bl.bin
[ 22.365976] kernel: nouveau 0000:01:00.0: gsp: intr 00008000
[.. and a few more lines loading bl.bin]
[ 87.823652] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:0 type:45 scope:1 part:233
[ 87.823880] kernel: nouveau 0000:01:00.0: fifo:000000:0000:0000:[(udev-worker)[819]] errored - disabling channel
[ 87.823997] kernel: nouveau 0000:01:00.0: DRM: channel 0 killed!
[ 87.824103] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:8 type:45 scope:1 part:233
[ 87.824205] kernel: nouveau 0000:01:00.0: fifo:bd207ad:0001:0008:[(udev-worker)[819]] errored - disabling channel
[ 87.824307] kernel: nouveau 0000:01:00.0: DRM: channel 8 killed!
[ 87.824406] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:16 type:45 scope:1 part:233
[ 87.824546] kernel: nouveau 0000:01:00.0: fifo:000000:0002:0010:[X[1265]] errored - disabling channel
[ 87.824630] kernel: nouveau 0000:01:00.0: X[1265]: channel 16 killed!
[ 87.824687] kernel: ------------[ cut here ]------------
[ 87.824695] kernel: WARNING: CPU: 0 PID: 4223 at include/linux/dma-fence.h:581 nouveau_fence_context_kill+0x9d/0xb0 [nouveau]
[ 87.824702] kernel: Modules linked in: snd_seq_dummy snd_hrtimer snd_seq snd_seq_device rfcomm michael_mic uhid vfat fat bnep qrtr_mhi snd_soc_skl_hda_dsp snd_soc_intel_sof_board_helpers snd_soc_intel_hda_dsp_common snd_s>
[ 87.824753] kernel: snd_hda_codec videobuf2_v4l2 mxm_wmi btrtl processor_thermal_device_pci qmi_helpers i915 videobuf2_common snd_hda_core btintel processor_thermal_device kvm drm_gpuvm iTCO_wdt snd_hwdep btbcm mac80211 >
[ 87.824774] kernel: nfnetlink dm_crypt crct10dif_pclmul crc32_pclmul rtsx_pci_sdmmc crc32c_intel polyval_clmulni polyval_generic mmc_core ghash_clmulni_intel sha512_ssse3 sha256_ssse3 sha1_ssse3 nvme rtsx_pci ucsi_acpi n>
[ 87.824786] kernel: CPU: 0 UID: 1000 PID: 4223 Comm: steamwebhelper Not tainted 6.12.4-gentoo-dist #1
[ 87.824795] kernel: Hardware name: LENOVO 21D8A00ACD/21D8A00ACD, BIOS N3EET39W (1.25 ) 07/03/2024
[ 87.824802] kernel: RIP: 0010:nouveau_fence_context_kill+0x9d/0xb0 [nouveau]
[ 87.824808] kernel: Code: 48 8b 43 08 4c 39 e0 75 c1 c7 83 e0 00 00 00 01 00 00 00 4c 89 f6 48 89 df 5b 5d 41 5c 41 5d 41 5e e9 87 ab c8 ef 0f 0b eb be <0f> 0b eb b1 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 90 90 90
[ 87.824815] kernel: RSP: 0018:ffffba02cd4bb5e8 EFLAGS: 00010002
[ 87.824825] kernel: RAX: ffff8e1d417b1400 RBX: ffff8e1d41769d00 RCX: 0000000000000027
[ 87.824833] kernel: RDX: 0000000000000001 RSI: 00000000ffffffed RDI: ffff8e1d417b13c0
[ 87.824840] kernel: RBP: 00000000ffffffed R08: 0000000000000000 R09: 0000000000000000
[ 87.824846] kernel: R10: 0000000000000010 R11: 0000000000000000 R12: ffff8e1d41769d08
[ 87.824854] kernel: R13: 0000000000000fec R14: 0000000000000086 R15: 0000000000000082
[ 87.824862] kernel: FS: 00007f8e26c82140(0000) GS:ffff8e2c5ee00000(0000) knlGS:0000000000000000
[ 87.824868] kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 87.824874] kernel: CR2: 00007f8e1aa4ce90 CR3: 00000001365f6000 CR4: 0000000000f52ef0
[ 87.824882] kernel: PKRU: 55555558
[ 87.824889] kernel: Call Trace:
[ 87.824894] kernel: <TASK>
[ 87.824901] kernel: ? nouveau_fence_context_kill+0x9d/0xb0 [nouveau]
[ 87.824907] kernel: ? __warn.cold+0x93/0xfa
[ 87.824916] kernel: ? nouveau_fence_context_kill+0x9d/0xb0 [nouveau]
[ 87.824922] kernel: ? report_bug+0xff/0x140
[ 87.824926] kernel: ? handle_bug+0x58/0x90
[ 87.824932] kernel: ? exc_invalid_op+0x17/0x70
[ 87.824938] kernel: ? asm_exc_invalid_op+0x1a/0x20
[ 87.824944] kernel: ? nouveau_fence_context_kill+0x9d/0xb0 [nouveau]
[ 87.824950] kernel: ? nouveau_fence_context_kill+0x26/0xb0 [nouveau]
[ 87.824959] kernel: nvkm_mm_dump+0x17973/0x17a90 [nouveau]
[ 87.824969] kernel: nouveau_led_fini+0x5f/0x2e0 [nouveau]
[ 87.824978] kernel: nvkm_event_ntfy+0xa2/0xe0 [nouveau]
[ 87.824983] kernel: nvkm_chan_error+0x79/0xa0 [nouveau]
[ 87.824987] kernel: ad102_gsp_new+0x758/0x2590 [nouveau]
[ 87.824995] kernel: ad102_gsp_new+0xd6e/0x2590 [nouveau]
[ 87.825001] kernel: ? nvkm_instobj_new+0xfe/0x190 [nouveau]
[ 87.825010] kernel: ad102_gsp_new+0x193b/0x2590 [nouveau]
[ 87.825017] kernel: ad102_gsp_new+0x1b27/0x2590 [nouveau]
[ 87.825025] kernel: ad102_gsp_new+0x256/0x2590 [nouveau]
[ 87.825031] kernel: ad102_gsp_new+0x12a4/0x2590 [nouveau]
[ 87.825039] kernel: tu102_mmu_new+0xf3/0x3d0 [nouveau]
[ 87.825046] kernel: nvkm_uvmm_new+0x137/0x7c0 [nouveau]
[ 87.825053] kernel: nvkm_intr_rearm+0x713/0x850 [nouveau]
[ 87.825059] kernel: ? __pfx_nvkm_uvmm_new+0x10/0x10 [nouveau]
[ 87.825067] kernel: nvkm_ioctl+0xbc/0x190 [nouveau]
[ 87.825073] kernel: nvif_object_ctor+0x117/0x1a0 [nouveau]
[ 87.825080] kernel: nvif_vmm_ctor+0x13c/0x2a0 [nouveau]
[ 87.825088] kernel: nouveau_vmm_init+0x3a/0x2f0 [nouveau]
[ 87.825094] kernel: nouveau_drm_ioctl+0x390/0x1cb0 [nouveau]
[ 87.825101] kernel: nouveau_drm_ioctl+0x836/0x1cb0 [nouveau]
[ 87.825109] kernel: drm_file_alloc+0x1e3/0x290
[ 87.825117] kernel: drm_open_helper+0x8b/0x150
[ 87.825123] kernel: drm_open+0x73/0x110
[ 87.825130] kernel: drm_stub_open+0x9b/0xd0
[ 87.825139] kernel: chrdev_open+0xb2/0x230
[ 87.825146] kernel: ? __pfx_chrdev_open+0x10/0x10
[ 87.825153] kernel: do_dentry_open+0x14c/0x4a0
[ 87.825160] kernel: vfs_open+0x2e/0xe0
[ 87.825166] kernel: path_openat+0x82e/0x12d0
[ 87.825173] kernel: do_filp_open+0xc4/0x170
[ 87.825181] kernel: do_sys_openat2+0xae/0xe0
[ 87.825190] kernel: __x64_sys_openat+0x55/0xa0
[ 87.825200] kernel: do_syscall_64+0x82/0x190
[ 87.825207] kernel: ? exc_page_fault+0x7e/0x180
[ 87.825214] kernel: entry_SYSCALL_64_after_hwframe+0x76/0x7e
[ 87.825220] kernel: RIP: 0033:0x7f8e28917270
[ 87.825227] kernel: Code: 48 89 44 24 20 75 92 44 89 54 24 0c e8 d9 84 f8 ff 44 8b 54 24 0c 89 da 48 89 ee 41 89 c0 bf 9c ff ff ff b8 01 01 00 00 0f 05 <48> 3d 00 f0 ff ff 77 38 44 89 c7 89 44 24 0c e8 2c 85 f8 ff 8b 44
[ 87.825234] kernel: RSP: 002b:00007ffc6b761a20 EFLAGS: 00000293 ORIG_RAX: 0000000000000101
[ 87.825242] kernel: RAX: ffffffffffffffda RBX: 0000000000080002 RCX: 00007f8e28917270
[ 87.825248] kernel: RDX: 0000000000080002 RSI: 0000564efaf629a8 RDI: 00000000ffffff9c
[ 87.825257] kernel: RBP: 0000564efaf629a8 R08: 0000000000000000 R09: 0000564efae61010
[ 87.825263] kernel: R10: 0000000000000000 R11: 0000000000000293 R12: 0000000000000000
[ 87.825271] kernel: R13: 0000564efaf25000 R14: 0000000000000002 R15: 00007ffc6b7630b8
[ 87.825279] kernel: </TASK>
[ 87.825284] kernel: ---[ end trace 0000000000000000 ]---
[ 87.825291] kernel: nouveau 0000:01:00.0: gsp:msg fn:103 len:0x78/0x58 res:0x62 resp:0x62
[ 87.825343] kernel: msg: 00000000: 03 00 d0 c1 03 00 d0 c1 00 00 1d de 80 00 00 00 ................
[ 87.825353] kernel: msg: 00000010: 62 00 00 00 38 00 00 00 00 00 00 00 00 00 00 00 b...8...........
[ 87.825360] kernel: msg: 00000020: 00 00 00 00 03 00 d0 c1 00 00 00 00 00 00 00 00 ................
[ 87.825370] kernel: msg: 00000030: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
[ 87.825377] kernel: msg: 00000040: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
[ 87.825382] kernel: msg: 00000050: 00 00 00 00 00 00 00 00 ........
[ 87.825388] kernel: nouveau 0000:01:00.0: steamwebhelper[4223]: VMM allocation failed: -22
[ 88.616821] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:0 type:45 scope:1 part:233
[ 88.617008] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:8 type:45 scope:1 part:233
[ 88.617069] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:16 type:45 scope:1 part:233
[ 88.617117] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:0 type:45 scope:1 part:233
[ 88.617162] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:8 type:45 scope:1 part:233
[ 88.620159] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:16 type:45 scope:1 part:233
[ 88.620323] kernel: nouveau 0000:01:00.0: gsp:msg fn:103 len:0x78/0x58 res:0x62 resp:0x62
[ 88.620383] kernel: msg: 00000000: 03 00 d0 c1 03 00 d0 c1 00 00 1d de 80 00 00 00 ................
[ 88.620392] kernel: msg: 00000010: 62 00 00 00 38 00 00 00 00 00 00 00 00 00 00 00 b...8...........
[ 88.620399] kernel: msg: 00000020: 00 00 00 00 03 00 d0 c1 00 00 00 00 00 00 00 00 ................
[ 88.620404] kernel: msg: 00000030: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
[ 88.620410] kernel: msg: 00000040: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
[ 88.620415] kernel: msg: 00000050: 00 00 00 00 00 00 00 00 ........
[ 88.620421] kernel: nouveau 0000:01:00.0: vulkandriverque[4299]: VMM allocation failed: -22
[ 89.146732] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:0 type:45 scope:1 part:233
[ 89.146900] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:8 type:45 scope:1 part:233
[ 89.146960] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:16 type:45 scope:1 part:233
[ 89.147009] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:0 type:45 scope:1 part:233
[ 89.150131] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:8 type:45 scope:1 part:233
[ 89.150236] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:16 type:45 scope:1 part:233
[ 89.150296] kernel: nouveau 0000:01:00.0: gsp:msg fn:103 len:0x78/0x58 res:0x62 resp:0x62
[ 89.150348] kernel: msg: 00000000: 03 00 d0 c1 03 00 d0 c1 00 00 1d de 80 00 00 00 ................
[ 89.150355] kernel: msg: 00000010: 62 00 00 00 38 00 00 00 00 00 00 00 00 00 00 00 b...8...........
[ 89.150361] kernel: msg: 00000020: 00 00 00 00 03 00 d0 c1 00 00 00 00 00 00 00 00 ................
[ 89.150368] kernel: msg: 00000030: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
[ 89.150374] kernel: msg: 00000040: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
[ 89.150379] kernel: msg: 00000050: 00 00 00 00 00 00 00 00 ........
[ 89.153458] kernel: nouveau 0000:01:00.0: vulkandriverque[4325]: VMM allocation failed: -22
[ 95.406738] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:0 type:45 scope:1 part:233
[ 95.406926] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:8 type:45 scope:1 part:233
[ 95.406993] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:16 type:45 scope:1 part:233
[ 95.407049] kernel: nouveau 0000:01:00.0: gsp:msg fn:103 len:0x78/0x58 res:0x62 resp:0x62
[ 95.407100] kernel: msg: 00000000: 03 00 d0 c1 03 00 d0 c1 00 00 1d de 80 00 00 00 ................
[ 95.407110] kernel: msg: 00000010: 62 00 00 00 38 00 00 00 00 00 00 00 00 00 00 00 b...8...........
[ 95.407118] kernel: msg: 00000020: 00 00 00 00 03 00 d0 c1 00 00 00 00 00 00 00 00 ................
[ 95.407125] kernel: msg: 00000030: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
[ 95.407130] kernel: msg: 00000040: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
[ 95.407136] kernel: msg: 00000050: 00 00 00 00 00 00 00 00 ........
[ 95.407141] kernel: nouveau 0000:01:00.0: imem: suspend failed, -22
[ 95.407194] kernel: nouveau: drm:00000000:00000080: suspend failed with -22
[ 95.419999] kernel: nouveau 0000:01:00.0: can't suspend (nouveau_drm_ioctl [nouveau] returned -22)
[.. the following happened when shutting down]
[60804.404026] kernel: ------------[ cut here ]------------
[60804.404225] kernel: WARNING: CPU: 5 PID: 1265 at drivers/gpu/drm/nouveau/nouveau_gem.c:210 nouveau_gem_object_close+0x283/0x2f0 [nouveau]
[60804.404271] kernel: Modules linked in: uinput snd_seq_dummy snd_hrtimer snd_seq snd_seq_device rfcomm michael_mic uhid vfat fat bnep qrtr_mhi snd_soc_skl_hda_dsp snd_soc_intel_sof_board_helpers snd_soc_intel_hda_dsp_commo>
[60804.404373] kernel: kvm_intel snd_hda_codec videobuf2_v4l2 mxm_wmi btrtl processor_thermal_device_pci qmi_helpers i915 videobuf2_common snd_hda_core btintel processor_thermal_device kvm drm_gpuvm iTCO_wdt snd_hwdep btbcm>
[60804.404463] kernel: loop fuse nfnetlink dm_crypt crct10dif_pclmul crc32_pclmul rtsx_pci_sdmmc crc32c_intel polyval_clmulni polyval_generic mmc_core ghash_clmulni_intel sha512_ssse3 sha256_ssse3 sha1_ssse3 nvme rtsx_pci u>
[60804.404508] kernel: CPU: 5 UID: 0 PID: 1265 Comm: X Tainted: G W 6.12.4-gentoo-dist #1
[60804.404541] kernel: Tainted: [W]=WARN
[60804.404575] kernel: Hardware name: LENOVO 21D8A00ACD/21D8A00ACD, BIOS N3EET39W (1.25 ) 07/03/2024
[60804.404604] kernel: RIP: 0010:nouveau_gem_object_close+0x283/0x2f0 [nouveau]
[60804.404636] kernel: Code: 48 8b 45 18 48 89 42 08 48 89 10 4c 89 6d 10 4c 89 6d 18 48 8d 7d 30 48 89 2c 24 e8 77 7c 03 00 48 89 e7 e8 9f 33 00 00 eb a3 <0f> 0b eb ac e8 a4 16 cb ef 31 f6 ba 58 02 00 00 4c 89 ef e8 15 b1
[60804.404675] kernel: RSP: 0018:ffffba02c17d7a28 EFLAGS: 00010283
[60804.404712] kernel: RAX: 00000000ffffffea RBX: ffff8e1d580b3400 RCX: ffff8e1d40050a00
[60804.404753] kernel: RDX: ffffffffb0d28f90 RSI: 0000000000000202 RDI: ffff8e1d433b71ac
[60804.404792] kernel: RBP: ffff8e1d4b9cb240 R08: ffff8e1d40052290 R09: 000000000040003f
[60804.404829] kernel: R10: 000000000040003f R11: 0000000000000000 R12: ffff8e1d433b70c8
[60804.404860] kernel: R13: 0000000000000001 R14: ffff8e1d4b9a2850 R15: ffff8e1d5e839270
[60804.404890] kernel: FS: 00007ff85c4a75c0(0000) GS:ffff8e2c5f080000(0000) knlGS:0000000000000000
[60804.404921] kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[60804.404960] kernel: CR2: 00007ff85b0b79d0 CR3: 00000001214b2000 CR4: 0000000000f52ef0
[60804.404990] kernel: PKRU: 55555554
[60804.405022] kernel: Call Trace:
[60804.405052] kernel: <TASK>
[60804.405088] kernel: ? nouveau_gem_object_close+0x283/0x2f0 [nouveau]
[60804.405119] kernel: ? __warn.cold+0x93/0xfa
[60804.405144] kernel: ? nouveau_gem_object_close+0x283/0x2f0 [nouveau]
[60804.405173] kernel: ? report_bug+0xff/0x140
[60804.405202] kernel: ? handle_bug+0x58/0x90
[60804.405235] kernel: ? exc_invalid_op+0x17/0x70
[60804.405264] kernel: ? asm_exc_invalid_op+0x1a/0x20
[60804.405289] kernel: ? rpm_resume+0x310/0x700
[60804.405322] kernel: ? nouveau_gem_object_close+0x283/0x2f0 [nouveau]
[60804.405346] kernel: ? nouveau_gem_object_close+0x179/0x2f0 [nouveau]
[60804.405375] kernel: drm_gem_object_release_handle+0x2e/0x60
[60804.405403] kernel: ? __pfx_drm_gem_object_release_handle+0x10/0x10
[60804.405436] kernel: idr_for_each+0x6f/0xf0
[60804.405464] kernel: drm_gem_release+0x20/0x30
[60804.405492] kernel: drm_file_free+0x1f7/0x270
[60804.405524] kernel: drm_release+0x5e/0xd0
[60804.405552] kernel: __fput+0xdc/0x2a0
[60804.405580] kernel: __x64_sys_close+0x3c/0x80
[60804.405618] kernel: do_syscall_64+0x82/0x190
[60804.405650] kernel: ? syscall_exit_to_user_mode+0x10/0x200
[60804.405678] kernel: ? do_syscall_64+0x8e/0x190
[60804.405715] kernel: ? __rseq_handle_notify_resume+0xa6/0x4e0
[60804.405753] kernel: ? task_work_run+0x59/0x90
[60804.405782] kernel: ? syscall_exit_to_user_mode+0x165/0x200
[60804.405816] kernel: ? do_syscall_64+0x8e/0x190
[60804.405834] kernel: ? __rseq_handle_notify_resume+0xa6/0x4e0
[60804.405858] kernel: ? syscall_exit_to_user_mode+0x1c3/0x200
[60804.405884] kernel: ? do_syscall_64+0x8e/0x190
[60804.405913] kernel: ? syscall_exit_to_user_mode+0x10/0x200
[60804.405930] kernel: ? do_syscall_64+0x8e/0x190
[60804.405950] kernel: ? exc_page_fault+0x7e/0x180
[60804.405973] kernel: entry_SYSCALL_64_after_hwframe+0x76/0x7e
[60804.405995] kernel: RIP: 0033:0x7ff85ca4ff3a
[60804.406025] kernel: Code: 00 00 0f 05 48 3d 00 f0 ff ff 77 44 c3 0f 1f 00 48 83 ec 18 89 7c 24 0c e8 03 c8 f8 ff 8b 7c 24 0c 89 c2 b8 03 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 36 89 d7 89 44 24 0c e8 63 c8 f8 ff 8b 44 24
[60804.406052] kernel: RSP: 002b:00007ffc51cb4f10 EFLAGS: 00000293 ORIG_RAX: 0000000000000003
[60804.406081] kernel: RAX: ffffffffffffffda RBX: 000055be43790cb0 RCX: 00007ff85ca4ff3a
[60804.406115] kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000020
[60804.406144] kernel: RBP: 000055be43ff7cb0 R08: 000000055be42b1a R09: 000055be4294d010
[60804.406169] kernel: R10: 0000000000000007 R11: 0000000000000293 R12: 000055be43ff6ae0
[60804.406195] kernel: R13: 0000000000000001 R14: 000055be4406ad20 R15: 000055be20f5ed80
[60804.406220] kernel: </TASK>
[60804.406245] kernel: ---[ end trace 0000000000000000 ]---
[60804.406266] kernel: ------------[ cut here ]------------
[60804.406286] kernel: WARNING: CPU: 5 PID: 1265 at drivers/gpu/drm/nouveau/nouveau_gem.c:85 nouveau_gem_object_del+0x6f/0x330 [nouveau]
[60804.406311] kernel: Modules linked in: uinput snd_seq_dummy snd_hrtimer snd_seq snd_seq_device rfcomm michael_mic uhid vfat fat bnep qrtr_mhi snd_soc_skl_hda_dsp snd_soc_intel_sof_board_helpers snd_soc_intel_hda_dsp_commo>
[60804.406369] kernel: kvm_intel snd_hda_codec videobuf2_v4l2 mxm_wmi btrtl processor_thermal_device_pci qmi_helpers i915 videobuf2_common snd_hda_core btintel processor_thermal_device kvm drm_gpuvm iTCO_wdt snd_hwdep btbcm>
[60804.406404] kernel: loop fuse nfnetlink dm_crypt crct10dif_pclmul crc32_pclmul rtsx_pci_sdmmc crc32c_intel polyval_clmulni polyval_generic mmc_core ghash_clmulni_intel sha512_ssse3 sha256_ssse3 sha1_ssse3 nvme rtsx_pci u>
[60804.406432] kernel: CPU: 5 UID: 0 PID: 1265 Comm: X Tainted: G W 6.12.4-gentoo-dist #1
[60804.406453] kernel: Tainted: [W]=WARN
[60804.406475] kernel: Hardware name: LENOVO 21D8A00ACD/21D8A00ACD, BIOS N3EET39W (1.25 ) 07/03/2024
[60804.406496] kernel: RIP: 0010:nouveau_gem_object_del+0x6f/0x330 [nouveau]
[60804.406524] kernel: Code: 89 df e8 d4 39 65 ef 48 89 df e8 dc 4c 29 ff e8 97 ac b6 ee 48 89 ef be 0d 00 00 00 48 89 85 d8 01 00 00 5b 5d e9 91 fa 69 ef <0f> 0b be 0d 00 00 00 48 89 ef 5b 5d e9 80 fa 69 ef 90 90 90 90 90
[60804.406549] kernel: RSP: 0018:ffffba02c17d7a50 EFLAGS: 00010283
[60804.406577] kernel: RAX: 00000000ffffffea RBX: ffff8e1d580b3400 RCX: 000000000040003e
[60804.406602] kernel: RDX: ffffffffb0d28f90 RSI: 0000000000000202 RDI: ffff8e1d433b71ac
[60804.406622] kernel: RBP: ffff8e1d433b70c8 R08: ffff8e1d4349a800 R09: 000000000040003e
[60804.406650] kernel: R10: 000000000040003e R11: 0000000000000000 R12: 0000000000000002
[60804.406675] kernel: R13: 0000000000000001 R14: ffff8e1d4b9a2850 R15: ffff8e1d5e839270
[60804.406697] kernel: FS: 00007ff85c4a75c0(0000) GS:ffff8e2c5f080000(0000) knlGS:0000000000000000
[60804.406718] kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[60804.406739] kernel: CR2: 00007ff85b0b79d0 CR3: 00000001214b2000 CR4: 0000000000f52ef0
[60804.406761] kernel: PKRU: 55555554
[60804.406781] kernel: Call Trace:
[60804.406802] kernel: <TASK>
[60804.406823] kernel: ? nouveau_gem_object_del+0x6f/0x330 [nouveau]
[60804.406847] kernel: ? __warn.cold+0x93/0xfa
[60804.406869] kernel: ? nouveau_gem_object_del+0x6f/0x330 [nouveau]
[60804.406888] kernel: ? report_bug+0xff/0x140
[60804.406906] kernel: ? handle_bug+0x58/0x90
[60804.406925] kernel: ? exc_invalid_op+0x17/0x70
[60804.406945] kernel: ? asm_exc_invalid_op+0x1a/0x20
[60804.406967] kernel: ? rpm_resume+0x310/0x700
[60804.406990] kernel: ? nouveau_gem_object_del+0x6f/0x330 [nouveau]
[60804.407010] kernel: drm_gem_object_release_handle+0x54/0x60
[60804.407035] kernel: ? __pfx_drm_gem_object_release_handle+0x10/0x10
[60804.407059] kernel: idr_for_each+0x6f/0xf0
[60804.407078] kernel: drm_gem_release+0x20/0x30
[60804.407098] kernel: drm_file_free+0x1f7/0x270
[60804.407116] kernel: drm_release+0x5e/0xd0
[60804.407135] kernel: __fput+0xdc/0x2a0
[60804.407154] kernel: __x64_sys_close+0x3c/0x80
[60804.407169] kernel: do_syscall_64+0x82/0x190
[60804.407186] kernel: ? syscall_exit_to_user_mode+0x10/0x200
[60804.407207] kernel: ? do_syscall_64+0x8e/0x190
[60804.407225] kernel: ? __rseq_handle_notify_resume+0xa6/0x4e0
[60804.407243] kernel: ? task_work_run+0x59/0x90
[60804.407280] kernel: ? syscall_exit_to_user_mode+0x165/0x200
[60804.407300] kernel: ? do_syscall_64+0x8e/0x190
[60804.407323] kernel: ? __rseq_handle_notify_resume+0xa6/0x4e0
[60804.407336] kernel: ? syscall_exit_to_user_mode+0x1c3/0x200
[60804.407356] kernel: ? do_syscall_64+0x8e/0x190
[60804.407375] kernel: ? syscall_exit_to_user_mode+0x10/0x200
[60804.407397] kernel: ? do_syscall_64+0x8e/0x190
[60804.407411] kernel: ? exc_page_fault+0x7e/0x180
[60804.407429] kernel: entry_SYSCALL_64_after_hwframe+0x76/0x7e
[60804.407451] kernel: RIP: 0033:0x7ff85ca4ff3a
[60804.407472] kernel: Code: 00 00 0f 05 48 3d 00 f0 ff ff 77 44 c3 0f 1f 00 48 83 ec 18 89 7c 24 0c e8 03 c8 f8 ff 8b 7c 24 0c 89 c2 b8 03 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 36 89 d7 89 44 24 0c e8 63 c8 f8 ff 8b 44 24
[60804.407492] kernel: RSP: 002b:00007ffc51cb4f10 EFLAGS: 00000293 ORIG_RAX: 0000000000000003
[60804.407506] kernel: RAX: ffffffffffffffda RBX: 000055be43790cb0 RCX: 00007ff85ca4ff3a
[60804.407524] kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000020
[60804.407542] kernel: RBP: 000055be43ff7cb0 R08: 000000055be42b1a R09: 000055be4294d010
[60804.407561] kernel: R10: 0000000000000007 R11: 0000000000000293 R12: 000055be43ff6ae0
[60804.407579] kernel: R13: 0000000000000001 R14: 000055be4406ad20 R15: 000055be20f5ed80
[60804.407597] kernel: </TASK>
[60804.407615] kernel: ---[ end trace 0000000000000000 ]---
[.. these repeated with minor variations 15 times or so]
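Since the log above is long and repetitive, a rough way to summarize it is to count the `gsp: rc` events per channel and the `VMM allocation failed` messages per process. The following is only a sketch: the regular expressions are written against the exact message formats shown in this extract and are an assumption, not something taken from the nouveau sources, and `summarize` is a hypothetical helper name.

```python
import re
from collections import Counter

# Patterns modelled on the lines in this log extract (an assumption).
RC_RE = re.compile(r"gsp: rc engn:(\w+) chid:(\d+) type:(\d+) scope:(\d+) part:(\d+)")
VMM_RE = re.compile(r"nouveau \S+: (\S+)\[(\d+)\]: VMM allocation failed: (-?\d+)")


def summarize(lines):
    """Count 'gsp: rc' events per channel id and VMM failures per process name."""
    rc_by_chid = Counter()
    vmm_failures = Counter()
    for line in lines:
        match = RC_RE.search(line)
        if match:
            rc_by_chid[int(match.group(2))] += 1
            continue
        match = VMM_RE.search(line)
        if match:
            vmm_failures[match.group(1)] += 1
    return rc_by_chid, vmm_failures


if __name__ == "__main__":
    # A few lines copied from the extract above.
    sample = [
        "[ 87.823652] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:0 type:45 scope:1 part:233",
        "[ 87.824103] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:8 type:45 scope:1 part:233",
        "[ 87.825388] kernel: nouveau 0000:01:00.0: steamwebhelper[4223]: VMM allocation failed: -22",
        "[ 88.616821] kernel: nouveau 0000:01:00.0: gsp: rc engn:00000001 chid:0 type:45 scope:1 part:233",
    ]
    rc_counts, vmm_counts = summarize(sample)
    print("rc events per channel:", dict(rc_counts))
    print("VMM allocation failures per process:", dict(vmm_counts))
```

The same function could be fed the full kernel log of the affected boot, for example the output of `journalctl -k -b -1` saved to a file and read line by line.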
Hardware: ThinkPad P15v
Software: Gentoo Linux, kernel 6.12.4, mesa 24.3.1
I have seen problems like this with kernel 6.11 and likely with earlier versions too.