5.16.3 nouveau.ko crash, 5.16.2 was good
after update to 5.16.3, nouveau crash a lot, 5.16.2 was good.
hardware: Dell XPS 17 9700
GPU list:
00:02.0 VGA compatible controller [0300]: Intel Corporation CometLake-H GT2 [UHD Graphics] [8086:9bc4] (rev 05)
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation TU106M [GeForce RTX 2060 Max-Q] [10de:1f12] (rev a1)
dmesg output
[ 33.066771] nouveau 0000:01:00.0: acr: unload binary failed
[ 72.530462] ------------[ cut here ]------------
[ 72.530464] nouveau 0000:01:00.0: timeout
[ 72.530476] WARNING: CPU: 4 PID: 33 at drivers/gpu/drm/nouveau/nvkm/falcon/v1.c:247 nvkm_falcon_v1_wait_for_halt+0xb1/0xc0 [nouveau]
[ 72.530499] Modules linked in: fuse 8021q garp mrp iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv4 ip_tables x_tables efivarfs binfmt_misc hid_multitouch cdc_mbim cdc_wdm cdc_ncm cdc_ether usbnet mii dell_smm_hwmon coretemp iwlmvm rtsx_pci_sdmmc mmc_core intel_tcc_cooling dell_laptop x86_pkg_temp_thermal i2c_designware_platform dell_wmi i2c_designware_core sparse_keymap hid_generic ledtrig_audio intel_powerclamp ucsi_ccg dell_wmi_sysman dell_smbios mac80211 firmware_attributes_class dell_wmi_descriptor wmi_bmof libarc4 nouveau i915 dcdbas kvm_intel snd_hda_codec_hdmi iwlwifi snd_hda_intel drm_ttm_helper mxm_wmi snd_intel_dspcfg kvm hwmon uvcvideo snd_hda_codec videobuf2_vmalloc i2c_algo_bit cfg80211 ttm videobuf2_memops videobuf2_v4l2 snd_hwdep irqbypass videobuf2_common psmouse evdev crc32c_intel drm_kms_helper snd_hda_core serio_raw efi_pstore videodev i2c_i801 snd_pcm i2c_smbus syscopyarea sysfillrect rtsx_pci intel_lpss_pci sysimgblt snd_timer intel_lpss fb_sys_fops
[ 72.530524] idma64 mc drm usbhid mfd_core rfkill snd intel_pch_thermal i2c_nvidia_gpu soundcore processor_thermal_device_pci_legacy intel_soc_dts_iosf processor_thermal_device intel_gtt processor_thermal_rfim agpgart processor_thermal_mbox ucsi_acpi i2c_hid_acpi typec_ucsi i2c_hid hid roles typec wmi int3403_thermal int340x_thermal_zone video battery button int3400_thermal acpi_pad acpi_thermal_rel ac nvme nvme_core usb_storage
[ 72.530536] CPU: 4 PID: 33 Comm: kworker/4:0 Not tainted 5.16.3-dell-1 #1
[ 72.530538] Hardware name: Dell Inc. XPS 17 9700/0P1CHN, BIOS 1.11.1 11/18/2021
[ 72.530539] Workqueue: pm pm_runtime_work
[ 72.530542] RIP: 0010:nvkm_falcon_v1_wait_for_halt+0xb1/0xc0 [nouveau]
[ 72.530557] Code: 8b 40 10 48 8b 78 10 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 a1 1f b3 df 4c 89 e2 48 c7 c7 32 81 c4 a1 48 89 c6 e8 5c e6 da df <0f> 0b eb ad e8 16 cd de df 66 0f 1f 44 00 00 0f 1f 44 00 00 41 54
[ 72.530558] RSP: 0018:ffffc90000223a20 EFLAGS: 00010282
[ 72.530559] RAX: 0000000000000000 RBX: ffffffffffffff92 RCX: 0000000000000000
[ 72.530559] RDX: 0000000000000001 RSI: ffffffff81f882d1 RDI: 00000000ffffffff
[ 72.530560] RBP: ffff8881044c0098 R08: ffffffff82333728 R09: 00000000ffffdfff
[ 72.530561] R10: ffffffff82253740 R11: ffffffff82253740 R12: ffff888101c35890
[ 72.530561] R13: 0000000000000000 R14: 0000000000000000 R15: ffff888103971800
[ 72.530562] FS: 0000000000000000(0000) GS:ffff88887d500000(0000) knlGS:0000000000000000
[ 72.530563] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 72.530563] CR2: 000000000202d1a8 CR3: 000000010755c005 CR4: 00000000007706e0
[ 72.530564] PKRU: 55555554
[ 72.530564] Call Trace:
[ 72.530566] <TASK>
[ 72.530567] gm200_acr_hsfw_boot+0xc6/0x160 [nouveau]
[ 72.530582] nvkm_acr_hsf_boot+0x82/0xe0 [nouveau]
[ 72.530596] tu102_acr_init+0x15/0x30 [nouveau]
[ 72.530610] nvkm_acr_load+0x3e/0x100 [nouveau]
[ 72.530623] ? nvkm_notify_get+0x54/0x60 [nouveau]
[ 72.530635] ? ktime_get+0x35/0x90
[ 72.530637] nvkm_subdev_init+0x8f/0xd0 [nouveau]
[ 72.530650] ? ktime_get+0x35/0x90
[ 72.530651] nvkm_device_init+0x11f/0x1b0 [nouveau]
[ 72.530676] nvkm_udevice_init+0x41/0x60 [nouveau]
[ 72.530701] nvkm_object_init+0x3b/0x110 [nouveau]
[ 72.530714] nvkm_object_init+0x73/0x110 [nouveau]
[ 72.530726] nvkm_object_init+0x73/0x110 [nouveau]
[ 72.530739] nouveau_do_resume+0x2b/0xc0 [nouveau]
[ 72.530760] nouveau_pmops_runtime_resume+0x7a/0x150 [nouveau]
[ 72.530780] pci_pm_runtime_resume+0xa7/0xc0
[ 72.530782] ? pci_pm_freeze_noirq+0x100/0x100
[ 72.530783] __rpm_callback+0x41/0x150
[ 72.530784] ? pci_pm_freeze_noirq+0x100/0x100
[ 72.530785] rpm_callback+0x59/0x70
[ 72.530786] rpm_resume+0x4ac/0x7e0
[ 72.530787] ? __schedule+0x313/0x920
[ 72.530789] __pm_runtime_resume+0x4a/0x80
[ 72.530790] rpm_get_suppliers+0x3c/0xc0
[ 72.530791] ? pci_pm_freeze_noirq+0x100/0x100
[ 72.530792] __rpm_callback+0xa2/0x150
[ 72.530793] ? pci_pm_freeze_noirq+0x100/0x100
[ 72.530794] rpm_callback+0x59/0x70
[ 72.530795] rpm_resume+0x4ac/0x7e0
[ 72.530796] ? _raw_spin_unlock_irqrestore+0x1b/0x30
[ 72.530797] ? try_to_wake_up+0x94/0x4b0
[ 72.530798] ? preempt_count_add+0x68/0xa0
[ 72.530800] pm_runtime_work+0x6c/0xa0
[ 72.530801] process_one_work+0x1c3/0x3d0
[ 72.530804] worker_thread+0x4d/0x3d0
[ 72.530805] ? rescuer_thread+0x390/0x390
[ 72.530807] kthread+0x169/0x190
[ 72.530808] ? set_kthread_struct+0x40/0x40
[ 72.530810] ret_from_fork+0x1f/0x30
[ 72.530811] </TASK>
[ 72.530812] ---[ end trace 5c2881698c1c5255 ]---
[ 72.530813] nouveau 0000:01:00.0: acr: AHESASC binary failed
[ 72.530814] nouveau 0000:01:00.0: acr: init failed, -110
[ 72.530988] nouveau 0000:01:00.0: init failed with -110
[ 72.530989] nouveau: Xorg[1190]:00000000:00000080: init failed with -110
[ 72.530990] nouveau: DRM-master:00000000:00000000: init failed with -110
[ 72.530991] nouveau: DRM-master:00000000:00000000: init failed with -110
[ 72.530991] nouveau 0000:01:00.0: DRM: Client resume failed with error: -110
[ 72.530992] nouveau 0000:01:00.0: DRM: resume fa
after that, a suspend/resume test has error also:
[ 93.103360] OOM killer disabled.
[ 93.103361] Freezing remaining freezable tasks ... (elapsed 0.000 seconds) done.
[ 93.104098] printk: Suspending console(s) (use no_console_suspend to debug)
[ 93.109135] ------------[ cut here ]------------
[ 93.109136] xhci_hcd 0000:01:00.2: disabling already-disabled device
[ 93.109141] WARNING: CPU: 4 PID: 236 at drivers/pci/pci.c:2201 pci_disable_device+0xce/0x130
[ 93.109145] Modules linked in: fuse 8021q garp mrp iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv4 ip_tables x_tables efivarfs binfmt_misc hid_multitouch cdc_mbim cdc_wdm cdc_ncm cdc_ether usbnet mii dell_smm_hwmon coretemp iwlmvm rtsx_pci_sdmmc mmc_core intel_tcc_cooling dell_laptop x86_pkg_temp_thermal i2c_designware_platform dell_wmi i2c_designware_core sparse_keymap hid_generic ledtrig_audio intel_powerclamp ucsi_ccg dell_wmi_sysman dell_smbios mac80211 firmware_attributes_class dell_wmi_descriptor wmi_bmof libarc4 nouveau i915 dcdbas kvm_intel snd_hda_codec_hdmi iwlwifi snd_hda_intel drm_ttm_helper mxm_wmi snd_intel_dspcfg kvm hwmon uvcvideo snd_hda_codec videobuf2_vmalloc i2c_algo_bit cfg80211 ttm videobuf2_memops videobuf2_v4l2 snd_hwdep irqbypass videobuf2_common psmouse evdev crc32c_intel drm_kms_helper snd_hda_core serio_raw efi_pstore videodev i2c_i801 snd_pcm i2c_smbus syscopyarea sysfillrect rtsx_pci intel_lpss_pci sysimgblt snd_timer intel_lpss fb_sys_fops
[ 93.109172] idma64 mc drm usbhid mfd_core rfkill snd intel_pch_thermal i2c_nvidia_gpu soundcore processor_thermal_device_pci_legacy intel_soc_dts_iosf processor_thermal_device intel_gtt processor_thermal_rfim agpgart processor_thermal_mbox ucsi_acpi i2c_hid_acpi typec_ucsi i2c_hid hid roles typec wmi int3403_thermal int340x_thermal_zone video battery button int3400_thermal acpi_pad acpi_thermal_rel ac nvme nvme_core usb_storage
[ 93.109183] CPU: 4 PID: 236 Comm: kworker/u32:6 Tainted: G W 5.16.3-dell-1 #1
[ 93.109185] Hardware name: Dell Inc. XPS 17 9700/0P1CHN, BIOS 1.11.1 11/18/2021
[ 93.109186] Workqueue: events_unbound async_run_entry_fn
[ 93.109188] RIP: 0010:pci_disable_device+0xce/0x130
[ 93.109190] Code: 4d 85 e4 75 07 4c 8b a3 d0 00 00 00 48 8d bb d0 00 00 00 e8 04 b6 0e 00 4c 89 e2 48 c7 c7 d0 6c fb 81 48 89 c6 e8 bf 7c 36 00 <0f> 0b e9 5d ff ff ff 48 8d 54 24 06 be 04 00 00 00 48 89 df e8 79
[ 93.109191] RSP: 0018:ffffc9000055fd88 EFLAGS: 00010282
[ 93.109192] RAX: 0000000000000000 RBX: ffff888101c27000 RCX: 0000000000000000
[ 93.109192] RDX: 0000000000000001 RSI: 0000000000000082 RDI: 00000000ffffffff
[ 93.109193] RBP: ffff888103ccc000 R08: 40000000ffffe4f1 R09: 0000000082938b80
[ 93.109194] R10: ffffffffffffffff R11: ffffffffffffffff R12: ffff888101c35940
[ 93.109194] R13: 0000000000000000 R14: ffff888101c27150 R15: ffff88810006e005
[ 93.109195] FS: 0000000000000000(0000) GS:ffff88887d500000(0000) knlGS:0000000000000000
[ 93.109196] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 93.109196] CR2: 0000000000ab6658 CR3: 000000000220a006 CR4: 00000000007706e0
[ 93.109197] PKRU: 55555554
[ 93.109198] Call Trace:
[ 93.109199] <TASK>
[ 93.109200] suspend_common+0xd5/0x150
[ 93.109202] pci_pm_suspend+0x71/0x160
[ 93.109203] ? pci_pm_freeze+0xb0/0xb0
[ 93.109204] dpm_run_callback+0x3f/0x160
[ 93.109206] ? _raw_spin_lock_irqsave+0x38/0x40
[ 93.109208] __device_suspend+0x130/0x4d0
[ 93.109210] async_suspend+0x1b/0x90
[ 93.109211] async_run_entry_fn+0x1d/0xa0
[ 93.109212] process_one_work+0x1c3/0x3d0
[ 93.109214] worker_thread+0x4d/0x3d0
[ 93.109216] ? rescuer_thread+0x390/0x390
[ 93.109217] kthread+0x169/0x190
[ 93.109219] ? set_kthread_struct+0x40/0x40
[ 93.109220] ret_from_fork+0x1f/0x30
[ 93.109222] </TASK>
[ 93.109222] ---[ end trace 5c2881698c1c5256 ]---