Due to an influx of spam, we have had to impose restrictions on new accounts. Please see this wiki page for instructions on how to get full permissions. Sorry for the inconvenience.
After updating my archlinux computer from linux 6.1.12 to linux 6.2.1 today, nouveau crashes (screen freeze) right after booting with:
kernel: nouveau 0000:01:00.0: DRM: [DRM/00000002:sor-0002-0342] [ACQUIRE proto:TMDS head:0 hdmi:1 max_ac_packet:6 rekey:56 scdc:0 hda:0] or:0 link:0 (ret:-22)kernel: nouveau 0000:01:00.0: disp: ERROR 3 [RESERVED_METHOD] 00 [] chid 0 mthd 05c0 data 00000201kernel: nouveau 0000:01:00.0: disp: ERROR 5 [INVALID_STATE] 05 [] chid 0 mthd 0080 data 00000002...
I had to reboot with ssh.
The complete kernel log is joined here: kernel.log
With a linux-rt 6.0.5.14 that was already installed on my computer, everything is working well. I still haven't tested with a reinstall of previous kernel, but I guess it will work too.
It seems something doesn't work nicely between nouveau and latest kernel specifically.
Thanks for your help.
To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Child items
0
Show closed items
No child items are currently assigned. Use child items to break down this issue into smaller parts.
Linked items
0
Link issues together to show that they're related.
Learn more.
Able to repro this on 6.3.5 (also arch) with a 9600GT.
Using analogue DVI works fine, hot-plugging HDMI doesn't do anything, leaving HDMI plugged in and rebooting yields nothing but garbled analogue output (but I can use a terminal just fine) and generates the same error in dmesg
It doesn't solve the issue but there seems to be a copy-paste error inside g84_sor_hdmi_infoframe_vsi in drivers/gpu/drm/nouveau/nvkm/engine/disp/g84.c :
nvkm_mask(device, 0x61653c + hoff, 0x00010001, 0x00010000); if (!size) return;
but it should be :
nvkm_mask(device, 0x61653c + hoff, 0x00000001, 0x00000000); if (!size) return;
Same with me. My lspci is 01:00.0 VGA compatible controller: NVIDIA Corporation G96CM [GeForce 9600M GT] (rev a1)
I'm not sure if HDMI worked on my machine, but on 6.2.2-300.fc37.x86_64 connecting a HDMI cable slows down considerably the system or hangs it, and on 6.0.7-301.fc37.x86_64 it at least shows something (not usable though).
I use Fedora 37 and have xorg-x11-drv-nouveau-1.0.17 installed. In both cases xrandr shows: DVI-D-1 (I don't have any DVI port on my laptop) as an output while nvidia-340xx shows HDMI-0
I don't know if it can help, but when I set grub to some low text mode resolution, the display stays in that mode during all boot process and never switches to a better graphical resolution.
huh... that's interesting as this suggest nouveau fails to load. Do you have a kernel log from that? I'm still trying to get my hands on a G9x GPU with HDMI, so sorry if it takes a while to resolve this as no nouveau developer seems to have access to such a GPU sadly
yeah.. I'm mostly wondering why my suggestion doesn't work, but maybe you also did something different than I wanted you to, so maybe I just post a patch here:
The only difference I can notice is a less garbled screen:
So the patch may be a part of the solution, but something else is still missing. Is some order of operations/initialization changed by commit f530bc60?
mhhh... yeah, the order could have changed, but it could also be something else regressed it on top. Do you know if the patch works on top of f530bc60? If so, there might be another change you could try to bisect, which would be a bit more painful as you have to carry that patch around.
Yeah.. I think it changed some ordering around, but I don't expect it to cause anything... but if the fix on top of f530bc60 doesn't fix it, I suspect I'll have to look a bit deeper to figure out what's wrong there.
Do I need to change something else? These are the nouveau switches used in my kernel config.
CONFIG_DRM_NOUVEAU=m# CONFIG_NOUVEAU_LEGACY_CTX_SUPPORT is not setCONFIG_NOUVEAU_DEBUG=5CONFIG_NOUVEAU_DEBUG_DEFAULT=3# CONFIG_NOUVEAU_DEBUG_MMU is not setCONFIG_NOUVEAU_DEBUG_PUSH=yCONFIG_DRM_NOUVEAU_BACKLIGHT=yCONFIG_DRM_NOUVEAU_SVM=y
@rv I just noticed that testing fixes on top of f530bc60 won't work, because it requires future changes to get pulled in Mind trying the patch out on linux/master ?
mars 02 22:01:10 archlinux kernel: ACPI Error: AE_NOT_FOUND, While resolving a named reference package element - LNKC (20221020/dspkginit-438) many similar lines, seems a BIOS warning, not a real bug.
mars 02 22:01:10 archlinux kernel: pci 0000:00:00.0: [Firmware Bug]: reg 0x1c: invalid BAR (can't size)
mars 02 22:01:10 archlinux kernel: pnp 00:01: disabling [mem 0x00000000-0x00000fff window] because it overlaps 0000:03:00.0 BAR 6 [mem 0x00000000-0x0001ffff pref]
mars 02 22:01:10 archlinux kernel: pnp 00:05: disabling [mem 0x000d8400-0x000dbfff] because it overlaps 0000:01:00.0 BAR 6 [mem 0x000c0000-0x000dffff]
ars 02 22:01:10 archlinux kernel: system 00:05: [mem 0x000f0000-0x000f7fff] could not be reservedmars 02 22:01:10 archlinux kernel: system 00:05: [mem 0x000f8000-0x000fbfff] could not be reservedmars 02 22:01:10 archlinux kernel: system 00:05: [mem 0x000fc000-0x000fffff] could not be reservedmars 02 22:01:10 archlinux kernel: system 00:05: [mem 0xcfcf0000-0xcfcfffff] could not be reservedmars 02 22:01:10 archlinux kernel: system 00:05: [mem 0xffff0000-0xffffffff] has been reservedmars 02 22:01:10 archlinux kernel: system 00:05: [mem 0x00000000-0x0009ffff] could not be reservedmars 02 22:01:10 archlinux kernel: system 00:05: [mem 0x00100000-0xcfceffff] could not be reservedmars 02 22:01:10 archlinux kernel: system 00:05: [mem 0xcfd00000-0xcfdfffff] has been reservedmars 02 22:01:10 archlinux kernel: system 00:05: [mem 0xcfe00000-0xcfefffff] could not be reservedmars 02 22:01:10 archlinux kernel: system 00:05: [mem 0xfec00000-0xfec00fff] could not be reservedmars 02 22:01:10 archlinux kernel: system 00:05: [mem 0xfee00000-0xfee00fff] could not be reserved
mars 02 22:01:14 maisondubonheur kernel: ACPI Warning: SystemIO range 0x0000000000000B00-0x0000000000000B08 conflicts with OpRegion 0x0000000000000B00-0x0000000000000B0F (\SOR1) (20221020/utaddress-204)mars 02 22:01:14 maisondubonheur kernel: ACPI: OSL: Resource conflict; ACPI support missing from driver?
mars 02 22:01:23 maisondubonheur kernel: nouveau 0000:01:00.0: DRM: [DRM/00000002:sor-0002-0342] [ACQUIRE proto:TMDS head:0 hdmi:1 max_ac_packet:6 rekey:56 scdc:0 hda:0] or:0 link:0 (ret:-22)mars 02 22:01:23 maisondubonheur kernel: nouveau 0000:01:00.0: disp: ERROR 3 [RESERVED_METHOD] 00 [] chid 0 mthd 05c0 data 00000201mars 02 22:01:23 maisondubonheur kernel: nouveau 0000:01:00.0: disp: ERROR 5 [INVALID_STATE] 05 [] chid 0 mthd 0080 data 00000002mars 02 22:01:23 maisondubonheur kernel: nouveau 0000:01:00.0: disp: Core:mars 02 22:01:23 maisondubonheur kernel: nouveau 0000:01:00.0: disp: 0084: 03000400 -> 80000000mars 02 22:01:23 maisondubonheur kernel: nouveau 0000:01:00.0: disp: 0088: 00000000 -> f0000000
The key to me seems this (previously written):
mars 02 22:01:23 maisondubonheur kernel: nouveau 0000:01:00.0: DRM: [DRM/00000002:sor-0002-0342] [ACQUIRE proto:TMDS head:0 hdmi:1 max_ac_packet:6 rekey:56 scdc:0 hda:0] or:0 link:0 (ret:-22)
mhh yeah, Lyude already has some ideas on what's up here, but I couldn't reproduce this with any of my HDMI cards, so we kinda have to wait until somebody actually figures out what's wrong here.
Just mentioning the link with the error containing: sor term in DRM/00000002:sor-0002-0342 and previous error ACPI Warning: SystemIO range 0x0000000000000B00-0x0000000000000B08 conflicts with OpRegion 0x0000000000000B00-0x0000000000000B0F (\SOR1)