UBSAN: shift-out-of-bounds
Brief summary of the problem:
I just boot and get this error. It does not seem to affect stability this time (contrary to #1779 (closed)) although it looks very much like #1472 (closed)
One thing I'm noticing (and I'm not quite sure if it's related) is that the screen is never put in sleep mode, the card just outputs a black (or should I say blank) image.
Hardware description:
- CPU: Radeon R5 5600XT
- Motherboard: Gigabyte B550 AORUS ELITE V2, BIOS F14e
- GPU: (Sapphire Radeon 380X 4GB) 04:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Tonga XT / Amethyst XT [Radeon R9 380X / R9 M295X] [1002:6938] (rev f1)
- System Memory: 32GB DDR4 3600MHz (2x16GB)
- Display(s): HP 24ea
- Type of Display Connection: HDMI
System information:
- Distro name and Version: Pop_OS! 21.10
- Kernel version: Linux pop-os 5.16.15-76051615-generic #202203161444
164796402721.10~e706226 SMP PREEMPT Tue Mar 22 17 x86_64 x86_64 x86_64 GNU/Linux - Custom kernel: No
- AMD official driver version: N/A
How to reproduce the issue:
I'm using a stock installation but with Pop_OS! nVidia drivers installed, since I swap this drive usually over to another rig with an nvidia GPU. I just get this error on boot.
Apr 17 13:12:26 pop-os kernel: [ 7.668558] [drm] VCE initialized successfully.
Apr 17 13:12:26 pop-os kernel: [ 7.669648] kfd kfd: amdgpu: Allocated 3969056 bytes on gart
Apr 17 13:12:26 pop-os kernel: [ 7.669701] ================================================================================
Apr 17 13:12:26 pop-os kernel: [ 7.669701] UBSAN: shift-out-of-bounds in /build/linux-iurSuf/linux-5.16.15/drivers/gpu/drm/amd/amdgpu/../amdkfd/kfd_device_queue_manager.c:998:32
Apr 17 13:12:26 pop-os kernel: [ 7.669702] shift exponent 64 is too large for 64-bit type 'long long unsigned int'
Apr 17 13:12:26 pop-os kernel: [ 7.669703] CPU: 11 PID: 240 Comm: systemd-udevd Not tainted 5.16.15-76051615-generic #202203161444~1647964027~21.10~e706226
Apr 17 13:12:26 pop-os kernel: [ 7.669705] Hardware name: Gigabyte Technology Co., Ltd. B550 AORUS ELITE V2/B550 AORUS ELITE V2, BIOS F14e 10/13/2021
Apr 17 13:12:26 pop-os kernel: [ 7.669705] Call Trace:
Apr 17 13:12:26 pop-os kernel: [ 7.669707] <TASK>
Apr 17 13:12:26 pop-os kernel: [ 7.669708] show_stack+0x52/0x58
Apr 17 13:12:26 pop-os kernel: [ 7.669711] dump_stack_lvl+0x4c/0x63
Apr 17 13:12:26 pop-os kernel: [ 7.669712] dump_stack+0x10/0x12
Apr 17 13:12:26 pop-os kernel: [ 7.669713] ubsan_epilogue+0x9/0x45
Apr 17 13:12:26 pop-os kernel: [ 7.669715] __ubsan_handle_shift_out_of_bounds.cold+0x61/0xef
Apr 17 13:12:26 pop-os kernel: [ 7.669716] initialize_nocpsch.cold+0x15/0x59 [amdgpu]
Apr 17 13:12:26 pop-os kernel: [ 7.669838] device_queue_manager_init+0x20b/0x3b0 [amdgpu]
Apr 17 13:12:26 pop-os kernel: [ 7.669931] kgd2kfd_device_init.cold+0x1b9/0x49a [amdgpu]
Apr 17 13:12:26 pop-os kernel: [ 7.670044] amdgpu_amdkfd_device_init+0x135/0x170 [amdgpu]
Apr 17 13:12:26 pop-os kernel: [ 7.670132] amdgpu_device_ip_init+0x66e/0x68d [amdgpu]
Apr 17 13:12:26 pop-os kernel: [ 7.670244] amdgpu_device_init.cold+0x281/0x801 [amdgpu]
Apr 17 13:12:26 pop-os kernel: [ 7.670351] ? do_pci_enable_device+0xd7/0x100
Apr 17 13:12:26 pop-os kernel: [ 7.670354] amdgpu_driver_load_kms+0x6d/0x350 [amdgpu]
Apr 17 13:12:26 pop-os kernel: [ 7.670426] amdgpu_pci_probe+0x1e2/0x2a0 [amdgpu]
Apr 17 13:12:26 pop-os kernel: [ 7.670494] local_pci_probe+0x4b/0x90
Apr 17 13:12:26 pop-os kernel: [ 7.670496] ? pci_match_device+0xde/0x130
Apr 17 13:12:26 pop-os kernel: [ 7.670497] pci_device_probe+0xf8/0x1f0
Apr 17 13:12:26 pop-os kernel: [ 7.670498] really_probe+0x21e/0x420
Apr 17 13:12:26 pop-os kernel: [ 7.670500] __driver_probe_device+0x115/0x190
Apr 17 13:12:26 pop-os kernel: [ 7.670500] driver_probe_device+0x23/0xc0
Apr 17 13:12:26 pop-os kernel: [ 7.670501] __driver_attach+0xbd/0x1d0
Apr 17 13:12:26 pop-os kernel: [ 7.670502] ? __device_attach_driver+0x110/0x110
Apr 17 13:12:26 pop-os kernel: [ 7.670503] bus_for_each_dev+0x7e/0xc0
Apr 17 13:12:26 pop-os kernel: [ 7.670505] driver_attach+0x1e/0x20
Apr 17 13:12:26 pop-os kernel: [ 7.670505] bus_add_driver+0x161/0x200
Apr 17 13:12:26 pop-os kernel: [ 7.670506] driver_register+0x95/0xf0
Apr 17 13:12:26 pop-os kernel: [ 7.670507] __pci_register_driver+0x68/0x70
Apr 17 13:12:26 pop-os kernel: [ 7.670508] amdgpu_init+0x7c/0x1000 [amdgpu]
Apr 17 13:12:26 pop-os kernel: [ 7.670577] ? 0xffffffffc0f84000
Apr 17 13:12:26 pop-os kernel: [ 7.670578] do_one_initcall+0x48/0x210
Apr 17 13:12:26 pop-os kernel: [ 7.670580] ? kmem_cache_alloc_trace+0x1a6/0x320
Apr 17 13:12:26 pop-os kernel: [ 7.670583] do_init_module+0x52/0x250
Apr 17 13:12:26 pop-os kernel: [ 7.670585] load_module+0xb24/0xc00
Apr 17 13:12:26 pop-os kernel: [ 7.670587] __do_sys_finit_module+0xbf/0x120
Apr 17 13:12:26 pop-os kernel: [ 7.670588] __x64_sys_finit_module+0x18/0x20
Apr 17 13:12:26 pop-os kernel: [ 7.670590] do_syscall_64+0x5c/0xc0
Apr 17 13:12:26 pop-os kernel: [ 7.670592] ? fput+0x13/0x20
Apr 17 13:12:26 pop-os kernel: [ 7.670593] ? ksys_mmap_pgoff+0x10c/0x250
Apr 17 13:12:26 pop-os kernel: [ 7.670595] ? exit_to_user_mode_prepare+0x37/0xb0
Apr 17 13:12:26 pop-os kernel: [ 7.670596] ? syscall_exit_to_user_mode+0x27/0x50
Apr 17 13:12:26 pop-os kernel: [ 7.670598] ? __x64_sys_mmap+0x33/0x40
Apr 17 13:12:26 pop-os kernel: [ 7.670599] ? do_syscall_64+0x69/0xc0
Apr 17 13:12:26 pop-os kernel: [ 7.670600] ? exit_to_user_mode_prepare+0x37/0xb0
Apr 17 13:12:26 pop-os kernel: [ 7.670601] ? syscall_exit_to_user_mode+0x27/0x50
Apr 17 13:12:26 pop-os kernel: [ 7.670602] ? __x64_sys_lseek+0x18/0x20
Apr 17 13:12:26 pop-os kernel: [ 7.670603] ? do_syscall_64+0x69/0xc0
Apr 17 13:12:26 pop-os kernel: [ 7.670604] ? do_syscall_64+0x69/0xc0
Apr 17 13:12:26 pop-os kernel: [ 7.670605] ? do_syscall_64+0x69/0xc0
Apr 17 13:12:26 pop-os kernel: [ 7.670606] entry_SYSCALL_64_after_hwframe+0x44/0xae
Apr 17 13:12:26 pop-os kernel: [ 7.670608] RIP: 0033:0x7f839a3279ad
Apr 17 13:12:26 pop-os kernel: [ 7.670609] Code: 5b 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 53 b4 0f 00 f7 d8 64 89 01 48
Apr 17 13:12:26 pop-os kernel: [ 7.670610] RSP: 002b:00007ffd1b6cb968 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
Apr 17 13:12:26 pop-os kernel: [ 7.670611] RAX: ffffffffffffffda RBX: 00005616fb4f0a30 RCX: 00007f839a3279ad
Apr 17 13:12:26 pop-os kernel: [ 7.670612] RDX: 0000000000000000 RSI: 00005616fb4f19b0 RDI: 0000000000000018
Apr 17 13:12:26 pop-os kernel: [ 7.670612] RBP: 0000000000020000 R08: 0000000000000000 R09: 0000000000000000
Apr 17 13:12:26 pop-os kernel: [ 7.670613] R10: 0000000000000018 R11: 0000000000000246 R12: 00005616fb4f19b0
Apr 17 13:12:26 pop-os kernel: [ 7.670613] R13: 00005616fb514780 R14: 0000000000000000 R15: 00005616fb516f20
Apr 17 13:12:26 pop-os kernel: [ 7.670614] </TASK>
Apr 17 13:12:26 pop-os kernel: [ 7.670615] ================================================================================
Attached files:
Screenshots/video files
N/A
Log files (for system lockups / game freezes / crashes)
- Dmesg log (full log)dmesg.0
- Xorg log Xorg.0.log