Ubuntu 20.04.1 LTS Boot hangs sometimes after updating amdgpu
After updating my amdgpu firmware & mesa gl drivers, I noticed that my boot hangs sometimes. After selecting Ubuntu from GRUB it displays my BIOS splash screen then hangs indefinitely. I have to perform a hard reset until it the boot succeeds which is about 50% of the time.
I initially suspected it was due to installing a new kernel or removing old unused kernels until I booted without "quiet splash" and saw the messages below. Details here, (it might be good to see what troubleshooting I did): Original Issue Posted on Ask Ubuntu
Summary of what was tried
- Removed all old unused kernels
- Reinstall latest Ubuntu generic kernel
- Reinstall linux-firmware (manually deleted /lib/firmware/amdgpu and verified what it was reinstalled)
- Ran sudo update-initramfs -u
- Ran sudo update-grub
My Hardware
- MOBO: ASUS TUF Gaming X570 Wifi Plus
- CPU: Ryzen 7 3700X
- GPU: XFX RX 5700 XT RAW II
- DISPLAYS: 2X Display Port Monitors
OS
- Distro: Ubuntu 20.04.1 LTS
- Kernel: 5.4.0-70-generic
Updated amdgpu drivers and firmware from dpkg logs
- 2021-04-09 15:27:16 upgrade libdrm-amdgpu1:amd64 2.4.102-1ubuntu1kisak1
f 2.4.103-1ubuntu1f~mesarc0 - 2021-04-09 15:27:16 upgrade libdrm-amdgpu1:i386 2.4.102-1ubuntu1kisak1
f 2.4.103-1ubuntu1f~mesarc0 - 2021-04-09 15:27:19 upgrade linux-firmware:all 1.187.10 1.190.2~mesarc0
- 2021-04-09 15:27:25 upgrade mesa-va-drivers:amd64 21.0.1
kisak1f 21.0.2kisak1f - 2021-04-09 15:27:27 upgrade xserver-xorg-video-amdgpu:amd64 19.1.0-1 19.1.0+git2102021245.6ed4863
fmesarc0
Boot Logs displayed when boot hangs
- Apr 9 22:36:50 hackintosh kernel: [ 0.956618] amdgpu 0000:0c:00.0: remove_conflicting_pci_framebuffers: bar 0: 0x7fe0000000 -> 0x7fefffffff
- Apr 9 22:36:50 hackintosh kernel: [ 0.956619] amdgpu 0000:0c:00.0: remove_conflicting_pci_framebuffers: bar 2: 0x7ff0000000 -> 0x7ff01fffff
- Apr 9 22:36:50 hackintosh kernel: [ 0.956619] amdgpu 0000:0c:00.0: remove_conflicting_pci_framebuffers: bar 5: 0xfcd00000 -> 0xfcd7ffff
- Apr 9 22:36:50 hackintosh kernel: [ 0.956620] checking generic (7fe0000000 7e9000) vs hw (7fe0000000 10000000)
- Apr 9 22:36:50 hackintosh kernel: [ 0.956621] fb0: switching to amdgpudrmfb from EFI VGA
I've also attached the /var/log/syslog at around the time of a similar crash: Apr 9 22:36:50 hackintosh kernel: [ 0.956621] fb0: switching to amdgpudrmfb from EFI VGA SystemLogTimeOfCrash.txt
More Logs Also it seems that XOrg is crashing
Please help! Thanks!