Hard crashes and reboots with RX 7800 XT
Brief summary of the problem:
Since I upgraded from an RX 6700 XT to an RX 7800 XT, I've been experiencing hard crashes resulting to system reboot. These crashes only happen during gaming, usually (but not only) during some kind of a transition scene (opening a map, entering/leaving loading screen). Sometimes it happenes right after I start the game, sometimes I can play for an hour or two. It's like, if I test the stability, it works. But when I finally think that the problem is gone and I start gaming, it's immediatelly back. I tried to discuss this on Reddit but after a month of troubleshooting, I guess it's time to report this.
Detailed description:
Everything runs fine until suddenly the monitor goes black, the computer enters a restart phase, CPU fan is still running, system boots. If a speaker is connecter, buzzling sound can be heard for a few seconds.
So far I've tried:
- upgrading my PSU from 650W to 850W
- reinstalling the OS
- different distribution (openSUSE Tumbleweed)
- limiting the core clock with CoreCtrl to 2140 (took a while but it happened again)
- different kernels - 6.6.7, 6.6.8, 6.6.9, 6.7.0 (kernel-mainline AUR package)
- disabling C-States in BIOS
- running memtest (passed 7 times)
- updating BIOS
Games that I remember caused this:
- AC Syndicate - random crashes but also a crash when loading a map
- Resident Evil 4 - crash when loading a save (screen going from 2D to 3D)
- Hogwarts Legacy - opening settings, random crashes as well
- Call of Duty 2 - random crashes
- Talos Principle 2 - used to happen randomly, can't reproduce since Mesa 23.3
- Serious Sam 4 - random crashes
- Serious Sam Syberian Mayhem - random crashes
- Red Dead Redemption 2 - random crash
Games that are, for whatever reason, stable during my testing:
- Counter-Strike 2 (EDIT 9th January 2023 - two crashes in CS2)
Hardware description:
- CPU: AMD Ryzen 5 7600
- GPU: 03:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 32 [Radeon RX 7700 XT / 7800 XT] [1002:747e] (rev c8)
- System Memory: 32GB DDR5 4800MHz CL40
- Display(s): Samsung Odyssey G40B 25" 1080p
- Type of Display Connection: DisplayPort
System information:
- Distro name and Version: Arch Linux
- Kernel version: 6.6.8 | 6.6.9
- Custom kernel: 6.7.0-rc8-1-mainline
- AMD official driver version: no proprietary driver, just: lib32-libva-mesa-driver 1:23.3.2-2, lib32-mesa 1:23.3.2-2, lib32-mesa-demos 9.0.0-1, lib32-mesa-utils 9.0.0-1, lib32-vulkan-mesa-layers 1:23.3.2-2, libva-mesa-driver 1:23.3.2-2, mesa 1:23.3.2-2, mesa-demos 9.0.0-3, mesa-utils 9.0.0-3, mesa-vdpau 1:23.3.2-2, vulkan-mesa-layers 1:23.3.2-2
- Display server: Wayland
- GPU model: Sapphire Pulse RX 7800 XT
How to reproduce the issue:
Play a DXVK/VKD3D (can't reproduce this in native titles [EDIT 9th January - happened in CS2]) game. Sometimes it doesn't happen right away, it might take an hour or two. If it all seems stable, try shutting down the PC, boot it up again and start the game again. If the game that is being played has a 2D GUI (map, settings), switch to that often.
Log files (for system lockups / game freezes / crashes)
- Dmesg log (full log) - dmesg.txt
- journalctl logs from the time of the crash: ac_syndicate.txt, re4.txt