amdgpu crashes when entering the main map in Crusader Kings III
Brief summary of the problem:
< When I press "new game", select a character and press start, the game shows a black screen after a while and my windowmanager mentions that ck3 is not responding anymore. There is an extensive dmesg output >
[ 145.478208] amdgpu 0000:0e:00.0: amdgpu: in page starting at address 0x0000800274450000 from client 10
[ 145.478211] amdgpu 0000:0e:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00541051
[ 145.478212] amdgpu 0000:0e:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
[ 145.478214] amdgpu 0000:0e:00.0: amdgpu: MORE_FAULTS: 0x1
[ 145.478215] amdgpu 0000:0e:00.0: amdgpu: WALKER_ERROR: 0x0
[ 145.478217] amdgpu 0000:0e:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 145.478218] amdgpu 0000:0e:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 145.478219] amdgpu 0000:0e:00.0: amdgpu: RW: 0x1
[ 145.478224] amdgpu 0000:0e:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:5 pasid:32791, for process ck3 pid 11214 thread ck3 pid 11214)
[ 145.478226] amdgpu 0000:0e:00.0: amdgpu: in page starting at address 0x0000800274850000 from client 10
[ 145.478228] amdgpu 0000:0e:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00541051
[ 145.478229] amdgpu 0000:0e:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
[ 145.478230] amdgpu 0000:0e:00.0: amdgpu: MORE_FAULTS: 0x1
[ 145.478232] amdgpu 0000:0e:00.0: amdgpu: WALKER_ERROR: 0x0
[ 145.478233] amdgpu 0000:0e:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 145.478234] amdgpu 0000:0e:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 145.478235] amdgpu 0000:0e:00.0: amdgpu: RW: 0x1
[ 145.478240] amdgpu 0000:0e:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:5 pasid:32791, for process ck3 pid 11214 thread ck3 pid 11214)
[ 145.478242] amdgpu 0000:0e:00.0: amdgpu: in page starting at address 0x00008002743b0000 from client 10
[ 145.478244] amdgpu 0000:0e:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00541051
[ 145.478245] amdgpu 0000:0e:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
[ 145.478246] amdgpu 0000:0e:00.0: amdgpu: MORE_FAULTS: 0x1
[ 145.478247] amdgpu 0000:0e:00.0: amdgpu: WALKER_ERROR: 0x0
[ 145.478248] amdgpu 0000:0e:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 145.478250] amdgpu 0000:0e:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 145.478251] amdgpu 0000:0e:00.0: amdgpu: RW: 0x1
[ 145.478255] amdgpu 0000:0e:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:5 pasid:32791, for process ck3 pid 11214 thread ck3 pid 11214)
[ 145.478257] amdgpu 0000:0e:00.0: amdgpu: in page starting at address 0x0000800274310000 from client 10
[ 145.478259] amdgpu 0000:0e:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00541051
[ 145.478260] amdgpu 0000:0e:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
[ 145.478261] amdgpu 0000:0e:00.0: amdgpu: MORE_FAULTS: 0x1
[ 145.478263] amdgpu 0000:0e:00.0: amdgpu: WALKER_ERROR: 0x0
[ 145.478264] amdgpu 0000:0e:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 145.478265] amdgpu 0000:0e:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 145.478266] amdgpu 0000:0e:00.0: amdgpu: RW: 0x1
[ 145.478270] amdgpu 0000:0e:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:5 pasid:32791, for process ck3 pid 11214 thread ck3 pid 11214)
[ 145.478272] amdgpu 0000:0e:00.0: amdgpu: in page starting at address 0x0000800274430000 from client 10
[ 145.478274] amdgpu 0000:0e:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00541051
[ 145.478275] amdgpu 0000:0e:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
[ 145.478276] amdgpu 0000:0e:00.0: amdgpu: MORE_FAULTS: 0x1
[ 145.478278] amdgpu 0000:0e:00.0: amdgpu: WALKER_ERROR: 0x0
[ 145.478279] amdgpu 0000:0e:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 145.478280] amdgpu 0000:0e:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 145.478281] amdgpu 0000:0e:00.0: amdgpu: RW: 0x1
[ 145.478285] amdgpu 0000:0e:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:5 pasid:32791, for process ck3 pid 11214 thread ck3 pid 11214)
[ 145.478287] amdgpu 0000:0e:00.0: amdgpu: in page starting at address 0x0000800274570000 from client 10
[ 145.478289] amdgpu 0000:0e:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00541051
[ 145.478290] amdgpu 0000:0e:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
[ 145.478291] amdgpu 0000:0e:00.0: amdgpu: MORE_FAULTS: 0x1
[ 145.478293] amdgpu 0000:0e:00.0: amdgpu: WALKER_ERROR: 0x0
[ 145.478294] amdgpu 0000:0e:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 145.478295] amdgpu 0000:0e:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 145.478296] amdgpu 0000:0e:00.0: amdgpu: RW: 0x1
[ 145.478301] amdgpu 0000:0e:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:5 pasid:32791, for process ck3 pid 11214 thread ck3 pid 11214)
[ 145.478302] amdgpu 0000:0e:00.0: amdgpu: in page starting at address 0x0000800274500000 from client 10
[ 145.478304] amdgpu 0000:0e:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00541051
[ 145.478305] amdgpu 0000:0e:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
[ 145.478306] amdgpu 0000:0e:00.0: amdgpu: MORE_FAULTS: 0x1
[ 145.478308] amdgpu 0000:0e:00.0: amdgpu: WALKER_ERROR: 0x0
[ 145.478309] amdgpu 0000:0e:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 145.478310] amdgpu 0000:0e:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 145.478311] amdgpu 0000:0e:00.0: amdgpu: RW: 0x1
[ 145.478316] amdgpu 0000:0e:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:5 pasid:32791, for process ck3 pid 11214 thread ck3 pid 11214)
[ 145.478318] amdgpu 0000:0e:00.0: amdgpu: in page starting at address 0x0000800274410000 from client 10
[ 145.478319] amdgpu 0000:0e:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00541051
[ 145.478320] amdgpu 0000:0e:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
[ 145.478321] amdgpu 0000:0e:00.0: amdgpu: MORE_FAULTS: 0x1
[ 145.478323] amdgpu 0000:0e:00.0: amdgpu: WALKER_ERROR: 0x0
[ 145.478324] amdgpu 0000:0e:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 145.478325] amdgpu 0000:0e:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 145.478326] amdgpu 0000:0e:00.0: amdgpu: RW: 0x1
[ 145.478331] amdgpu 0000:0e:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:5 pasid:32791, for process ck3 pid 11214 thread ck3 pid 11214)
[ 145.478333] amdgpu 0000:0e:00.0: amdgpu: in page starting at address 0x00008002744a0000 from client 10
[ 145.478334] amdgpu 0000:0e:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00541051
[ 145.478335] amdgpu 0000:0e:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
[ 145.478336] amdgpu 0000:0e:00.0: amdgpu: MORE_FAULTS: 0x1
[ 145.478338] amdgpu 0000:0e:00.0: amdgpu: WALKER_ERROR: 0x0
[ 145.478339] amdgpu 0000:0e:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 145.478340] amdgpu 0000:0e:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 145.478341] amdgpu 0000:0e:00.0: amdgpu: RW: 0x1
[ 145.478345] amdgpu 0000:0e:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:40 vmid:5 pasid:32791, for process ck3 pid 11214 thread ck3 pid 11214)
[ 145.478347] amdgpu 0000:0e:00.0: amdgpu: in page starting at address 0x0000800274710000 from client 10
[ 145.478349] amdgpu 0000:0e:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00541051
[ 145.478350] amdgpu 0000:0e:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
[ 145.478351] amdgpu 0000:0e:00.0: amdgpu: MORE_FAULTS: 0x1
[ 145.478353] amdgpu 0000:0e:00.0: amdgpu: WALKER_ERROR: 0x0
[ 145.478354] amdgpu 0000:0e:00.0: amdgpu: PERMISSION_FAULTS: 0x5
[ 145.478355] amdgpu 0000:0e:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 145.478356] amdgpu 0000:0e:00.0: amdgpu: RW: 0x1
[ 155.945164] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, but soft recovered
Hardware and System description:
System:
Host: el-ryzerino Kernel: 6.8.7-300.fc40.x86_64 arch: x86_64 bits: 64
Console: pty pts/2 Distro: Fedora Linux 40 (Workstation Edition)
Machine:
Type: Desktop Mobo: ASRock model: B550 Taichi serial: N/A UEFI: American
Megatrends LLC. v: P3.40 date: 01/18/2024
CPU:
Info: 16-core model: AMD Ryzen 9 5950X bits: 64 type: MT MCP cache:
L2: 8 MiB
Speed (MHz): avg: 2331 min/max: 2200/5083 cores: 1: 2200 2: 2200 3: 2200
4: 2200 5: 2200 6: 2200 7: 2200 8: 2200 9: 3400 10: 2200 11: 2200 12: 2200
13: 2200 14: 2200 15: 2200 16: 2200 17: 3400 18: 2200 19: 2200 20: 2200
21: 2200 22: 2200 23: 2800 24: 2200 25: 3400 26: 2200 27: 2200 28: 2200
29: 2200 30: 2200 31: 2200 32: 2200
Graphics:
Device-1: AMD Navi 31 [Radeon RX 7900 XT/7900 XTX/7900M] driver: amdgpu
v: kernel
Display: server: X.Org v: 23.2.6 with: Xwayland v: 23.2.6 driver: X:
loaded: amdgpu unloaded: fbdev,modesetting,radeon,vesa dri: radeonsi
gpu: amdgpu resolution: 1: 2560x1440~90Hz 2: 1920x1200~60Hz
API: EGL v: 1.5 drivers: radeonsi,swrast platforms: x11,surfaceless,device
API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 24.0.5 renderer: AMD
Radeon RX 7900 GRE (radeonsi navi31 LLVM 18.1.1 DRM 3.57
6.8.7-300.fc40.x86_64)
API: Vulkan v: 1.3.280 drivers: N/A surfaces: xcb,xlib
Audio:
Device-1: AMD Navi 31 HDMI/DP Audio driver: snd_hda_intel
Device-2: AMD Starship/Matisse HD Audio driver: snd_hda_intel
Device-3: SteelSeries ApS Arctis 7
driver: hid-generic,snd-usb-audio,usbhid type: USB
API: ALSA v: k6.8.7-300.fc40.x86_64 status: kernel-api
Network:
Device-1: Intel Ethernet I225-V driver: igc
IF: enp4s0 state: down mac: 68:54:5a:62:a3:01
Device-2: Intel Ethernet I225-V driver: igc
IF: enp6s0 state: up speed: 1000 Mbps duplex: full mac: a8:a1:59:36:84:43
IF-ID-1: bridge0 state: up speed: 1000 Mbps duplex: unknown
mac: a8:a1:59:36:84:43
IF-ID-2: virbr0 state: down mac: 52:54:00:e1:14:4a
Drives:
Local Storage: total: 4.55 TiB used: 1.21 TiB (26.7%)
ID-1: /dev/nvme0n1 vendor: Samsung model: SSD 970 EVO Plus 1TB
size: 931.51 GiB
ID-2: /dev/nvme1n1 vendor: Samsung model: SSD 970 EVO Plus 2TB
size: 1.82 TiB
ID-3: /dev/sda vendor: Samsung model: SSD 870 EVO 2TB size: 1.82 TiB
Partition:
ID-1: / size: 728.48 GiB used: 53.15 GiB (7.3%) fs: btrfs dev: /dev/dm-0
ID-2: /boot size: 973.4 MiB used: 327.2 MiB (33.6%) fs: ext4
dev: /dev/nvme0n1p5
ID-3: /boot/efi size: 96 MiB used: 76 MiB (79.2%) fs: vfat
dev: /dev/nvme0n1p1
ID-4: /home size: 1.82 TiB used: 496.97 GiB (26.7%) fs: btrfs
dev: /dev/dm-1
Swap:
ID-1: swap-1 type: zram size: 8 GiB used: 0 KiB (0.0%) dev: /dev/zram0
Sensors:
System Temperatures: cpu: 38.0 C mobo: 34.0 C gpu: amdgpu temp: 47.0 C
Fan Speeds (rpm): fan-1: 0 fan-2: 0 fan-3: 0 fan-4: 0 fan-5: 0 fan-6: 0
fan-7: 0 gpu: amdgpu fan: 0
Info:
Memory: total: 64 GiB available: 62.72 GiB used: 10.02 GiB (16.0%)
Processes: 684 Uptime: 18m Shell: Bash inxi: 3.3.34
How to reproduce the issue:
< TODO: Describe step-by-step how to reproduce the issue >
- Start CK3
- Press Start New Game
- Choose a Charakter
- Press "Start"
- wait for crash