AMDGPU - GPU fault detected - VM_CONTEXT1_PROTECTION_FAULT_ADDR since Kernel 6.3.x
System information
System:
Host: el-ryzerino Kernel: 6.2.15-300.fc38.x86_64 arch: x86_64 bits: 64
compiler: gcc v: 2.39-9.fc38 Desktop: GNOME v: 44.1 tk: GTK v: 3.24.37
wm: gnome-shell dm: GDM Distro: Fedora release 38 (Thirty Eight)
CPU:
Info: 16-core model: AMD Ryzen 9 5950X bits: 64 type: MT MCP arch: Zen 3+
rev: 2 cache: L1: 1024 KiB L2: 8 MiB L3: 64 MiB
Speed (MHz): avg: 2218 high: 2795 min/max: 2200/5130 boost: enabled cores:
1: 2200 2: 2200 3: 2200 4: 2200 5: 2200 6: 2200 7: 2200 8: 2200 9: 2200
10: 2200 11: 2200 12: 2200 13: 2200 14: 2200 15: 2200 16: 2200 17: 2196
18: 2200 19: 2200 20: 2200 21: 2200 22: 2200 23: 2200 24: 2200 25: 2795
26: 2200 27: 2200 28: 2200 29: 2200 30: 2200 31: 2200 32: 2200
bogomips: 217205
Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
Graphics:
Device-1: AMD Ellesmere [Radeon RX 470/480/570/570X/580/580X/590]
vendor: Gigabyte driver: amdgpu v: kernel arch: GCN-4 pcie: speed: 8 GT/s
lanes: 16 ports: active: DP-4,HDMI-A-1 empty: DP-1, DP-2, DP-3, DP-5,
DVI-D-1 bus-ID: 0d:00.0 chip-ID: 1002:67df temp: 49.0 C
Display: wayland server: X.org v: 1.20.14 with: Xwayland v: 22.1.9
compositor: gnome-shell driver: X: loaded: amdgpu
unloaded: fbdev,modesetting,vesa dri: radeonsi gpu: amdgpu display-ID: 0
Monitor-1: DP-4 model: HP Z24n G2 res: 1920x1200 dpi: 94
diag: 611mm (24.1")
Monitor-2: HDMI-A-1 model: XG27WQ res: 2560x1440 dpi: 107
diag: 703mm (27.7")
API: OpenGL v: 4.6 Mesa 23.0.3 renderer: AMD Radeon RX 480 Graphics
(polaris10 LLVM 16.0.1 DRM 3.49 6.2.15-300.fc38.x86_64) direct-render: Yes
Describe the issue
After booting a 6.3.x kernel (e.g. 6.3.3), I noticed sluggish behaviour of the window manager (GNOME) and of an application (Civilization 6). I checked dmesg and found an amdgpu-related issue (log below).
Regression
This is probably a regression, since everything still works with kernel 6.2.15.
Log files as attachment
Mai 21 22:40:49 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: GPU fault detected: 147 0x0a124402 for process skypeforlinux pid 7723 thread skypeforli:cs0 pid 7741
Mai 21 22:40:49 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00106F42
Mai 21 22:40:49 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F044002
Mai 21 22:40:49 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM fault (0x02, vmid 7, pasid 32776) at page 1077058, write from 'TC5' (0x54433500) (68)
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: GPU fault detected: 147 0x0a124402 for process skypeforlinux pid 7723 thread skypeforli:cs0 pid 7741
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00106F42
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F044002
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM fault (0x02, vmid 7, pasid 32776) at page 1077058, write from 'TC5' (0x54433500) (68)
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: GPU fault detected: 147 0x0a0a8802 for process skypeforlinux pid 7723 thread skypeforli:cs0 pid 7741
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00106F44
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E0A2002
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM fault (0x02, vmid 7, pasid 32776) at page 1077060, read from 'CBC4' (0x43424334) (162)
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: GPU fault detected: 147 0x0a1a0402 for process skypeforlinux pid 7723 thread skypeforli:cs0 pid 7741
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00106EAE
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F020002
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM fault (0x02, vmid 7, pasid 32776) at page 1076910, write from 'CB2' (0x43423200) (32)
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: GPU fault detected: 147 0x0a0a0402 for process skypeforlinux pid 7723 thread skypeforli:cs0 pid 7741
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00106E83
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F0D0002
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM fault (0x02, vmid 7, pasid 32776) at page 1076867, write from 'CB7' (0x43423700) (208)
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: GPU fault detected: 147 0x0a024402 for process skypeforlinux pid 7723 thread skypeforli:cs0 pid 7741
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00106EE4
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F0A0002
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM fault (0x02, vmid 7, pasid 32776) at page 1076964, write from 'CB4' (0x43423400) (160)
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: GPU fault detected: 147 0x0a1a8802 for process skypeforlinux pid 7723 thread skypeforli:cs0 pid 7741
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00106F04
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F020002
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM fault (0x02, vmid 7, pasid 32776) at page 1076996, write from 'CB2' (0x43423200) (32)
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: GPU fault detected: 147 0x0a120802 for process skypeforlinux pid 7723 thread skypeforli:cs0 pid 7741
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00106F26
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F010002
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM fault (0x02, vmid 7, pasid 32776) at page 1077030, write from 'CB3' (0x43423300) (16)
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: GPU fault detected: 147 0x0a128402 for process skypeforlinux pid 7723 thread skypeforli:cs0 pid 7741
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00106D88
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F020002
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM fault (0x02, vmid 7, pasid 32776) at page 1076616, write from 'CB2' (0x43423200) (32)
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: GPU fault detected: 147 0x0a124802 for process skypeforlinux pid 7723 thread skypeforli:cs0 pid 7741
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00106D91
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F090002
Mai 21 22:40:50 el-ryzerino kernel: amdgpu 0000:0d:00.0: amdgpu: VM fault (0x02, vmid 7, pasid 32776) at page 1076625, write from 'CB5' (0x43423500) (144)
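For anyone triaging the log above, here is a minimal Python sketch of how the raw register values relate to the decoded "VM fault" lines. The bit layout of VM_CONTEXT1_PROTECTION_FAULT_STATUS used here is an assumption, checked only against the values the kernel prints in this log and not taken from a register reference. The helper names `decode_fault_status` and `decode_client_tag` are made up for illustration.

```python
# Assumed bit layout of VM_CONTEXT1_PROTECTION_FAULT_STATUS on this GPU.
# It reproduces the decoded values printed in the log above, but it is an
# assumption, not something confirmed from official register documentation.
PROTECTIONS_MASK, PROTECTIONS_SHIFT = 0x000000FF, 0
CLIENT_ID_MASK, CLIENT_ID_SHIFT = 0x000FF000, 12
CLIENT_RW_MASK, CLIENT_RW_SHIFT = 0x01000000, 24
VMID_MASK, VMID_SHIFT = 0x1E000000, 25


def decode_fault_status(status):
    """Split a raw fault status value into the fields shown in the log."""
    is_write = (status & CLIENT_RW_MASK) >> CLIENT_RW_SHIFT
    return {
        "protections": (status & PROTECTIONS_MASK) >> PROTECTIONS_SHIFT,
        "client_id": (status & CLIENT_ID_MASK) >> CLIENT_ID_SHIFT,
        "access": "write" if is_write else "read",
        "vmid": (status & VMID_MASK) >> VMID_SHIFT,
    }


def decode_client_tag(tag):
    """Turn the packed ASCII tag (e.g. 0x54433500) into a string like 'TC5'."""
    return tag.to_bytes(4, "big").rstrip(b"\x00").decode("ascii")


if __name__ == "__main__":
    # Values copied from the first fault in the log.
    print(decode_fault_status(0x0F044002))
    print("page:", int("0x00106F42", 16))
    print("client block:", decode_client_tag(0x54433500))
```

Under these assumptions, the first fault decodes to protections 0x02, vmid 7, a write from memory client 68 ('TC5'), and FAULT_ADDR 0x00106F42 is the same number as "page 1077058" written in decimal. The faults therefore come from several texture cache and colour block clients (TC, CB, CBC) of the same process rather than from a single unit.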