VGPU fails and kills KVM host
velde666@gmail.com
Submitted byAssigned to Terrence Xu
Link to original bug (#111582)
Description
Created attachment 145291
trace 1
Hi there,
I am running CentOS 7 on an Intel NUC7i3BNH with with KVM/QEMU using GPU virtualization passthrough to a Windows 10 (1903) VM.
Kernel is 5.2.11 compiled with merged configs from standard CentOS kernel 3.10.x and 5.2.x kernel-ml from elrepo-kernel.
I have an issue where the VGPU used in the Win 10 Guest throws following errors a gazillion times on the host:
Sep 6 19:59:55 floor13 kernel: gvt: vgpu 1: fail: shadow page 00000000c3628cae guest entry 0xffffffffffffffff type 9
Sep 6 19:59:55 floor13 kernel: gvt: vgpu 1: fail: spt 00000000eed2450a guest entry 0xffffffffffffffff type 9
Sep 6 19:59:55 floor13 kernel: gvt: vgpu 1: fail: shadow page 00000000eed2450a guest entry 0xffffffffffffffff type 9.
Sep 6 19:59:55 floor13 kernel: gvt: guest page write error, gpa 193446000
Sep 6 19:59:55 floor13 kernel: gvt: vgpu 1: fail: shadow page 00000000c3628cae guest entry 0xffffffffffffffff type 9
Sep 6 19:59:55 floor13 kernel: gvt: vgpu 1: fail: spt 00000000eed2450a guest entry 0xffffffffffffffff type 9
Sep 6 19:59:55 floor13 kernel: gvt: vgpu 1: fail: shadow page 00000000eed2450a guest entry 0xffffffffffffffff type 9.
Sep 6 19:59:55 floor13 kernel: gvt: guest page write error, gpa 193446008
Sep 6 19:59:55 floor13 kernel: gvt: vgpu 1: fail: shadow page 00000000c3628cae guest entry 0xffffffffffffffff type 9
Sep 6 19:59:55 floor13 kernel: gvt: vgpu 1: fail: spt 00000000eed2450a guest entry 0xffffffffffffffff type 9
Sep 6 19:59:55 floor13 kernel: gvt: vgpu 1: fail: shadow page 00000000eed2450a guest entry 0xffffffffffffffff type 9.
Sep 6 19:59:55 floor13 kernel: gvt: guest page write error, gpa 193446010
[...]
After that there is an entry
Sep 6 20:02:37 floor13 kernel: gvt: vgpu 1: GVT doesn't support 1GB entry
followed by three nearly identical stack traces I will attach to this bug report
The result ist that a) the Win10 VM dies and b) the complete KMV host dies.
Many thanks in advance for your time and support! :)
Alex
Attachment 145291, "trace 1":
trace1.txt