Skip to content
This repository was archived by the owner on Oct 3, 2024. It is now read-only.
This repository was archived by the owner on Oct 3, 2024. It is now read-only.

Error causing all kvm gvt guests to freeze #153

@reedog117

Description

@reedog117

This error happens sporadically but seems to happen more often with more VGPU load or multiple VMs running gvt.

This seems like an error with ppgtt_populate_spt

Jun  2 11:42:04 virt-slc-90 kernel: [  943.161714] WARNING: CPU: 0 PID: 27
42 at drivers/gpu/drm/i915/gvt/gtt.c:683 ppgtt_populate_spt+0x1f1/0x3a0 [i
915]

Proxmox 6.2-4 - kernel 5.4.41-1-pve
Coffee Lake ER - Xeon E-2276G
Windows 10 Guest - Newest available DCH drivers but have tried the last 3 versions

From the kernel log:

Jun  2 11:42:04 virt-slc-90 kernel: [  943.157228] gvt: guest page write error, gpa 13e8cacf8
Jun  2 11:42:04 virt-slc-90 kernel: [  943.158203] gvt: vgpu 2: fail: shadow page 000000008eed785e guest entry 0x7d3408cc24763883 type 9.
Jun  2 11:42:04 virt-slc-90 kernel: [  943.158204] gvt: guest page write error, gpa 1409c9bb0
Jun  2 11:42:04 virt-slc-90 kernel: [  943.158656] gvt: vgpu 1: fail: shadow page 000000001ee6d741 guest entry 0xffffffffffffffff type 9
Jun  2 11:42:04 virt-slc-90 kernel: [  943.159188] ------------[ cut here ]------------
Jun  2 11:42:04 virt-slc-90 kernel: [  943.159702] gvt: vgpu 1: fail: spt 
00000000dd87fe57 guest entry 0xffffffffffffffff type 9
Jun  2 11:42:04 virt-slc-90 kernel: [  943.159703] gvt: vgpu 1: fail: shadow page 00000000dd87fe57 guest entry 0xffffffffffffffff type 9.
Jun  2 11:42:04 virt-slc-90 kernel: [  943.159704] gvt: guest page write error, gpa 13e8cad00
Jun  2 11:42:04 virt-slc-90 kernel: [  943.160713] invalid entry type
Jun  2 11:42:04 virt-slc-90 kernel: [  943.161207] gvt: vgpu 1: fail: shadow page 000000001ee6d741 guest entry 0xffffffffffffffff type 9
Jun  2 11:42:04 virt-slc-90 kernel: [  943.161714] WARNING: CPU: 0 PID: 2742 at drivers/gpu/drm/i915/gvt/gtt.c:683 ppgtt_populate_spt+0x1f1/0x3a0 [i915]
Jun  2 11:42:04 virt-slc-90 kernel: [  943.162152] gvt: vgpu 1: fail: spt 00000000dd87fe57 guest entry 0xffffffffffffffff type 9
Jun  2 11:42:04 virt-slc-90 kernel: [  943.162153] gvt: vgpu 1: fail: shadow page 00000000dd87fe57 guest entry 0xffffffffffffffff type 9.
Jun  2 11:42:04 virt-slc-90 kernel: [  943.162598] Modules linked in: xt_multiport(E) ipt_REJECT(E) nf_reject_ipv4(E) ebtable_filter(E) ebtables(E) 
ip_set(E) ip6table_raw(E) iptable_raw(E) ip6table_filter(E) ip6_tables(E) 
sctp(E) iptable_filter(E) vxlan(E) ip6_udp_tunnel(E) udp_tunnel(E) iptable_nat(E) xt_MASQUERADE(E) bpfilter(E) bonding(E) openvswitch(E) nsh(E) nf_c
onncount(E) nf_nat(E) softdog(E) nfnetlink_log(E) nfnetlink(E) zfs(POE) zunicode(POE) zlua(POE) zavl(POE) icp(POE) ipmi_ssif(E) zcommon(POE) znvpair(POE) spl(OE) vhost_net(E) vhost(E) tap(E) nf_conntrack_pptp(E) nf_conntrack(E) nf_defrag_ipv6(E) nf_defrag_ipv4(E) ib_iser(E) rdma_cm(E) iw_cm(E) ib_cm(E) ib_core(E) iscsi_tcp(E) libiscsi_tcp(E) libiscsi(E) scsi_transport_iscsi(E) vfio_pci(E) vfio_virqfd(E) kvmgt(E) intel_rapl_msr(E) intel_rapl_common(E) x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) kvm_intel(E) crct10dif_pclmul(E) crc32_pclmul(E) ghash_clmulni_intel(E) aesni_int
el(E) crypto_simd(E) cdc_ether(E) cryptd(E) glue_helper(E) ast(E) joydev(E)
Jun  2 11:42:04 virt-slc-90 kernel: [  943.162613]  usbnet(E)
Jun  2 11:42:04 virt-slc-90 kernel: [  943.163561] gvt: guest page write error, gpa 13e8cad08
Jun  2 11:42:04 virt-slc-90 kernel: [  943.164533]  intel_cstate(E) input_leds(E) drm_vram_helper(E) mii(E) intel_rapl_perf(E) wmi_bmof(E) pcspkr(E)
 ttm(E) 8250_dw(E) mei_me(E) mei(E) intel_pch_thermal(E) ie31200_edac(E) ipmi_si(E) ipmi_devintf(E) ipmi_msghandler(E) mac_hid(E) acpi_power_meter(E) acpi_pad(E) acpi_tad(E) i915(E) vfio_mdev(E) mdev(E) vfio_iommu_type1(E) vfio(E) drm_kms_helper(E) drm(E) fb_sys_fops(E) syscopyarea(E) sysfillrect(E) sysimgblt(E) kvm(E) irqbypass(E) sunrpc(E) ip_tables(E) x_tables(E) autofs4(E) hid_generic(E) usbkbd(E) usbmouse(E) usbhid(E) hid(E) btrfs(E) xor(E) zstd_compress(E) uas(E) usb_storage(E) raid6_pq(E) libcrc32c(E) igb(E) i2c_i801(E) ahci(E) intel_lpss_pci(E) xhci_pci(E) i2c_algo_bit(E) intel
_lpss(E) dca(E) libahci(E) idma64(E) xhci_hcd(E) virt_dma(E) wmi(E) video(E) pinctrl_cannonlake(E) pinctrl_intel(E)
Jun  2 11:42:04 virt-slc-90 kernel: [  943.164605] CPU: 0 PID: 2742 Comm: 
kvm Tainted: P           OE     5.4.41-1-pve #1
Jun  2 11:42:04 virt-slc-90 kernel: [  943.164606] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./E3C246D4U, BIOS P2.30 12/25/2019
Jun  2 11:42:04 virt-slc-90 kernel: [  943.165680] gvt: vgpu 1: fail: shadow page 000000001ee6d741 guest entry 0xffffffffffffffff type 9
Jun  2 11:42:04 virt-slc-90 kernel: [  943.166623] RIP: 0010:ppgtt_populate_spt+0x1f1/0x3a0 [i915]

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions