Race with shared PDEs across multiple VM bind engines
Let me give a super simple example of a corruption and how it could occur.
- In bind engine 1, a 4k bind (A) is done at 0x1000 and some time later it is unbound (B)
- In bind engine 2, after B has been processed by the driver, a 4k bind (C) to 0x2000 is submitted, which creates a new set of PDEs.
- C's job completes before B's job is run, as B's job is waiting on fences while C's is not. At this point we are pointing to C's set of PDEs.
- B's job is run and all of C's PDEs are set to NULL, but C's PTE is still present
- At this point we are broken
The key here is that the PDEs are shared between B & C, and B's & C's jobs can execute out of order relative to the order in which XE received these operations.
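To make the sharing concrete, here is a tiny user-space sketch (nothing Xe-specific; it just assumes a 4-level layout with 4k pages and 512 entries per level) that prints the per-level indices for 0x1000 and 0x2000. Only the leaf PTE index differs, so both VAs hang off the exact same chain of PDEs, which is why B zapping "its" PDEs takes C's mapping with it.

/*
 * Illustrative only: per-level page table indices for two nearby VAs,
 * assuming 4k pages and 512 entries per level.
 */
#include <stdio.h>
#include <stdint.h>

int main(void)
{
        uint64_t vas[] = { 0x1000, 0x2000 };

        for (int i = 0; i < 2; i++) {
                uint64_t va = vas[i];

                printf("va=0x%llx pte=%llu pde=%llu pdpe=%llu pml4e=%llu\n",
                       (unsigned long long)va,
                       (unsigned long long)((va >> 12) & 0x1ff),   /* leaf PTE index  */
                       (unsigned long long)((va >> 21) & 0x1ff),   /* PDE index       */
                       (unsigned long long)((va >> 30) & 0x1ff),   /* PDPE index      */
                       (unsigned long long)((va >> 39) & 0x1ff));  /* top-level index */
        }
        return 0;
}

Both VAs print 0 for everything above the PTE level; only pte=1 vs pte=2 differs.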
Another race that could possibly occur is with back-to-back binds. Let me give another example:
- In bind engine 1, a 4k bind A is done to 0x1000
- In bind engine 2, a 4k bind B is done to 0x2000
- B's job is run before A's job as A is waiting on fences while B is not
- B doesn't program the PDEs as it expects those entries to already be populated (by A)
- Another job comes along and tries to use address 0x2000; now we fault
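Here is a minimal user-space model of that race (purely illustrative, all names hypothetical, not actual Xe code): the decision to skip the PDE writes is made from CPU-side bookkeeping at submit time, but the entries only exist once the earlier job has actually run, so executing B before A leaves a NULL PDE behind.

/* Toy model: CPU-side "populated" tracking vs. when the job actually runs. */
#include <stdio.h>
#include <stdint.h>
#include <stdbool.h>

#define PT_ENTRIES 512

static uint64_t pd[PT_ENTRIES];         /* "hardware" page directory */
static uint64_t pt[PT_ENTRIES];         /* "hardware" page table     */
static bool pde_populated;              /* CPU-side bookkeeping      */

struct job {
        bool write_pde;                 /* emit PDE write in this job? */
        uint64_t va;
};

static struct job submit_bind(uint64_t va)
{
        struct job job = { .write_pde = !pde_populated, .va = va };

        pde_populated = true;           /* decided at submit time */
        return job;
}

static void run_job(struct job *job)
{
        if (job->write_pde)
                pd[(job->va >> 21) & 0x1ff] = (uint64_t)(uintptr_t)pt;
        pt[(job->va >> 12) & 0x1ff] = 1;        /* "valid" PTE */
}

static void access_va(uint64_t va)
{
        if (!pd[(va >> 21) & 0x1ff])
                printf("fault at 0x%lx: PDE not programmed\n", (unsigned long)va);
        else
                printf("0x%lx resolves fine\n", (unsigned long)va);
}

int main(void)
{
        struct job a = submit_bind(0x1000);     /* bind A, waits on fences */
        struct job b = submit_bind(0x2000);     /* bind B, no fences       */

        run_job(&b);            /* B runs first: skips the PDE write */
        access_va(0x2000);      /* faults, PDE still NULL            */
        run_job(&a);            /* A eventually runs and fixes it up */
        access_va(0x2000);
        return 0;
}

Running this prints a fault for 0x2000 until A's job finally executes.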
To fix this, roughly what I'm thinking:
- We program all page table entries from the root to the PTEs on each bind, regardless of whether we think the entries are populated (seals the 2nd race; see the sketch after this list)
- Unbind jobs prune VAs at the PTE level only (avoids clobbering shared PDEs while still effectively removing the VA)
- Unbind jobs have a completion callback which drops refs to the page table BOs and creates a new job (if necessary) to prune these BOs from the page tables (it is safe to write shared PDEs at this point). All of this is done under a lock so it doesn't race with new binds; these 'prune jobs' have no in-fences so they run immediately
- All new binds are ordered with any pending 'prune jobs'
- Alternatively, for the 'prune jobs' we just do the pruning on the CPU under the lock (maybe this doesn't work because of caching in the device and other GPU bind ops touching the same cacheline???)
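Rough toy model of the first two rules (again purely illustrative, names hypothetical, not actual Xe code): binds write the PDE chain unconditionally, unbinds only zap the PTE, and clearing the now-unused PDE is left to the later, serialized prune step. Replaying the first example in this model leaves C's mapping intact regardless of the order B and C execute in.

/* Toy model of "always program all levels" + "unbind prunes PTEs only". */
#include <stdio.h>
#include <stdint.h>

#define PT_ENTRIES 512

static uint64_t pd[PT_ENTRIES];
static uint64_t pt[PT_ENTRIES];

static void bind_job(uint64_t va)
{
        /*
         * Always (re)program the shared PDE, even if we believe it is
         * already populated: redundant writes are harmless and make the
         * result independent of inter-engine ordering.
         */
        pd[(va >> 21) & 0x1ff] = (uint64_t)(uintptr_t)pt;
        pt[(va >> 12) & 0x1ff] = 1;             /* "valid" PTE */
}

static void unbind_job(uint64_t va)
{
        /*
         * Prune at the PTE level only; never touch the shared PDEs here.
         * A later prune job, run under the VM lock and ordered against
         * new binds, would clear the PDE once the page table is empty.
         */
        pt[(va >> 12) & 0x1ff] = 0;
}

static void access_va(uint64_t va)
{
        if (!pd[(va >> 21) & 0x1ff] || !pt[(va >> 12) & 0x1ff])
                printf("0x%lx not mapped\n", (unsigned long)va);
        else
                printf("0x%lx resolves fine\n", (unsigned long)va);
}

int main(void)
{
        /* Replay the first race: bind A, bind C, then the unbind B runs last. */
        bind_job(0x1000);       /* A */
        bind_job(0x2000);       /* C */
        unbind_job(0x1000);     /* B, executes after C */
        access_va(0x2000);      /* C's mapping survives */
        return 0;
}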