drm/i915: Idle the GPU before shinking everything
authorChris Wilson <chris@chris-wilson.co.uk>
Wed, 8 Nov 2017 09:44:00 +0000 (09:44 +0000)
committerJani Nikula <jani.nikula@intel.com>
Thu, 9 Nov 2017 14:18:31 +0000 (16:18 +0200)
The handling of contexts are peculiar. Instead of tieing their vma to
activity, we pin the context. This means that we cannot simply unbind
the context object itself at will (which would normally cause us to wait
for the vma to be idle), but must manually idle the GPU and retire
requests first.

A consequence of this peculiarity is when doing a last desperate attempt
to recover memory. If the memory is tied up inside active context
objects, we will fail to recover any memory simply by trying to unbind
the objects without first doing a wait-for-idle.

A side-effect of removing the call to shrinker_lock_uninterruptible()
from i915_gem_shrinker_oom() was that we removed an unlocked
wait-for-idle, and so lost the "natural" shrinkage of context objects.
By replacing that with a locked wait from inside i915_gem_shrink(), we
not only replace it with the ability to recover all context objects, but
do so for all i915_gem_shrink_all() callers.

v2: Switching requires request allocation, which is not permitted from
inside the shrinker as it only uses ordinary allocations.

References: https://bugs.freedesktop.org/show_bug.cgi?id=102936
Fixes: f2123818ffad ("drm/i915: Move dev_priv->mm.[un]bound_list to its own lock")
Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
Cc: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
Link: https://patchwork.freedesktop.org/patch/msgid/20171108094400.1386-1-chris@chris-wilson.co.uk
Reviewed-by: Joonas Lahtinen <joonas.lahtinen@linux.intel.com>
(cherry picked from commit 2f6a3783833dde63f1c08982943a8b2229b97afb)
Signed-off-by: Jani Nikula <jani.nikula@intel.com>
drivers/gpu/drm/i915/i915_gem_shrinker.c

index eb31f8aa5c216752f3c0e6c11e6ce56a3ee32a55..3770e3323fc8e932b63fc94e7a0353a0d1d6784c 100644 (file)
@@ -162,6 +162,18 @@ i915_gem_shrink(struct drm_i915_private *dev_priv,
        if (!shrinker_lock(dev_priv, &unlock))
                return 0;
 
+       /*
+        * When shrinking the active list, also consider active contexts.
+        * Active contexts are pinned until they are retired, and so can
+        * not be simply unbound to retire and unpin their pages. To shrink
+        * the contexts, we must wait until the gpu is idle.
+        *
+        * We don't care about errors here; if we cannot wait upon the GPU,
+        * we will free as much as we can and hope to get a second chance.
+        */
+       if (flags & I915_SHRINK_ACTIVE)
+               i915_gem_wait_for_idle(dev_priv, I915_WAIT_LOCKED);
+
        trace_i915_gem_shrink(dev_priv, target, flags);
        i915_gem_retire_requests(dev_priv);