Due to an influx of spam, we have had to impose restrictions on new accounts. Please see this wiki page for instructions on how to get full permissions. Sorry for the inconvenience.
Admin message
Equinix is shutting down its operations with us on April 30, 2025. They have graciously supported us for almost 5 years, but all good things come to an end. We are expecting to transition to new infrastructure between late March and mid-April. We do not yet have a firm timeline for this, but it will involve (probably multiple) periods of downtime as we move our services whilst also changing them to be faster and more responsive. Any updates will be posted in freedesktop/freedesktop#2011 as it becomes clear, and any downtime will be announced with further broadcast messages.
[CI][DRMTIP] igt@kms_cursor_legacy@pipe - cleanup_done timed out
IGT-Version: 1.22-g19922005 (x86_64) (Linux: 4.18.0-rc3-CI-CI_DRM_4422+ x86_64)
Total updates 53012 (median of 4 processes is 13250.50)
Subtest pipe-B-forked-move: SUCCESS (21.686s)
Test requirement not met in function __real_main1358, file ../tests/kms_cursor_legacy.c:1378:
Test requirement: !(n >= display.n_pipes)
dmesg
[ 431.073104] [drm:drm_atomic_helper_setup_commit] ERROR [CRTC:65:pipe B] cleanup_done timed out
To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Child items
...
Show closed items
Linked items
0
Link issues together to show that they're related.
Learn more.
commit 41db645a33e775855aeeec1a437d5c1e24ff6c88 (HEAD -> drm-intel-next-queued, drm-intel/drm-intel-next-queued)
Author: Chris Wilson chris@chris-wilson.co.uk
Date: Thu Jul 12 12:57:29 2018 +0100
drm/i915: Bump priority of clean up work
We require that we keep the list of outstanding work short so that we do
not "leak" memory while pageflipping under stress. However that system
stress may delay kernel workers virtually indefinitely, which incurs the
pageflips stall and eventually hit a timeout waiting for the cleanup.
Try to combat CPU starvation of our short-lived cleanup workers by
switching to a high priority workqueue.
commit 41db645a33e775855aeeec1a437d5c1e24ff6c88 (HEAD ->
drm-intel-next-queued, drm-intel/drm-intel-next-queued)
Author: Chris Wilson chris@chris-wilson.co.uk
Date: Thu Jul 12 12:57:29 2018 +0100
drm/i915: Bump priority of clean up work
We require that we keep the list of outstanding work short so that we do
not "leak" memory while pageflipping under stress. However that system
stress may delay kernel workers virtually indefinitely, which incurs the
pageflips stall and eventually hit a timeout waiting for the cleanup.
Try to combat CPU starvation of our short-lived cleanup workers by
switching to a high priority workqueue.
The failure rate of this issue is rather low: once every 50.8 runs, across all the platforms. This was calculated by just looking at the shards results.
It looks like the issue has been fixed, as it has not been seen since CI_DRM_5670 (238 runs ago). However, the reproduction rate has fluctated a lot throughout the history of the bug, so it seems very timing-sensitive. We'll wait until CI_DRM_6178 to verify that this is indeed fixed and not that we just have been lucky!
The test is spawning $nproc x 2 children: half of them are continuously updating the cursor while the other half is hogging the CPUs. A set of both types gets pinned on each available CPU. The test has no asserts and just tries to see if anything blows up.
Based on the error, it seems like the issue is that the cleanup callback is not called fast-enough after a flip happens, which may lead to memory leaks. Since the reproduction rate of this issue is extremely low even with such a stress test, this should not have any significant user impact.
Let's see if we can reduce the occurrence rate of this issue even more, then close it as NOTABUG because Linux is not an RTOS and we cannot guarantee any timings.
Note that this test is based on a real bug report (long ago) about flip workers being starved leading to further mempressure causing more system slowdown. We have to abuse the system a lot to make it even likely to happen under test conditions; but users, users have a magic all of their own.
"Cleanup_done timed out" error not seen on latest DRM_9390 for 'igt@kms_cursor_legacy@pipe*' test cases. Hence closing the ticket as error signature not seen.