Due to an influx of spam, we have had to impose restrictions on new accounts. Please see this wiki page for instructions on how to get full permissions. Sorry for the inconvenience.
Admin message
The migration is almost done, at least the rest should happen in the background. There are still a few technical difference between the old cluster and the new ones, and they are summarized in this issue. Please pay attention to the TL:DR at the end of the comment.
The test is running as expected and passing, but since the injected failure also triggers a dump_stack() in the kernel, igt_runner incorrectly believes this to be a real dmesg error and flags the test as "dmesg_warn."
We can avoid this by explicitly setting the verbosity setting for the gt reset injection interface to "1" (instead of leaving it at the default "2") --- the general fault injection and wedged messages will still be printed (which the test can properly tell igt_runner to ignore), but the unnecessary stack trace will be skipped.
This is a good patch, you have my r-b for it here until I manage to revive my machine
so you can add:
Reviewed-by: Kamil Konieczny kamil.konieczny@linux.intel.com
Equivalent query: runconfig_tag IS IN ["xe"] AND machine_tag IS IN ["DG2", "PVC", "LNL", "BMG", "ADL-P"] AND ((testsuite_name = "IGT" AND test_name IS IN ["igt@xe_wedged@basic-wedged"])) AND ((testsuite_name = "IGT" AND status_name IS IN ["dmesg-warn"])) AND dmesg ~= '\*ERROR\* GT0: reset failed \(-ECANCELED\)'
It seems like the reset failure errors are being correctly masked, so we aren't hitting this issue per se. Instead, it seems like the CI filter is picking up a group of other warnings and is associating them with this issue. Namely:
<3> [557.431647] xe 0000:00:02.0: [drm] ERROR AUX B/DDI B/PHY B: did not complete or timeout within 10ms (status 0xad40023f)
Whether this is a correct association or not is debatable, but the observed error above can also be masked if needed.
The recent BMG failures have a different, complaining about AUDIO_PLAYBACK use count. Those are already being tracked by #3468 (closed).
The original issue here does seem to be gone, so re-closing. The continuing failures noted above should hopefully get captured by the proper tickets once this one gets out of the way and stops grabbing the results first.