ci: fix out of date llvmpipe & lavapipe fail lists, and add a nightly full run of each
Stress-tested 10 times each; the only flakiness is a test that takes around 58sec to run, and sometimes goes just over the 60sec timeout, but everything else is surprisingly stable