Fix CI expectations, skip heavy tests, trim Windows artifact size
A bit of a grab bag of stuff in here. Mark some tests I've seen flaking around on a618 recently. Skip tests which are way too resource-intensive on shared runners. Trim the artifact size on Windows by not uploading the shader cache.