dri: Use __DRI2_THROTTLE_COPYSUBBUFFER for CopySubBuffer
This should bypass the frames-in-flight throttling for glXCopySubBufferMESA, which AFAICT isn't supposed to be tied to end-of-frame the way SwapBuffers is. Found while reviewing !2204 (merged). More details in the commit messages.
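
For reviewers who want the intent in a nutshell, here is a rough standalone sketch of the behaviour this is aiming for. It is not the Mesa code: `flush_reason`, `drawable_state`, `flush_drawable` and `MAX_FRAMES_IN_FLIGHT` are all made-up names for illustration, and the "wait" is just a counter standing in for blocking on a fence. The only point is that a flush tagged as CopySubBuffer skips the frames-in-flight accounting, while a flush tagged as SwapBuffers still goes through it.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the reason passed along with a flush request.
 * These names are illustrative only, not the real DRI interface. */
enum flush_reason {
    FLUSH_REASON_SWAPBUFFER,
    FLUSH_REASON_COPYSUBBUFFER,
    FLUSH_REASON_FLUSHFRONT
};

/* Assumed limit for the sketch; the real limit is driver policy. */
#define MAX_FRAMES_IN_FLIGHT 2

struct drawable_state {
    int frames_in_flight; /* frames queued but not yet retired */
    int waits;            /* how often we would have blocked on a fence */
};

/* Only an end-of-frame flush takes part in frame throttling. */
static bool reason_throttles(enum flush_reason reason)
{
    return reason == FLUSH_REASON_SWAPBUFFER;
}

static void flush_drawable(struct drawable_state *d, enum flush_reason reason)
{
    if (!reason_throttles(reason))
        return; /* e.g. CopySubBuffer: no frame is counted, nothing waits */

    if (d->frames_in_flight >= MAX_FRAMES_IN_FLIGHT) {
        d->waits++;            /* stand-in for waiting on the oldest frame */
        d->frames_in_flight--; /* that frame is now retired */
    }
    d->frames_in_flight++;     /* account for the frame just submitted */
}
```

In this sketch, any number of `FLUSH_REASON_COPYSUBBUFFER` flushes leaves `frames_in_flight` and `waits` untouched, whereas a third `FLUSH_REASON_SWAPBUFFER` flush hits the limit and records one wait.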