gallium: add pipe_context::multi_draw, merge draws in u_threaded_context, radeonsi support
This optimizes for the common case of consecutive draw calls with no state changes between them.
u_threaded_context is the best place for it. The reason is explained in the code comments.
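
For illustration only, the merging idea can be sketched as below. This is not the code from this commit: every struct and function name here (draw_state, draw_range, multi_draw_call, batch, batch_add_draw, batch_add_state_change) is a hypothetical stand-in, and the sketch merges at enqueue time purely for simplicity. Where the real merging happens, and why, is covered by the code comments mentioned above.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

#define MAX_MERGED_DRAWS 64 /* hypothetical cap on draws merged into one call */
#define MAX_CALLS 16        /* hypothetical batch size */

/* Hypothetical stand-in for the draw parameters that must match to merge. */
struct draw_state {
   unsigned mode;           /* primitive type */
   unsigned index_size;     /* 0 = non-indexed */
   unsigned instance_count;
};

/* One (start, count) pair per original draw call. */
struct draw_range {
   unsigned start;
   unsigned count;
};

/* A single queued call that carries one or more merged draws. */
struct multi_draw_call {
   struct draw_state state;
   struct draw_range draws[MAX_MERGED_DRAWS];
   unsigned num_draws;
};

struct batch {
   struct multi_draw_call calls[MAX_CALLS];
   unsigned num_calls;
   bool last_is_draw; /* false once any non-draw call has been queued */
};

static bool same_state(const struct draw_state *a, const struct draw_state *b)
{
   return a->mode == b->mode && a->index_size == b->index_size &&
          a->instance_count == b->instance_count;
}

/* Any state change between draws breaks the merge chain. */
static void batch_add_state_change(struct batch *b)
{
   b->last_is_draw = false;
}

static void batch_add_draw(struct batch *b, const struct draw_state *state,
                           unsigned start, unsigned count)
{
   if (b->last_is_draw) {
      struct multi_draw_call *last = &b->calls[b->num_calls - 1];

      /* Consecutive draw with identical state: append to the previous call. */
      if (same_state(&last->state, state) &&
          last->num_draws < MAX_MERGED_DRAWS) {
         last->draws[last->num_draws].start = start;
         last->draws[last->num_draws].count = count;
         last->num_draws++;
         return;
      }
   }

   /* Otherwise start a new call holding a single draw. */
   assert(b->num_calls < MAX_CALLS);
   struct multi_draw_call *call = &b->calls[b->num_calls++];
   call->state = *state;
   call->draws[0].start = start;
   call->draws[0].count = count;
   call->num_draws = 1;
   b->last_is_draw = true;
}
```

In this sketch, three back-to-back draws with the same state end up as one queued call with num_draws == 3, which a driver could then submit through a single multi_draw-style entry point. A draw that follows a state change starts a new call.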