gallium/u_blitter: make emulated filter code conformant
This fixes several issues in the blitter shaders.
Note that the purpose of this MR is to provide a conformant reference blit implementation against which other implementations can be validated, such as a compute-shader-based blit or anything else that drivers or hardware can come up with.