gallivm: use pmulhrsw to make aos sampling more accurate.
This uses the SSSE3 and AVX2 variants of pmulhrsw, which fixes the precision of the texture filtering calculations.
However, these paths remain inaccurate on platforms that support neither variant.
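
For reference, the per-lane arithmetic of pmulhrsw can be modelled in scalar C as below. This is only an illustrative sketch of the instruction's documented behaviour (a signed 16-bit multiply that rounds to nearest when scaling the product back by 15 bits). The helper name `mulhrs16` is hypothetical and is not part of gallivm.

```c
#include <stdint.h>

/*
 * Hypothetical scalar model of one 16-bit lane of pmulhrsw.
 * Computes ((a * b >> 14) + 1) >> 1, i.e. a Q15 multiply whose
 * result is rounded to nearest rather than truncated.
 * Assumes >> on negative values is an arithmetic shift, which holds
 * on mainstream compilers but is implementation-defined in C.
 */
static int16_t mulhrs16(int16_t a, int16_t b)
{
    int32_t prod = (int32_t)a * (int32_t)b;  /* full 32-bit product */
    return (int16_t)(((prod >> 14) + 1) >> 1);
}
```

With Q15 weights, `mulhrs16(16384, 16384)` (0.5 * 0.5) gives 8192 (0.25), and `mulhrs16(1, 16384)` gives 1, where a plain 15-bit truncating shift of the product would give 0.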
Edited by Dave Airlie