pan/midgard: Blend shader optimization parity
All of #2069 (closed) #2068 (closed) #2067 (closed) #2066 (closed) #2065 (closed) are needed, then switching blend shaders to fp16. At that point we ought to be matching the blob in instruction count, all else being equal.