nir: lower tg4 for broadcom
V3D returns tg4 samples in a different order, so we need to swizzle things around. Doing it in NIR was easy and avoids some mess in the backend (which uses nir_ssa_def_components_read() and would need to reswizzle that as well)