Draft: radv: lower subgroups after first optimizations
Without constant folding the cts tests for shuffleXor(a, const) don't actually trigger our optimizations. We need another optimization run between subgroup and explicit io lowering though to avoid LDS size regressions.