nir, ac/nir: Add workgroup divergence analysis pass and use it for mesh shader output counts
Based on !27680 (merged)
Adds a pass (similar to vertex divergence analysis) which deals with workgroup-divergence (as opposed to the default subgroup-divergence).
Then, use this pass in ac_nir_lower_ngg_ms
to handle the output counts more optimally.