radv: Use 128-sized vertex grouping for NGG shaders.

Timur Kristóf requested to merge Venemo/mesa:radv-ngg-vertexgrouping into main

This matches what RadeonSI also does. It seems to improve performance especially with NGG culling shaders.

Eg. in Doom Eternal this gives me +5~10 fps.

Merge request reports