intel/compiler/mesh: use U888X packed index format
This change reduces the required MUE size and number of send messages, and with other optimizations (especially !20050 (merged)) it makes mesh shaders much faster.
This MR replaces MR !15235 (closed).