gallivm/nir: Fix scalar load and broadcast logic, speed up deqp-vk by doing it more places (!14999) · Merge requests · Mesa / mesa

Emma Anholt requested to merge anholt/mesa:lp-loops into main Feb 11, 2022

Ignore the first commit here and look at !14994 (merged) for that.

For the rest of this: Clean up some gallivm logic for handling "my memory access offset is uniform and I could just read a scalar and broadcast it", then extend it to some more memory accesses. The payoff is reducing runtime of one of our slowest VK tests by 24.4002% +/- 1.94375% (n=7).

Be extra suspicious of a57cd6e0 -- I've tried to do my best to figure out what all the masks are that contribute, but I may have missed something.

Edited Feb 12, 2022 by Emma Anholt

gallivm/nir: Fix scalar load and broadcast logic, speed up deqp-vk by doing it more places

Merge request reports