radv: Use an LDS stack for ray queries when possible

Quake II RTX: 56fps -> 60fps
NVIDIA AO sample: 1ms -> 0.8ms

This would benefit from !16593 (merged): q2rtx uses multiple ray queries per shader, which results in inefficient LDS use.

