turnip: Improve 2D buffer-to-image copies for A7XX
A7XX supports buffer-to-images copies with a lower alignment requirement for the pitch and start VA, this makes it unnecessary to loop over every row and copy them individually for any previously unaligned images. The new alignment requirements match Vulkan requirements and should cover all cases that aren't handled by 3D copies.
This can result in a significant performance improvement for these unaligned copies, up to 10x or more in some cases according to GPU time profiling of dEQP-VK.api.copy_and_blit.dedicated_allocation.buffer_to_image.2d_images.*
VK-CTS tests with a larger image size.