fixup! microsoft/compiler: Lower shared loads/stores
I forgot to remove the check that prevented shared memory access lowering when the access was a scalar 32-bit access. While this optimization made sense when we were using non-DXIL variants for loads/non-masked stores, it became a problem when we switched all accesses to DXIL-specific intrinsics.