freedreno: Reduce overhead of fd_resource_set_usage()
Taking the lock per UBO being updated is expensive, and we expect any BO to be used in the same way many times, so add an early out if it's already been flagged.
Improves nohw drawoverhead 8-ubos update throughput by 13.493% +/- 0.391444% (n=15).