brw: Minor improvements for 16-bit values
I was working on using 16-bit addressing for shared local memory and ran into a pile of code quality regressions. Investigating some of those led me to general improvements for a few 16-bit cases, which are what this MR contains.
Not a whole lot is affected, but there were some pretty nice improvements for parallel-rdp in fossil-db.