r600/sfn: cleanups and small optimizations
- Fix some memory issues
- Fix peephole optimizations with registers
- Schedule KILL instructions according to the documentation
- Add peephole optimizations for conditional kill ops
- Unroll loops more aggressively