aco: third round of GFX6 fixes
Should be mostly done at this point, but I still have some random GPU hangs when running CTS with deqp-runner. Apparently, no hangs when running with the (slow) CTS runner. I will investigate.
This MR implements all global operations (remember no FLAT on GFX6) and all 64-bit operations like floor etc.