i965: Implement ARB_compute_variable_group_size.
This patch adds the implementation of ARB_compute_variable_group_size for i965. We do this by storing the local group size in a push constant. Signed-off-by: Plamena Manolova <plamena.manolova@intel.com>