Rusticl: reduce kernel launch overhead
This series intends to reduce the kernel launch overhead by creating the cso on kernel creation time. Currently the interface requires information we only have available later, so this needs a bigger rework and clean up.