amd/ci: reduce Renoir's concurrency to 16

Martin Roukala requested to merge mupuf/mesa:renoir_concurrency into main

It seems that when we increased the number of tests per shard, we started over-committing the Renoir runner: load averages rose above what its 16 CPU threads could handle, and memory usage sat at 75-96%.

Dropping the concurrency to 16, one job per CPU thread, should reduce this memory usage while also reducing the execution time.
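
For anyone wanting to check a runner for the same symptom, here is a minimal sketch of comparing the load average against the number of CPU threads. It is not part of this change; the helper names (`overcommit_ratio`, `is_overcommitted`) and the 1.0 threshold are assumptions made for illustration only.

```python
import os


def overcommit_ratio(load_avg: float, cpu_threads: int) -> float:
    """Return the load average divided by the number of CPU threads.

    A value above 1.0 suggests more runnable tasks than threads.
    """
    if cpu_threads <= 0:
        raise ValueError("cpu_threads must be positive")
    return load_avg / cpu_threads


def is_overcommitted(load_avg: float, cpu_threads: int, threshold: float = 1.0) -> bool:
    """Hypothetical check: True when the ratio exceeds the threshold."""
    return overcommit_ratio(load_avg, cpu_threads) > threshold


if __name__ == "__main__":
    # os.getloadavg() is only available on Unix-like systems.
    one_minute, _, _ = os.getloadavg()
    threads = os.cpu_count() or 1
    ratio = overcommit_ratio(one_minute, threads)
    print(f"load={one_minute:.2f} threads={threads} ratio={ratio:.2f}")
```

With an illustrative (not measured) load average of 24 on a 16-thread machine, the ratio would be 1.5, which this sketch would flag as over-committed.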

Signed-off-by: Martin Roukala (né Peres)

