tests: systemclock: Stop all stress threads before joining them
This reduces the chance of the main thread getting starved while trying to shut down the test, potentially causing a timeout.
Even on an idle 96-processor system this reduces the duration of the systemclock tests from ~8s to ~3s.