executor: generate more precise runner tags
As our gateways start to become less and less homogenous in performance, we really want to start selecting different machines for different jobs.
Rather than hardcoding the exact runner, let's use tags to express the architecture, and classes of CPUs/RAM (8+ cores, 32+ cores, ...). This allows selecting fast machines without having to hardcode to a particular runner.