Optimize ser/de for arrays of basic/fixed types
Our Vec implementation is pretty slow, due to the genericity of value handling.
For a rough idea, it takes about 5 s to ser/de a 10 MB vector of bytes (ay) on a pretty fast CPU.
Here is the flamegraph link: https://elmarco.fedorapeople.org/zbus_vec_flamegraph.svg
IMHO, optimizing 'ay' is the most important, since big data blobs are most likely to be transferred in this form.
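
To illustrate the kind of fast path I have in mind, here is a rough sketch, not the actual zbus code. The `Value` enum and the `encode_generic`, `encode_bytes` and `decode_bytes` functions are hypothetical names made up for this example. It assumes little-endian encoding and an array that starts at a 4-byte aligned offset, so 'ay' is just a u32 length followed by the raw bytes. The generic path wraps and dispatches each element individually, while the specialized path copies the whole slice in one go.

```rust
/// Hypothetical stand-in for a generic value type (not the real zbus type).
#[allow(dead_code)]
#[derive(Debug, Clone, PartialEq)]
enum Value {
    U8(u8),
    U16(u16),
}

/// Generic path: each byte is wrapped in a `Value` and dispatched one by one.
/// Only meant to show the per-element overhead for byte arrays.
fn encode_generic(data: &[u8]) -> Vec<u8> {
    let values: Vec<Value> = data.iter().map(|b| Value::U8(*b)).collect();
    let mut out = Vec::new();
    // Array length in bytes, as a little-endian u32 (assumed byte order).
    out.extend_from_slice(&(values.len() as u32).to_le_bytes());
    for v in &values {
        match v {
            Value::U8(b) => out.push(*b),
            Value::U16(n) => out.extend_from_slice(&n.to_le_bytes()),
        }
    }
    out
}

/// Fast path for 'ay': one length prefix, then a single bulk copy.
fn encode_bytes(data: &[u8]) -> Vec<u8> {
    let mut out = Vec::with_capacity(4 + data.len());
    out.extend_from_slice(&(data.len() as u32).to_le_bytes());
    out.extend_from_slice(data);
    out
}

/// Fast path for decoding 'ay': borrow the payload without copying.
/// Returns `None` if the buffer is shorter than the declared length.
fn decode_bytes(buf: &[u8]) -> Option<&[u8]> {
    if buf.len() < 4 {
        return None;
    }
    let len = u32::from_le_bytes([buf[0], buf[1], buf[2], buf[3]]) as usize;
    buf.get(4..).and_then(|rest| rest.get(..len))
}
```

Both encoders produce the same bytes for a byte array, so a specialization like this should be invisible on the wire. How it would hook into the existing Vec code is left open here.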