ndi: don't accumulate meta with audio only streams
Currently, only closed caption metadata is supported. When the next video frame is received, pending metadata is dequeued and parsed. If closed captions are found, they are attached to the video frame.
For audio-only streams, it doesn't make sense to enqueue metadata: it would
accumulate in pending_metadata and never be dequeued.
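
The behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the plugin's actual code: the `Receiver` type, the `has_video` flag, and the `on_metadata` / `on_video_frame` methods are hypothetical names, and metadata is modelled as plain strings. Only `pending_metadata` is taken from the description.

```rust
use std::collections::VecDeque;

/// Hypothetical stand-in for the receiver state (not the plugin's real type).
struct Receiver {
    /// Assumption: the receiver knows whether it will ever get video frames.
    has_video: bool,
    /// Metadata waiting to be attached to the next video frame.
    pending_metadata: VecDeque<String>,
}

impl Receiver {
    fn new(has_video: bool) -> Self {
        Self {
            has_video,
            pending_metadata: VecDeque::new(),
        }
    }

    /// Queue metadata only when a video frame can later consume it.
    fn on_metadata(&mut self, meta: String) {
        if self.has_video {
            self.pending_metadata.push_back(meta);
        }
        // Audio-only: the metadata is dropped, since nothing would drain the queue.
    }

    /// Drain everything queued so far when a video frame arrives.
    fn on_video_frame(&mut self) -> Vec<String> {
        self.pending_metadata.drain(..).collect()
    }
}
```

With this shape, an audio-only receiver keeps an empty queue no matter how much metadata arrives, while a receiver with video empties the queue on every frame.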
Fixes #460