aws: transcriber: add support for language identification (!1518) · Merge requests · GStreamer / gst-plugins-rs

François Laignel requested to merge fengalin/gst-plugins-rs:aws-transcribe-auto-lang into main Mar 25, 2024

This commit adds support for language identification to the transcriber element and makes use of the identified language in the translation pad.

Language identification is activated with either of the following properties (which match the service API):

'identify-language' when a single language is expected in the stream.
'identify-multiple-languages' otherwise.

In both cases, the property 'language-options' must list the possible languages, e.g. "en-US,es-US,fr-FR".

The following pipeline identifies languages from a stream possibly containing multiple languages, outputs the transcription to the 'src' pad and translates when needed to French ('translate_src_0') & English ('translate_src_1'):

gst-launch-1.0 -e uridecodebin uri=file:///__PATH_TO_FILE__ ! audioconvert \
  ! awstranscriber name=t \
      access-key="__TO_BE_DEFINED__" secret-access-key="__TO_BE_DEFINED__" \
      identify-multiple-languages=true \
      language-options="en-US,es-US,fr-FR" \
      translate_src_0::language-code=fr \
      translate_src_1::language-code=en \
        t. ! fakesink dump=true \
        t.translate_src_0 ! fakesink dump=true \
        t.translate_src_1 ! fakesink dump=true
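
For the single-language case, the same pipeline can presumably be reused with 'identify-language' in place of 'identify-multiple-languages'. The sketch below only prints such a command line rather than running it. The helper name `build_single_language_cmd` is hypothetical and not part of GStreamer, and the single French translation pad is an assumption chosen for illustration:

```shell
#!/bin/sh
# Sketch only: prints a gst-launch-1.0 command line for the single-language case.
# build_single_language_cmd is a hypothetical helper, not part of GStreamer.
build_single_language_cmd() {
    uri="$1"        # media URI, e.g. file:///__PATH_TO_FILE__
    languages="$2"  # comma-separated candidates, e.g. en-US,es-US,fr-FR
    # Backslash-newline inside double quotes is a line continuation,
    # so the command is printed on a single line.
    printf '%s\n' "gst-launch-1.0 -e uridecodebin uri=$uri ! audioconvert \
! awstranscriber name=t \
access-key=\"__TO_BE_DEFINED__\" secret-access-key=\"__TO_BE_DEFINED__\" \
identify-language=true \
language-options=\"$languages\" \
translate_src_0::language-code=fr \
t. ! fakesink dump=true \
t.translate_src_0 ! fakesink dump=true"
}

build_single_language_cmd "file:///__PATH_TO_FILE__" "en-US,es-US,fr-FR"
```

Once the placeholders are filled in, the printed line can be pasted into a terminal. Only the property names and pad naming come from the description above; the rest is illustrative.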

Depends on !1521 (merged)

Edited Mar 27, 2024 by François Laignel
