Transcribe audio into text

Supported in: Batch

Transcribes an audio file into text.

Expression categories: Media

Declared arguments

Media reference - The column containing media references to audio files in the media sets.
Expression<Media reference>
optional Language - The language to detect in the input file. If no language is provided, it will be inferred from the first 30 seconds of audio.
Enum<Afrikaans, Albanian, Amharic, Arabic, Armenian, Assamese, Azerbaijani, Bashkir, Basque, Belarusian, and more ...>
optional Output mode - Choose between a simple output of the specified type or a struct containing both the output and an error field.
Enum<Simple, With errors>
optional Performance mode - The performance mode to use when running transcription. If no mode is provided, we will default to the more economical option.
Enum<More economical, More performant>

Output type: String | Struct<ok, error>

Description: Transcribe the audio file Argument values:

mediaReference	Output
{"mimeType":"audio/mpeg","reference":{"type":"mediaSetItem","mediaSetItem":{"mediaSetRid":"ri.mio.main.media-set.a", "mediaItemRid":"ri.mio.main.media-item.a"}}}	This is an example transcription from Whisper

Argument values:

mediaReference	Output
null	null