This guide walks through how to transcribe audio in Foundry using media sets.
First, you should import your audio files as media sets. There are two ways to do this:
Once imported, you will be able to view your audio media set.
Create a new pipeline in Pipeline Builder. Detailed steps can be found in the initial set up section of the Pipeline Builder documentation.
Add your audio media set to the pipeline.
Your imported audio media set should look like this:
Convert the media set into table rows using Transforms.
This generates media references for the items in your media set. Media references let you use a media item in Foundry without making copies of the media item itself. Learn more about media references.
Next, select the Transcribe audio into text transformation.
Specify the inputs for the Transcribe audio into text transformation and select Apply.
Use the mediaReference generated from step 3, and select the desired language. If no language is provided, it will be inferred from the first 30 seconds of audio.
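To make the 30-second inference window concrete, here is a minimal Python sketch of how much of a recording such a window covers. It is only an illustration and not how Foundry implements language detection. The function name `first_seconds`, the flat list of samples, and the 16 kHz sample rate are all assumptions made for this example.

```python
def first_seconds(samples, sample_rate_hz, seconds=30.0):
    """Return the leading `seconds` of audio from a flat sequence of samples.

    Hypothetical helper for illustration only; it is not part of any Foundry API.
    """
    if sample_rate_hz <= 0:
        raise ValueError("sample_rate_hz must be positive")
    # Number of samples that fit in the window at this sample rate.
    window_length = int(sample_rate_hz * seconds)
    # Slicing past the end is safe: short clips are returned whole.
    return samples[:window_length]


# A 45-second clip at an assumed 16 kHz sample rate (silence, for simplicity).
clip = [0] * (45 * 16_000)
window = first_seconds(clip, sample_rate_hz=16_000)
print(len(window) / 16_000)  # duration of the window in seconds
```

Anything after that window would not influence the inferred language. If a recording opens with silence, music, or a different language from the main content, it may be worth selecting the language explicitly.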
You can preview the outputs from the transcription in the table.
You can continue to transform your audio transcription string output with available string transformations if needed.
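As an illustration of the kind of cleanup a string transformation might apply to a transcription, the following Python sketch collapses runs of whitespace and trims the ends. In Pipeline Builder you would do this with the built-in string transformations rather than code. The function name `clean_transcript` and the sample text are hypothetical.

```python
import re


def clean_transcript(text: str) -> str:
    """Collapse whitespace runs into single spaces and trim both ends.

    Hypothetical helper for illustration only; not a Pipeline Builder function.
    """
    collapsed = re.sub(r"\s+", " ", text)
    return collapsed.strip()


raw = "  Hello,\n  world.  "
print(clean_transcript(raw))
```

Normalizing whitespace in this way can make later steps, such as keyword matching or splitting into sentences, behave more predictably.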
Choose the desired pipeline output. You can output a Dataset, or ontologize the output by selecting an Object Type output. Creating an object type allows you to use your pipeline outputs in Workshop.