Once imported, you will be able to view your audio media set.
Part 2: Transcribe audio media set via Pipeline Builder
Create a new pipeline in Pipeline Builder. Detailed steps can be found in the initial set up section of the Pipeline Builder documentation.
Add your audio media set to the pipeline.
Your imported audio media set should look like this:
Next, select the Transcribe audio into text transformation using Transforms.
Specify the inputs for the Transcribe audio into text transformation and select Apply.
Use the media_reference column from the media set input, and select the desired language. If no language is provided, it will be inferred from the first 30 seconds of audio. Choose to output the transcription as a plain text string, or to include segment field details with timestamps and confidence scores.
You can preview the outputs from the transcription in the table.
You can continue to transform your audio transcription string output with available string transformations if needed.
Part 3: Save pipeline output
Choose the desired pipeline output. You may output as Dataset or choose to ontologize the output by selecting an Object Type output. Creating an object type will allow you to use your pipeline outputs in Workshop.