FAB Subtitler supports several transcription services for speech recognition for:
transcription of video/audio files in FAB Subtitler PRO/LIVE/MPEG (described on this page)
live transcription of live video/audio streams in FAB Subtitler LIVE/MPEG
FAB Subtitler PRO/LIVE/MPEG supports the use of cloud based speech recognition services for transcription of audio from video files to subtitle files with text and timecode. FAB Subtitler transfers video files to the cloud and the progress of the transfer and of the transcription can be viewed on the screen. When the cloud service finishes the transcription FAB Subtitler will download the JSON result file of the transcription and store it in a local folder and delete the video file in the cloud. It is also possible to configure FAB Subtitler to extract the audio from the video file and only transfer the audio to the cloud. Most transcription services only provide recognized text and timecode.
Some transcription services may provide additional information. Microsoft Video Indexer also provides the timecode of recognized scene changes and the speaker of the recognized text. FAB Subtitler will then display scene changes in the audio graph and the text for different speakers will be displayed in different colors (when this is configured in options of FAB Subtitler).
FAB Subtitler supports the following cloud transcription services:
- Microsoft Video Indexer
- Google Speech To Text
- Scriptix Speech to Text
- Speechmatics Cloud ASR
- Trint Transcription Service
- Microsoft Azure Cognitive Services Speech
The quality of the recognized text is generally not 100% perfect and it mostly requires some manual corrections in FAB Subtitler. However when using the transcription service the subtitle preparation process may require much less time because timecodes are generally already correct and only some text has to be corrected. For video material that works well with the selected transcription service up to 70% less time will be required to prepare the subtitles.
Configuration for file transcription
- “Maximum number of lines per subtitle” defines how many lines of text should be used in every subtitle for the recognized text when converting JSON result files returned by the transcription service to subtitle files.
- “Use spell check to automatically correct the transcript” defines that FAB Subtitler will use the spell check function to try to correct some errors returned by the transcription service
- “Assign colors to speakers when present in the transcript file” will instruct FAB Subtitler to use a different color for every speaker. This will however only work when the speaker ID is present in the transcript file returned by the transcription service.
- “Store the result file of the transcription in a separate folder (not in the video folder)” instructs FAB Subtitler to store transcript result files in a separate folder and not in the folder with video files.
Microsoft Video Indexer
To configure FAB Subtitler to use Microsoft Video Indexer as the transcription service open Options / Special / Transcription:
- API Key: The API Key is provided by Microsoft in the web interface of Microsoft Video Indexer so that external applications can use the Microsoft Video Indexer web service. Make sure to enter the API key for your Microsoft Video Indexer account. The following page describes how to obtain the API key:
- Location: The location of the service. For trial accounts you can use “Trial”
Scriptix Speech to Text
To configure FAB Subtitler to use Scriptix cloud service as the transcription service open Options / Special / Transcription:
- API Key: The API Key is provided by Scriptix in the web interface.
The Scriptix cloud service can also be used for live transcription. To use is for live transcription click on “Live transcription” and also enter the API Key for Scriptix Live Transcription.
Speechmatics Cloud ASR
To configure FAB Subtitler to use Speechmatics Cloud ASR as the transcription service open Options / Special / Transcription:
Auth token: The Auth token is provided by Speechmatics in the web interface.
User ID: The user ID is the user that is used to log in into the Speechmatics web interface.
Google Speech to Text
To configure FAB Subtitler to use Google Cloud as the transcription service open Options / Special / Transcription:
JSON Key: The JSON Key is a text file which has to be loaded from a JSON file or the content has to be copied into this field. You will find a description how to obtain the JSON key further below.
Google Cloud Storage Bucket Name: Enter the name of the storage bucket that will be used for storing of audio files transferred to the Google cloud. FAB Subtitler will always extract the audio from the video file and only transfer the audio to the Google cloud. The instructions below describe how to create a bucket. Please note that you will have to use a different bucket name (not fab-asr-bucket) because Google storage bucket names must be globally unique.
To configure the Google Cloud to be used with FAB Subtitler follow these instructions:
Visit https://cloud.google.com and create an account. In July 2018 Google offered a trial account with USD 300 credit which can be used within 12 months. This allows extensive testing of the Google cloud service.
Before you can use the Google cloud you must first create a project. Create a project with the name FAB-ASR-Project as shown below
- Select the project FAB-ASR-Project as the current project
- Create Credentials for a new service account in the API Manager
After the previous step the JSON file with the credentials for the new service account will be downloaded to the computer. The JSON file shall be imported into FAB Subtitler Options /Special / Transcription / Google Speech to Text.
Create the storage bucket with a globally unique name (do not use fab-asr-bucket). The bucket will be used to store audio files for transcription in the cloud
- Make sure to Add bucket permissions for the service account which you have created in one of the previous steps
- Set “Storage Admin” role for the service account
- Enable the cloud speech API
After configuring all above FAB Subtitler shall be able to use the transcription service of the Google cloud.
To configure FAB Subtitler to use Trint as the transcription service open Options / Special / Transcription:
- API Key: The API Key is provided by Trint in the web interface of https://trint.com so that external applications can use the Trint web service. Make sure to enter the API key for your Trint account. First create a user account on trint.com and then create the API key by clicking on the “User” symbol and selecting API:
Microsoft Azure Cognitive Services Speech
To configure FAB Subtitler to use Microsoft Azure as the transcription service open Options / Special / Transcription:
- API Key & Location: You will have to obtain both the API Key and the location from https://portal.azure.com
After logging into https://portal.azure.com:
- Click on “Create a resource” and search for “Speech”
- The click on “Create” for Speech
- Enter the data for the fields as in the picture below
- The confirmation page will also display the Location and then click on “Go to resource”
- Click on “Keys and Endpoint” to select one of the API Keys that has to be entered in FAB Subtitler settings
Using File Transcription Functionality in FAB Subtitler
To transfer video/audio files to the cloud transcription service click in the ribbon on Video / Transcription. A window will open which will display the status of the video analysis and allow uploading files to the cloud transcription service.
The upper part of the window allows selecting the video/audio file on the disk and the transfer of the file to the cloud transcription service is started when clicking on the button Start transfer. After the file analysis is finished, FAB Subtitler will download the JSON result file from the cloud transcription service and store it locally. After that FAB Subtitler will delete the video file in the cloud transcription service.
The lower part of the window displays the status of the file which has been transferred to the cloud transcription service:
- Uploading: the file is currently being transferred to the cloud transcription service
- Transcribing: the file is currently being transcribed by the cloud transcription service
- Completed: the file transcription is finished, FAB Subtitler will download the JSON result file when it becomes available in the cloud transcription service
- Failed: the transcription of the file has failed, the reason returned by the cloud transcription service will also be displayed
- Transcribed, available locally: JSON result files returned by the cloud transcription service which are stored in the local file folder and can be opened in FAB Subtitler as a subtitle file
- Other: any other status of the file provided by the cloud transcription service
Using File Transcription in Automatic Watchfolder Mode
FAB Subtitler PRO/LIVE/MPEG can be configured to monitor folders and use transcription functionality (upload video files and download transcribed files) as soon as a video file is present in the watchfolder.
To configure this type of automatic operation:
- In Options/Special/Automation configure the following:
- Save a file with the name “watchfolder.xml” which contains conversion settings to the input folder by using the following procedure:
Click on Video / Transcribe
Select the transcription provider and choose all necessary settings
Click on the Save button which will save the XML file with all settings which are currently configured
Save the file with the name “watchfolder.xml” to the input folder
The automatic conversion operation will start when the file “watchfolder.xml” and the file to be processed both exist in the input folder
- To activate the automatic operation click on Extras / Tools > Start automatic mode
- To run FAB Subtitler in automatic mode in a 24/7 operation it is recommended to start FAB Subtitler using FAB Activity Manager. FAB Activity Manager will:
- Start FAB Subtitler
- Continuously check if FAB Subtitler is running correctly
- Restart FAB Subtitler if any problems are detected
Using JSON result files in FAB Subtitler
JSON files returned by cloud transcription services can be opened in FAB Subtitler directly and the content (which is a structured file containing recognized text and times for words) will be converted to subtitles with text and timecode.
There are several options which influence the conversion from JSON to a subtitle file. The options are available in Options: