Transcribe Speech to Text
Made by Google Cloud
Transcribes audio files in Cloud Storage to .txt files using Cloud Speech To Text.
- Works with
- Cloud Storage and Cloud Firestore
- Version
- 0.1.5 | Source code
- Tags
- ai, speech-to-text, speech-transcription, google-ai
- License
- Apache-2.0
- Publisher
- Google Cloud
- Report
- Bug
- Abuse
How this extension works
This extension transcribes audio in Cloud Storage using Google Cloud Speech-to-Text API, and writes the resulting text output back to Cloud Storage.
Upon installation, you will be asked to provide a Storage path. New audio files (Storage objects) added to this path will trigger a Cloud Function that transcribes the audio to text, and writes it back to the same Storage path as a .txt file.
To determine if a given Storage object is an audio file, the extension checks the “Content-Type” field in the Storage Object metadata. If the Content-Type starts with “audio/” it will be considered valid for transcription.
Use Cases
Here are a few examples of how you could use the Firebase Storage Transcribe Audio Extension:
- A podcast hosting platform could use the extension to automatically transcribe podcast episodes and make them searchable on its website.
- A company could use the extension to transcribe audio files from customer support calls and store them in Firestore for future reference.
- An education platform could use the extension to transcribe lectures and create closed captions for their students.
- A journalist could use the extension to transcribe interviews and save them in Storage for later reference.
Additional Setup
Ensure you have a Cloud Storage bucket set up in your Firebase project.
Language and Model Configuration
During the installation process, you will be asked to provide the BCP-47 code for the transcription language and select a language model. You can find the supported languages and BCP-47 codes in the Language support documentation.
The language model determines the use-case the speech-to-text transcription algorithm should be optimized for. You can find more details on the available models in the model field documentation. If you are unsure, you can use the default model.
Billing
To install an extension, your project must be on the Blaze (pay as you go) plan.
You will be charged a small amount (typically around $0.01/month) for the Firebase resources required by this extension (even if it is not used).
This extension uses other Firebase and Google Cloud Platform services, which have associated charges if you exceed the service’s no-cost tier:
- Cloud Speech-to-Text API
- Cloud Storage
- Eventarc (optional)
- Cloud Functions (See FAQs)