Spaces:
Running
Running
A newer version of the Gradio SDK is available:
5.6.0
metadata
title: Audio Transcription
emoji: 🎙️
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.36.0
app_file: app.py
pinned: false
Multi-Source Audio Transcription with Faster Whisper
This application transcribes audio from multiple sources using Faster Whisper v3 turbo int8, providing a flexible and powerful transcription solution.
Features
- Transcribe audio from various sources:
- Uploaded audio files
- Direct URLs to MP3 files
- YouTube video URLs
- Utilizes the latest GitHub version of Faster Whisper for optimal performance
- Adjustable batch size for performance tuning
- Provides detailed metrics including transcription time and real-time factor
How to Use
- Enter the source of your audio:
- Path to a local audio file
- URL of an MP3 file
- URL of a YouTube video
- Adjust the batch size if desired (default is 16)
- Click 'Submit' to start the transcription process
Output
The application will provide:
- A full transcription of the audio
- Detected language and confidence
- Duration of the audio
- Transcription time and real-time factor
- File size of the processed audio
Note
This application is a prototype and may be subject to further improvements and optimizations. Performance may vary based on the input source and the processing capabilities of the hosting environment.
Feedback and Contributions
I welcome feedback and contributions to improve this transcription tool.