metadata

title: Audio Transcription
emoji: 🎙️
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.36.0
app_file: app.py
pinned: false

Multi-Source Audio Transcription with Faster Whisper

This application transcribes audio from multiple sources using Faster Whisper with the Whisper large-v3 turbo model and int8 quantization.

Features

- Transcribes audio from several sources:
  - Uploaded audio files
  - Direct URLs to MP3 files
  - YouTube video URLs
- Uses the latest version of Faster Whisper from GitHub
- Adjustable batch size for performance tuning
- Reports detailed metrics, including transcription time and real-time factor
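
As a rough illustration of the batched transcription described above, here is a minimal sketch using the faster-whisper package. It is not taken from app.py: the model name "large-v3-turbo", the device setting, and the helper names `join_segments` and `transcribe` are assumptions made for the example.

```python
from typing import Iterable


def join_segments(segments: Iterable) -> str:
    """Concatenate the text of segment-like objects into one transcript."""
    parts = (seg.text.strip() for seg in segments)
    return " ".join(part for part in parts if part)


def transcribe(audio_path: str, batch_size: int = 16):
    """Transcribe one audio file with batched inference (hypothetical helper)."""
    # Imported lazily so join_segments works without faster-whisper installed.
    from faster_whisper import BatchedInferencePipeline, WhisperModel

    # Assumption: this model name and int8 compute type match the setup
    # described in this README; the real app may configure them differently.
    model = WhisperModel("large-v3-turbo", device="auto", compute_type="int8")
    batched = BatchedInferencePipeline(model=model)

    # transcribe() returns a lazy iterator of segments plus an info object.
    segments, info = batched.transcribe(audio_path, batch_size=batch_size)
    text = join_segments(segments)
    return text, info.language, info.language_probability, info.duration
```

A larger batch size generally trades memory for throughput, so the best value depends on the hardware the Space runs on.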

How to Use

- Enter the source of your audio:
  - Path to a local audio file
  - URL of an MP3 file
  - URL of a YouTube video
- Adjust the batch size if desired (default is 16)
- Click 'Submit' to start the transcription
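
The three kinds of input listed above have to be told apart before the audio can be fetched. The sketch below shows one way this could be done with the standard library; the function name `classify_source`, the host list, and the returned labels are hypothetical and not taken from app.py.

```python
from urllib.parse import urlparse

# Assumption: these hostnames cover the YouTube URLs users are likely to paste.
YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def classify_source(source: str) -> str:
    """Return 'youtube', 'mp3_url', 'unknown_url', or 'local_file'."""
    parsed = urlparse(source.strip())
    if parsed.scheme in ("http", "https"):
        host = (parsed.hostname or "").lower()
        if host in YOUTUBE_HOSTS:
            return "youtube"
        # Check the path only, so query strings after ".mp3" are ignored.
        if parsed.path.lower().endswith(".mp3"):
            return "mp3_url"
        return "unknown_url"
    # Anything that is not an http(s) URL is treated as a local path.
    return "local_file"
```

For example, `classify_source("https://youtu.be/abc123")` returns `"youtube"`, while `classify_source("/tmp/talk.wav")` returns `"local_file"`.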

Output

The application will provide:

- A full transcription of the audio
- Detected language and its confidence
- Duration of the audio
- Transcription time and real-time factor
- File size of the processed audio
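
The real-time factor is commonly defined as processing time divided by audio duration, so a value below 1 means the transcription ran faster than real time. The following is a minimal sketch under that assumption; the app may compute it differently, and the names `Metrics` and `timed` are hypothetical.

```python
import time
from dataclasses import dataclass
from typing import Callable, Tuple, TypeVar

T = TypeVar("T")


@dataclass
class Metrics:
    audio_duration_s: float
    transcription_time_s: float

    @property
    def real_time_factor(self) -> float:
        """Processing time per second of audio (lower is faster)."""
        if self.audio_duration_s <= 0:
            raise ValueError("audio duration must be positive")
        return self.transcription_time_s / self.audio_duration_s


def timed(fn: Callable[[], T]) -> Tuple[T, float]:
    """Run fn and return its result with the elapsed wall-clock seconds."""
    start = time.perf_counter()
    result = fn()
    return result, time.perf_counter() - start
```

With these definitions, a 60-second clip transcribed in 6 seconds gives `Metrics(60.0, 6.0).real_time_factor` of about 0.1.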

Note

This application is a prototype and may change as it is improved and optimized. Performance varies with the input source and the processing capabilities of the hosting environment.

Feedback and Contributions

I welcome feedback and contributions to improve this transcription tool.