transcribe_audio / README.md
cstr's picture
Update README.md
8adc2d3 verified
|
raw
history blame
1.46 kB
---
title: Audio Transcription
emoji: 🎙️
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.36.0
app_file: app.py
pinned: false
---
# Multi-Source Audio Transcription with Faster Whisper
This application transcribes audio from multiple sources using Faster Whisper v3 turbo int8, providing a flexible and powerful transcription solution.
## Features
- Transcribe audio from various sources:
- Uploaded audio files
- Direct URLs to MP3 files
- YouTube video URLs
- Utilizes the latest GitHub version of Faster Whisper for optimal performance
- Adjustable batch size for performance tuning
- Provides detailed metrics including transcription time and real-time factor
## How to Use
1. Enter the source of your audio:
- Path to a local audio file
- URL of an MP3 file
- URL of a YouTube video
2. Adjust the batch size if desired (default is 16)
3. Click 'Submit' to start the transcription process
## Output
The application will provide:
- A full transcription of the audio
- Detected language and confidence
- Duration of the audio
- Transcription time and real-time factor
- File size of the processed audio
## Note
This application is a prototype and may be subject to further improvements and optimizations. Performance may vary based on the input source and the processing capabilities of the hosting environment.
## Feedback and Contributions
I welcome feedback and contributions to improve this transcription tool.