---
title: Audio Transcription
emoji: 🎙️
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.36.0
app_file: app.py
pinned: false
---

# Multi-Source Audio Transcription with Faster Whisper

This application transcribes audio from uploaded files, direct MP3 URLs, and YouTube videos using Faster Whisper with the Whisper v3 turbo model and int8 quantization.

## Features

- Transcribe audio from various sources:
  - Uploaded audio files
  - Direct URLs to MP3 files
  - YouTube video URLs
- Uses the latest version of Faster Whisper from GitHub
- Adjustable batch size for performance tuning
- Reports metrics, including transcription time and real-time factor

## How to Use

1. Enter the source of your audio:
   - Path to a local audio file
   - URL of an MP3 file
   - URL of a YouTube video
2. Adjust the batch size if desired (default is 16)
3. Click 'Submit' to start the transcription process
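
The three accepted source types have to be told apart before any audio is loaded. The following is a minimal sketch of one way to do that, not the code in `app.py`: the function name `classify_source`, the `YOUTUBE_HOSTS` list, and the returned labels are all assumptions made for illustration.

```python
from urllib.parse import urlparse

# Hostnames treated as YouTube. An illustrative list (assumption),
# not necessarily what the application checks.
YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def classify_source(source: str) -> str:
    """Return 'youtube', 'mp3_url', or 'local_file' for an input string.

    Hypothetical helper: the labels are made up for this sketch.
    """
    source = source.strip()
    parsed = urlparse(source)
    if parsed.scheme in ("http", "https"):
        host = (parsed.hostname or "").lower()
        if host in YOUTUBE_HOSTS:
            return "youtube"
        # Check the URL path only, so query strings do not interfere.
        if parsed.path.lower().endswith(".mp3"):
            return "mp3_url"
        raise ValueError(f"Unsupported URL: {source}")
    # Anything that is not an http(s) URL is treated as a local path.
    return "local_file"
```

For example, `classify_source("https://youtu.be/abc")` returns `"youtube"`, while a plain path such as `recordings/talk.wav` returns `"local_file"`.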

## Output

The application provides:
- A full transcription of the audio
- Detected language and confidence
- Duration of the audio
- Transcription time and real-time factor
- File size of the processed audio
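
To make the metrics concrete, here is a rough sketch of how such a summary could be assembled. It assumes the common definition of real-time factor as transcription time divided by audio duration (values below 1 mean faster than real time); the application may define it differently. The function names and the report layout are hypothetical.

```python
def real_time_factor(transcription_seconds: float, audio_seconds: float) -> float:
    """Transcription time divided by audio duration (assumed definition)."""
    if audio_seconds <= 0:
        raise ValueError("audio duration must be positive")
    return transcription_seconds / audio_seconds


def format_report(
    language: str,
    language_probability: float,
    audio_seconds: float,
    transcription_seconds: float,
    file_size_bytes: int,
) -> str:
    """Build a plain-text metrics summary (hypothetical layout)."""
    rtf = real_time_factor(transcription_seconds, audio_seconds)
    size_mb = file_size_bytes / (1024 * 1024)
    return "\n".join(
        [
            f"Language: {language} (confidence {language_probability:.2f})",
            f"Audio duration: {audio_seconds:.2f} s",
            f"Transcription time: {transcription_seconds:.2f} s",
            f"Real-time factor: {rtf:.2f}",
            f"File size: {size_mb:.2f} MB",
        ]
    )
```

Under this definition, a 10-minute recording transcribed in 30 seconds has a real-time factor of 30 / 600 = 0.05.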

## Note

This application is a prototype and may change as it is improved and optimized. Performance varies with the input source and the hardware of the hosting environment.

## Feedback and Contributions

I welcome feedback and contributions to improve this transcription tool.