Update README.md
README.md
CHANGED
@@ -1,19 +1,50 @@
 ---
-title:
-emoji:
-colorFrom:
-colorTo:
 sdk: gradio
 sdk_version: 4.36.0
 app_file: app.py
 pinned: false
-hf_oauth: true
-hf_oauth_expiration_minutes: 60
-hf_oauth_scopes:
-- read-repos
-- write-repos
-- manage-repos
 ---
-This transcribes audio using Faster Whisper v3 turbo int8.

-
 ---
+title: Audio Transcription
+emoji: 🎙️
+colorFrom: blue
+colorTo: green
 sdk: gradio
 sdk_version: 4.36.0
 app_file: app.py
 pinned: false
 ---

+# Multi-Source Audio Transcription with Faster Whisper
+
+This application transcribes audio from multiple sources using Faster Whisper v3 turbo int8, providing a flexible and powerful transcription solution.
+
+## Features
+
+- Transcribe audio from various sources:
+  - Uploaded audio files
+  - Direct URLs to MP3 files
+  - YouTube video URLs
+- Utilizes the latest GitHub version of Faster Whisper for optimal performance
+- Adjustable batch size for performance tuning
+- Provides detailed metrics including transcription time and real-time factor
+
+## How to Use
+
+1. Enter the source of your audio:
+   - Path to a local audio file
+   - URL of an MP3 file
+   - URL of a YouTube video
+2. Adjust the batch size if desired (default is 16)
+3. Click 'Submit' to start the transcription process
+
+## Output
+
+The application will provide:
+- A full transcription of the audio
+- Detected language and confidence
+- Duration of the audio
+- Transcription time and real-time factor
+- File size of the processed audio
+
+## Note
+
+This application is a prototype and may be subject to further improvements and optimizations. Performance may vary based on the input source and the processing capabilities of the hosting environment.
+
+## Feedback and Contributions
+
+I welcome feedback and contributions to improve this transcription tool.
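
The new README describes three kinds of input (local file, MP3 URL, YouTube URL) and reports a real-time factor, but the diff does not show how `app.py` handles either. The following is a minimal sketch, not the Space's actual code: the function names `classify_source` and `real_time_factor`, the set of YouTube hostnames, and the definition of real-time factor as transcription time divided by audio duration are all assumptions made for illustration.

```python
from urllib.parse import urlparse

# Assumed set of hostnames treated as YouTube; the real app may use another rule.
YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def classify_source(source: str) -> str:
    """Return "youtube", "url", or "file" for a user-supplied source string.

    Hypothetical helper: anything with an http(s) scheme is a URL,
    YouTube hosts are singled out, and everything else is a local path.
    """
    parsed = urlparse(source.strip())
    if parsed.scheme in ("http", "https"):
        host = (parsed.hostname or "").lower()
        if host in YOUTUBE_HOSTS:
            return "youtube"
        return "url"
    return "file"


def real_time_factor(transcription_seconds: float, audio_seconds: float) -> float:
    """Assumed definition: processing time divided by audio duration.

    Under this definition, values below 1.0 mean faster than real time.
    """
    if audio_seconds <= 0:
        raise ValueError("audio duration must be positive")
    return transcription_seconds / audio_seconds
```

Under these assumptions, a 60-second clip transcribed in 6 seconds has a real-time factor of 0.1. Some tools report the inverse ratio instead, so the figure shown by the Space should be read against its own definition.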