cstr commited on
Commit
8adc2d3
1 Parent(s): 33e69cd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +43 -12
README.md CHANGED
@@ -1,19 +1,50 @@
1
  ---
2
- title: Transcribe Audio
3
- emoji: 🌖
4
- colorFrom: gray
5
- colorTo: pink
6
  sdk: gradio
7
  sdk_version: 4.36.0
8
  app_file: app.py
9
  pinned: false
10
- hf_oauth: true
11
- hf_oauth_expiration_minutes: 60
12
- hf_oauth_scopes:
13
- - read-repos
14
- - write-repos
15
- - manage-repos
16
  ---
17
- This transcribes audio using Faster Whisper v3 turbo int8.
18
 
19
- It is yet a prototype.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: Audio Transcription
3
+ emoji: 🎙️
4
+ colorFrom: blue
5
+ colorTo: green
6
  sdk: gradio
7
  sdk_version: 4.36.0
8
  app_file: app.py
9
  pinned: false
 
 
 
 
 
 
10
  ---
 
11
 
12
+ # Multi-Source Audio Transcription with Faster Whisper
13
+
14
+ This application transcribes audio from multiple sources using Faster Whisper v3 turbo int8, providing a flexible and powerful transcription solution.
15
+
16
+ ## Features
17
+
18
+ - Transcribe audio from various sources:
19
+ - Uploaded audio files
20
+ - Direct URLs to MP3 files
21
+ - YouTube video URLs
22
+ - Utilizes the latest GitHub version of Faster Whisper for optimal performance
23
+ - Adjustable batch size for performance tuning
24
+ - Provides detailed metrics including transcription time and real-time factor
25
+
26
+ ## How to Use
27
+
28
+ 1. Enter the source of your audio:
29
+ - Path to a local audio file
30
+ - URL of an MP3 file
31
+ - URL of a YouTube video
32
+ 2. Adjust the batch size if desired (default is 16)
33
+ 3. Click 'Submit' to start the transcription process
34
+
35
+ ## Output
36
+
37
+ The application will provide:
38
+ - A full transcription of the audio
39
+ - Detected language and confidence
40
+ - Duration of the audio
41
+ - Transcription time and real-time factor
42
+ - File size of the processed audio
43
+
44
+ ## Note
45
+
46
+ This application is a prototype and may be subject to further improvements and optimizations. Performance may vary based on the input source and the processing capabilities of the hosting environment.
47
+
48
+ ## Feedback and Contributions
49
+
50
+ I welcome feedback and contributions to improve this transcription tool.