killian31 commited on
Commit
7788122
ยท
1 Parent(s): 1e48aff

docs: update README

Browse files
Files changed (2) hide show
  1. README.md +118 -24
  2. app_ex.png +0 -0
README.md CHANGED
@@ -1,37 +1,131 @@
1
- ---
2
- title: AudioVisualTranscription
3
- app_file: app.py
4
- sdk: gradio
5
- sdk_version: 5.1.0
6
- ---
7
- # Speech to Video Subtitles
8
 
9
- Get your synchronized subtitiled video in seconds!
 
 
10
 
11
  ![App screenshot](./app_ex.png)
12
 
13
- ## Installation
14
 
15
- In your terminal, run the following commands
 
16
 
17
- ```bash
18
- git clone https://github.com/killian31/AudioVisualTranscription
19
- cd AudioVisualTranscription
20
- pyenv virtualenv 3.11.9 avt
21
- pyenv activate avt
22
- pip install poetry
23
- poetry install
24
- ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
25
 
26
- The app needs ImageMagick and ffmpeg to run. To install them, run
 
27
 
28
- - MacOS: `bash ./install_macos.sh`
29
- - Debian/Ubuntu: `chmod +x install_linux.sh; ./install_linux.sh`
 
30
 
31
- ## Usage
32
 
33
- Launch the Gradio app with
 
 
 
 
 
 
 
34
 
35
  ```bash
36
- python3 app.py
37
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # AudioVisualTranscription - Speech to Video Subtitles ๐ŸŽฅโœจ
 
 
 
 
 
 
2
 
3
+ [![Open in Spaces](https://huggingface.co/datasets/huggingface/badges/resolve/main/open-in-hf-spaces-md.svg)](https://huggingface.co/spaces)
4
+
5
+ Get your synchronized subtitled video in minutes with AI!
6
 
7
  ![App screenshot](./app_ex.png)
8
 
9
+ ## ๐Ÿ“œ Overview
10
 
11
+ **AVT** is a tool that allows you to precisely subtitle your audio or video
12
+ content in minutes, using the power of AI.
13
 
14
+ Whether you need subtitles for accessibility, language learning, or just to make
15
+ your content more engaging, this app has got you covered. Simply upload your audio
16
+ or video file, select the language, and let the magic happen.
17
+
18
+ ## โœจ Features
19
+
20
+ - **Easy-to-use Interface**: Powered by [Gradio](https://gradio.app) for an
21
+ intuitive user experience.
22
+ - **Multi-Language Support**: Supports transcription in multiple languages:
23
+ English, Spanish, French, German, Italian, Dutch, Russian, Norwegian, Chinese,
24
+ and more.
25
+ - **Video Playback**: View your subtitled video directly in the web app.
26
+ - **Download Subtitles**: Save generated subtitle files for use with your preferred
27
+ video player.
28
+
29
+ ## ๐Ÿš€ Quickstart
30
+
31
+ The easiest way to use **AVT** is through this
32
+ [Hugging Face Space](https://huggingface.co/spaces/nelikCode/AudioVisualTranscription).
33
+
34
+ To use it locally, follow the steps below.
35
+
36
+ ### Installation
37
+
38
+ Follow these steps to set up the application on your local machine.
39
+
40
+ 1. **Clone the repository**:
41
+
42
+ ```bash
43
+ git clone https://github.com/killian31/AudioVisualTranscription
44
+ cd AudioVisualTranscription
45
+ ```
46
+
47
+ 2. **Create a Python environment** using pyenv:
48
+
49
+ ```bash
50
+ pyenv virtualenv 3.11.9 avt
51
+ pyenv activate avt
52
+ ```
53
+
54
+ 3. **Install Poetry**:
55
+
56
+ ```bash
57
+ pip install poetry
58
+ ```
59
+
60
+ 4. **Install dependencies**:
61
+
62
+ ```bash
63
+ poetry install
64
+ ```
65
 
66
+ 5. **Install system-level dependencies**:
67
+ - **MacOS**: Run the following script to install FFmpeg and ImageMagick.
68
 
69
+ ```bash
70
+ bash ./install_macos.sh
71
+ ```
72
 
73
+ - **Debian/Ubuntu**: Run the following commands to install FFmpeg and ImageMagick.
74
 
75
+ ```bash
76
+ chmod +x install_linux.sh
77
+ ./install_linux.sh
78
+ ```
79
+
80
+ ### Running the App
81
+
82
+ To launch the Gradio app:
83
 
84
  ```bash
85
+ python app.py
86
  ```
87
+
88
+ After launching, navigate to the provided local URL to interact with the
89
+ application in your browser.
90
+
91
+ ## ๐Ÿ“Š How It Works
92
+
93
+ 1. **Upload Your Content**: Use the provided options to upload an audio file
94
+ **or** a video file. Select the file type accordingly in the dropdown menu
95
+ (Video, Audio).
96
+ 2. **Select Your Preferences**: Choose the language of transcription and any
97
+ delay settings you prefer.
98
+ 3. **Generate Subtitles**: Click on the โ€œGenerate Subtitled Videoโ€ button to
99
+ process your input.
100
+ 4. **Download or View**: View the subtitled video directly on the web interface
101
+ or download the SRT subtitle file for later use. You need to generate the
102
+ subtitles before being able to ckick on the download button.
103
+
104
+ ## ๐Ÿ›  Requirements
105
+
106
+ The app relies on the following system-level dependencies:
107
+
108
+ - **[FFmpeg](https://ffmpeg.org/)**: Required for handling video and audio.
109
+ - **[ImageMagick](https://imagemagick.org/)**: Required for video processing.
110
+
111
+ Please ensure these are installed using the provided scripts before running the app.
112
+
113
+ ## ๐Ÿ“š Technologies Used
114
+
115
+ - **Gradio**: Provides the web interface for easy interaction.
116
+ - **Whisper by OpenAI**: Performs speech recognition.
117
+
118
+ ## ๐Ÿค Contributing
119
+
120
+ Contributions are welcome! If you'd like to improve the app or add new features,
121
+ feel free to fork the repository and open a pull request. Please format your code
122
+ with `black`.
123
+
124
+ ## ๐Ÿ“„ License
125
+
126
+ This project is open source and available under the [Apache 2.0 License](LICENSE).
127
+
128
+ ## โœ‰๏ธ Contact
129
+
130
+ If you have any questions, feel free to
131
+ [open an issue](https://github.com/killian31/AudioVisualTranscription/issues/new).
app_ex.png CHANGED