Rename WhisperBot to WhisperFusion

- README.md +11 -12
- README.qmd +7 -7
- docker/Dockerfile +4 -4
- docker/build.sh +3 -3
- docker/publish.sh +2 -2
- docker/scripts/{run-whisperbot.sh → run-whisperfusion.sh} +1 -1
- docker/scripts/{setup-whisperbot.sh → setup-whisperfusion.sh} +2 -2
- docker/scripts/setup.sh +1 -1
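Every hunk below is a mechanical substitution of the project name (plus two `git mv` script renames), so a sweep like this can be scripted rather than edited by hand. A minimal sketch, demonstrated on scratch files standing in for the repo (in a real checkout you would drive it with `git grep -l` and follow up with `git mv` for the renamed scripts):

```shell
#!/bin/bash -e
# Scratch demo of the rename sweep: two files that mention the old name.
workdir=$(mktemp -d)
printf '# WhisperBot\ncd WhisperBot\n' > "$workdir/README.md"
printf 'docker push ghcr.io/collabora/whisperbot:latest\n' > "$workdir/publish.sh"

# Rewrite every file containing either casing of the old name.
# sed is case-sensitive, so the CamelCase and lowercase forms are
# handled by separate expressions.
grep -rl -i 'whisperbot' "$workdir" | while read -r f; do
  sed -i -e 's/WhisperBot/WhisperFusion/g' \
         -e 's/whisperbot/whisperfusion/g' "$f"
done

cat "$workdir/README.md" "$workdir/publish.sh"
```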
README.md
CHANGED
@@ -1,8 +1,8 @@
-# WhisperBot
+# WhisperFusion
 
 
-Welcome to WhisperBot. WhisperBot builds upon the capabilities of the
-[WhisperLive](https://github.com/collabora/WhisperLive) and
+Welcome to WhisperFusion. WhisperFusion builds upon the capabilities of
+the [WhisperLive](https://github.com/collabora/WhisperLive) and
 [WhisperSpeech](https://github.com/collabora/WhisperSpeech) by
 integrating Mistral, a Large Language Model (LLM), on top of the
 real-time speech-to-text pipeline. WhisperLive relies on OpenAI Whisper,
@@ -149,17 +149,17 @@ cp -r phi-2 "$dest"
 cp -r "$phi_path" "$dest/phi-orig-model"
 ```
 
-## Build WhisperBot
+## Build WhisperFusion
 
 > [!NOTE]
 >
-> These steps are included in `docker/scripts/setup-whisperbot.sh`
+> These steps are included in `docker/scripts/setup-whisperfusion.sh`
 
 Clone this repo and install requirements
 
 ``` bash
-[ -d "WhisperBot" ] || git clone https://github.com/collabora/WhisperBot.git
-cd WhisperBot
+[ -d "WhisperFusion" ] || git clone https://github.com/collabora/WhisperFusion.git
+cd WhisperFusion
 apt update
 apt install ffmpeg portaudio19-dev -y
 ```
@@ -174,7 +174,6 @@ Install all the other dependencies normally
 
 ``` bash
 pip install -r requirements.txt
-pip install openai-whisper whisperspeech soundfile
 ```
 
 force update huggingface_hub (tokenizers 0.14.1 spuriously require and
@@ -191,7 +190,7 @@ curl -L -o /root/.cache/whisper-live/silero_vad.onnx https://github.com/snakers4
 python -c 'from transformers.utils.hub import move_cache; move_cache()'
 ```
 
-### Run WhisperBot with Whisper and Mistral/Phi-2
+### Run WhisperFusion with Whisper and Mistral/Phi-2
 
 Take the folder path for Whisper TensorRT model, folder_path and
 tokenizer_path for Mistral/Phi-2 TensorRT from the build phase. If a
@@ -200,11 +199,11 @@ huggingface repo name as the tokenizer path.
 
 > [!NOTE]
 >
-> These steps are included in `docker/scripts/run-whisperbot.sh`
+> These steps are included in `docker/scripts/run-whisperfusion.sh`
 
 ``` bash
 test -f /etc/shinit_v2 && source /etc/shinit_v2
-cd WhisperBot
+cd WhisperFusion
 if [ "$1" != "mistral" ]; then
   exec python3 main.py --phi \
     --whisper_tensorrt_path /root/whisper_small_en \
@@ -222,7 +221,7 @@ fi
 execute `run_client.py`
 
 ``` bash
-cd WhisperBot
+cd WhisperFusion
 pip install -r requirements.txt
 python3 run_client.py
 ```
README.qmd
CHANGED
@@ -27,9 +27,9 @@ These steps are included in `{fname}`
 if code: print("```")
 ```
 
-# WhisperBot
+# WhisperFusion
 
-Welcome to WhisperBot. WhisperBot builds upon the capabilities of the [WhisperLive](https://github.com/collabora/WhisperLive) and [WhisperSpeech](https://github.com/collabora/WhisperSpeech) by integrating Mistral, a Large Language Model (LLM), on top of the real-time speech-to-text pipeline. WhisperLive relies on OpenAI Whisper, a powerful automatic speech recognition (ASR) system. Both Mistral and Whisper are optimized to run efficiently as TensorRT engines, maximizing performance and real-time processing capabilities.
+Welcome to WhisperFusion. WhisperFusion builds upon the capabilities of the [WhisperLive](https://github.com/collabora/WhisperLive) and [WhisperSpeech](https://github.com/collabora/WhisperSpeech) by integrating Mistral, a Large Language Model (LLM), on top of the real-time speech-to-text pipeline. WhisperLive relies on OpenAI Whisper, a powerful automatic speech recognition (ASR) system. Both Mistral and Whisper are optimized to run efficiently as TensorRT engines, maximizing performance and real-time processing capabilities.
 
 ## Features
 - **Real-Time Speech-to-Text**: Utilizes OpenAI WhisperLive to convert spoken language into text in real-time.
@@ -60,23 +60,23 @@ include_file('docker/scripts/build-mistral.sh')
 include_file('docker/scripts/build-phi-2.sh')
 ```
 
-## Build WhisperBot
+## Build WhisperFusion
 
 ```{python}
-include_file('docker/scripts/setup-whisperbot.sh')
+include_file('docker/scripts/setup-whisperfusion.sh')
 ```
 
-### Run WhisperBot with Whisper and Mistral/Phi-2
+### Run WhisperFusion with Whisper and Mistral/Phi-2
 
 Take the folder path for Whisper TensorRT model, folder_path and tokenizer_path for Mistral/Phi-2 TensorRT from the build phase. If a huggingface model is used to build mistral/phi-2 then just use the huggingface repo name as the tokenizer path.
 
 ```{python}
-include_file('docker/scripts/run-whisperbot.sh')
+include_file('docker/scripts/run-whisperfusion.sh')
 ```
 
 - On the client side clone the repo, install the requirements and execute `run_client.py`
 ```bash
-cd WhisperBot
+cd WhisperFusion
 pip install -r requirements.txt
 python3 run_client.py
 ```
docker/Dockerfile
CHANGED
@@ -1,8 +1,8 @@
-FROM ghcr.io/collabora/whisperbot-base:latest as base
+FROM ghcr.io/collabora/whisperfusion-base:latest as base
 
 WORKDIR /root
-COPY scripts/setup-whisperbot.sh scripts/run-whisperbot.sh scratch-space/models /root/
-RUN ./setup-whisperbot.sh
+COPY scripts/setup-whisperfusion.sh scripts/run-whisperfusion.sh scratch-space/models /root/
+RUN ./setup-whisperfusion.sh
 
-CMD ./run-whisperbot.sh
+CMD ./run-whisperfusion.sh
 
docker/build.sh
CHANGED
@@ -4,11 +4,11 @@
 
 (
   cd base-image &&
-  docker build $ARGS -t ghcr.io/collabora/whisperbot-base:latest .
+  docker build $ARGS -t ghcr.io/collabora/whisperfusion-base:latest .
 )
 
 mkdir -p scratch-space
 cp -r scripts/build-* scratch-space
-docker run --gpus all --shm-size 64G -v "$PWD"/scratch-space:/root/scratch-space -w /root/scratch-space -it ghcr.io/collabora/whisperbot-base:latest ./build-models.sh
+docker run --gpus all --shm-size 64G -v "$PWD"/scratch-space:/root/scratch-space -w /root/scratch-space -it ghcr.io/collabora/whisperfusion-base:latest ./build-models.sh
 
-docker build $ARGS -t ghcr.io/collabora/whisperbot:latest .
+docker build $ARGS -t ghcr.io/collabora/whisperfusion:latest .
docker/publish.sh
CHANGED
@@ -1,4 +1,4 @@
 #!/bin/bash -e
 
-docker push ghcr.io/collabora/whisperbot-base:latest
-docker push ghcr.io/collabora/whisperbot:latest
+docker push ghcr.io/collabora/whisperfusion-base:latest
+docker push ghcr.io/collabora/whisperfusion:latest
docker/scripts/{run-whisperbot.sh → run-whisperfusion.sh}
RENAMED
@@ -2,7 +2,7 @@
 
 test -f /etc/shinit_v2 && source /etc/shinit_v2
 
-cd WhisperBot
+cd WhisperFusion
 if [ "$1" != "mistral" ]; then
   exec python3 main.py --phi \
     --whisper_tensorrt_path /root/whisper_small_en \
docker/scripts/{setup-whisperbot.sh → setup-whisperfusion.sh}
RENAMED
@@ -1,9 +1,9 @@
 #!/bin/bash -e
 
 ## Clone this repo and install requirements
-[ -d "WhisperBot" ] || git clone https://github.com/collabora/WhisperBot.git
+[ -d "WhisperFusion" ] || git clone https://github.com/collabora/WhisperFusion.git
 
-cd WhisperBot
+cd WhisperFusion
 apt update
 apt install ffmpeg portaudio19-dev -y
 
docker/scripts/setup.sh
CHANGED
@@ -3,4 +3,4 @@
 ./setup-whisper.sh
 #./setup-mistral.sh
 ./setup-phi-2.sh
-./setup-whisperbot.sh
+./setup-whisperfusion.sh
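A rename touching this many files is easy to leave half-done, so a check for stale references is worth running after the sweep. A sketch, demonstrated on a scratch tree standing in for the renamed checkout (in the real repo you would point it at the repo root):

```shell
#!/bin/bash -e
# Build a small scratch tree standing in for the renamed checkout.
tree=$(mktemp -d)
printf 'cd WhisperFusion\n' > "$tree/run-whisperfusion.sh"
printf 'FROM ghcr.io/collabora/whisperfusion-base:latest as base\n' > "$tree/Dockerfile"

# grep -r exits non-zero when nothing matches, which is the pass
# condition here: no file should mention the old name in any casing.
if grep -r -i -n 'whisperbot' "$tree"; then
  echo 'stale WhisperBot references remain' >&2
  status=1
else
  echo 'rename looks complete'
  status=0
fi
```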