Spaces: utkarsh-dixit/WhisperFusion
Marcus Edel committed on Jan 25, 2024

Commit b5073d7 (1 parent: a7b86dc)

Add sample video to the README.

Files changed (2)

README.md CHANGED

@@ -1,5 +1,10 @@
 # WhisperFusion
+<h2 align="center">
+  <a href="https://www.youtube.com/watch?v=_PnaP0AQJnk"><img
+src="https://img.youtube.com/vi/_PnaP0AQJnk/0.jpg" style="background-color:rgba(0,0,0,0);" height=300 alt="WhisperFusion"></a>
+  <br><br>Doing math with WhisperFusion: Ultra-low latency conversations with an AI chatbot<br><br>
+</h2>
 Welcome to WhisperFusion. WhisperFusion builds upon the capabilities of
 the [WhisperLive](https://github.com/collabora/WhisperLive) and

README.qmd CHANGED

@@ -29,6 +29,12 @@ These steps are included in `{fname}`
 # WhisperFusion
+<h2 align="center">
+  <a href="https://www.youtube.com/watch?v=_PnaP0AQJnk"><img
+src="https://img.youtube.com/vi/_PnaP0AQJnk/0.jpg" style="background-color:rgba(0,0,0,0);" height=300 alt="WhisperFusion"></a>
+  <br><br>Doing math with WhisperFusion: Ultra-low latency conversations with an AI chatbot<br><br>
+</h2>
 Welcome to WhisperFusion. WhisperFusion builds upon the capabilities of the [WhisperLive](https://github.com/collabora/WhisperLive) and [WhisperSpeech](https://github.com/collabora/WhisperSpeech) by integrating Mistral, a Large Language Model (LLM), on top of the real-time speech-to-text pipeline. WhisperLive relies on OpenAI Whisper, a powerful automatic speech recognition (ASR) system. Both Mistral and Whisper are optimized to run efficiently as TensorRT engines, maximizing performance and real-time processing capabilities.
 ## Features
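
Both hunks add the same linked-thumbnail block, and the watch URL and image URL in it share a single video ID. As an illustration only (not part of this commit), here is a minimal Python sketch that builds such a snippet from the ID. The helper name `youtube_thumbnail_link` is hypothetical, and the `img.youtube.com/vi/<id>/0.jpg` pattern is assumed from the URL used in the diff rather than from any documented API.

```python
from html import escape


def youtube_thumbnail_link(video_id: str, alt: str, height: int = 300) -> str:
    """Return an HTML anchor that wraps a video's thumbnail image.

    Hypothetical helper: the URL patterns are copied from the diff above,
    not taken from an official YouTube specification.
    """
    watch_url = f"https://www.youtube.com/watch?v={video_id}"
    thumb_url = f"https://img.youtube.com/vi/{video_id}/0.jpg"
    # escape() guards attribute values against quotes and angle brackets.
    return (
        f'<a href="{escape(watch_url)}">'
        f'<img src="{escape(thumb_url)}" height="{height}" alt="{escape(alt)}"></a>'
    )


# Video ID and alt text as they appear in the commit.
snippet = youtube_thumbnail_link("_PnaP0AQJnk", "WhisperFusion")
print(snippet)
```

Generating the block from one ID keeps `README.md` and `README.qmd` from drifting apart if the sample video is ever replaced.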