Spaces: JohnInizio/persona-chat-demo


Create app.py

by zainmushtaq54 - opened Sep 27

base: refs/heads/main ← from: refs/pr/1

Files changed (1)

app.py +0 -20

@@ -1,22 +1,3 @@
-'''
-+----------------------+        +-------------------------+        +-------------------------------+        +-------------------------+
-| Step 1: Set Up       |        |  Step 2: Set Up Gradio  |        |  Step 3: Speech-to-Text       |        |  Step 4: Text-to-Speech |
-| Environment          |        |  Interface              |        | & Language Model Processing   |        |  Output                 |
-+----------------------+        +-------------------------+        +-------------------------------+        +-------------------------+
-|                      |        |                         |        |                               |        |                         |
-| - Import Python      |        | - Define interface      |        | - Transcribe audio            |        | - XTTS model generates  |
-|   libraries          |        |   components            |        |   to text using               |        |   spoken response from  |
-| - Initialize models: |--------> - Configure audio and   |------->|   Faster Whisper ASR          |------->|   LLM's text response   |
-|   Whisper, Mistral,  |        |   text interaction      |        | - Transcribed text            |        |                         |
-|   XTTS               |        | - Launch interface      |        |   is added to                 |        |                         |
-|                      |        |                         |        |   chatbot's history           |        |                         |
-|                      |        |                         |        | - Mistral LLM                 |        |                         |
-|                      |        |                         |        |   processes chatbot           |        |                         |
-|                      |        |                         |        |   history to generate         |        |                         |
-|                      |        |                         |        |   response                    |        |                         |
-+----------------------+        +-------------------------+        +-------------------------------+        +-------------------------+
-'''
 ###### Set Up Environment ######
 import os
@@ -205,7 +186,6 @@ with gr.Blocks(title="Voice chat with LLM") as demo:
             - Speech to Text Model: [Faster-Whisper-large-v3](https://huggingface.co/Systran/faster-whisper-large-v3) an ASR model, to transcribe recorded audio to text.
             - Large Language Model: [Mistral-7b-instruct-v0.1-quantized](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF) a LLM to generate the chatbot responses.
             - Text to Speech Model: [XTTS-v2](https://huggingface.co/spaces/coqui/xtts) a TTS model, to generate the voice of the chatbot.
             Note:
             - Responses generated by chat model should not be assumed correct or taken serious, as this is a demonstration example only
             - iOS (Iphone/Ipad) devices may not experience voice due to autoplay being disabled on these devices by Vendor"""
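
Since this change removes the only overview of the pipeline, the flow the deleted diagram describes (transcribe audio, append the text to the chat history, have the LLM answer from that history, then synthesize speech) can be sketched roughly as below. This is a minimal sketch under stated assumptions, not the actual app.py code, which this diff does not show: `VoiceChat`, `turn`, and the three type aliases are hypothetical names, and the models are passed in as plain callables instead of real Faster Whisper, Mistral, or XTTS calls.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Hypothetical type aliases; none of these names come from app.py.
History = List[Tuple[str, str]]           # (role, text) pairs
Transcriber = Callable[[bytes], str]      # audio -> text (the ASR role)
Generator = Callable[[History], str]      # history -> reply (the LLM role)
Synthesizer = Callable[[str], bytes]      # text -> audio (the TTS role)


@dataclass
class VoiceChat:
    """One object that wires the three models together (assumed design)."""

    transcribe: Transcriber
    generate: Generator
    synthesize: Synthesizer
    history: History = field(default_factory=list)

    def turn(self, audio: bytes) -> bytes:
        # Step 3: transcribe the recorded audio to text.
        user_text = self.transcribe(audio)
        # Step 3: add the transcribed text to the chatbot's history.
        self.history.append(("user", user_text))
        # Step 3: the LLM generates a response from the whole history.
        reply = self.generate(self.history)
        self.history.append(("assistant", reply))
        # Step 4: the TTS model turns the text response into speech.
        return self.synthesize(reply)


if __name__ == "__main__":
    # Stand-in callables so the sketch runs without any model weights.
    chat = VoiceChat(
        transcribe=lambda audio: audio.decode("utf-8"),
        generate=lambda history: "You said: " + history[-1][1],
        synthesize=lambda text: text.encode("utf-8"),
    )
    print(chat.turn(b"hello"))
```

Passing the models in as callables keeps the control flow testable without a GPU; the real Space may structure this differently.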
