jthteo committed
Commit d089c2e
1 Parent(s): 9dc7dd3

Added choice for Model Size

Files changed (1)
app.py +17 -3
app.py CHANGED
@@ -4,6 +4,13 @@ import gradio as gr
 import whisper
 
 model = whisper.load_model("small")
+current_size = 'small'
+
+def change_model(size):
+    if size == current_size:
+        return
+    model = whisper.load_model(size)
+    current_size = size
 
 
 def inference(audio):
@@ -131,13 +138,17 @@ with block:
                 <p style="margin-bottom: 10px; font-size: 94%">
                 Whisper is a general-purpose speech recognition model. It has been trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification. </p>
                 <p>This is a fork by JTHTEO.</p>
-                <p>This uses the small multi-lingual 244mill parameter Whisper model (<a href="https://github.com/openai/whisper/blob/main/model-card.md">Model Card</a>.) </p>
+                <p>The different sized Whisper models can be found in this (<a href="https://github.com/openai/whisper/blob/main/model-card.md">Model Card</a>.) </p>
                 </p>
               </div>
         """
     )
     with gr.Group():
         with gr.Box():
+            wmodel = gr.Radio(
+                choices=["tiny", "base", "small", "medium", "large"],
+                label="Model used",
+                value="small")
             with gr.Row().style(mobile_collapse=False, equal_height=True):
                 audio = gr.Audio(
                     label="Input Audio",
@@ -147,10 +158,13 @@ with block:
                 )
         btn = gr.Button("Transcribe")
         text = gr.Textbox(show_label=False)
-
-
+
+        ##events###
+        wmodel.change(change_model, inputs=[wmodel], outputs=[])
         btn.click(inference, inputs=[audio], outputs=[text])
 
+        ##footer###
+
     gr.HTML('''
         <div class="footer">
            <p>Model by <a href="https://github.com/openai/whisper" style="text-decoration: underline;" target="_blank">OpenAI</a> - Gradio Demo by 🤗 Hugging Face, this is a fork by JTHTEO
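
For context, here is a minimal standalone sketch of the model-switching pattern this commit introduces, assuming gradio and openai-whisper are installed. It reuses the diff's names (change_model, current_size, wmodel) but declares the globals explicitly; as committed, the assignments inside change_model only bind local names, so the reloaded model would not be the one inference uses. This is an illustrative sketch, not the committed file, and its inference is simplified relative to the app's.

    import gradio as gr
    import whisper

    model = whisper.load_model("small")
    current_size = "small"

    def change_model(size):
        # Declare the globals explicitly so the reloaded model replaces the one
        # inference() reads; plain assignments here would only create locals.
        global model, current_size
        if size == current_size:
            return
        model = whisper.load_model(size)
        current_size = size

    def inference(audio):
        # Simplified transcription path; the committed inference() does more work.
        result = model.transcribe(audio)
        return result["text"]

    with gr.Blocks() as block:
        wmodel = gr.Radio(
            choices=["tiny", "base", "small", "medium", "large"],
            label="Model used",
            value="small")
        audio = gr.Audio(label="Input Audio", type="filepath")
        btn = gr.Button("Transcribe")
        text = gr.Textbox(show_label=False)

        # Wire the radio to the model swap and the button to transcription,
        # mirroring the event hookups added in this commit.
        wmodel.change(change_model, inputs=[wmodel], outputs=[])
        btn.click(inference, inputs=[audio], outputs=[text])

    block.launch()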