Spaces:

sarulab-speech
/

CoCoCap-beta

Running on Zero

Wataru commited on Jan 23, 2024

Commit

ddb5be9

1 Parent(s): 41db67b

added mic input

Files changed (2) hide show

README.md CHANGED Viewed

@@ -10,7 +10,7 @@ pinned: false
 license: mit
 ---
 # CocoCap-beta
-このスペースでは，[CocoNutコーパス](https://sites.google.com/site/shinnosuketakamichi/research-topics/coconut_corpus)でfinetuningした[OpenAI whisper](https://github.com/openai/whisper)による声質キャプショニングを示します．
 # Contributors / 貢献者
 * [中田 亘](https://wataru-nakata.github.io)

 license: mit
 ---
 # CocoCap-beta
+このスペースでは，[CocoNutコーパス](https://sites.google.com/site/shinnosuketakamichi/research-topics/coconut_corpus)でfinetuningした[OpenAI whisper](https://github.com/openai/whisper)による声質キャプショニング（どんな人がどんなスタイルで喋っているかの文章化）を示します．
 # Contributors / 貢献者
 * [中田 亘](https://wataru-nakata.github.io)

app.py CHANGED Viewed

@@ -14,10 +14,15 @@ def main(audio_path):
     return transcribe(audio_path)["text"]
 iface = gr.Interface(
   fn=main,
-  inputs=gr.inputs.Audio(type='filepath'),
   outputs="text",
-  title="Whisper-base finetuned on Coco-Nut Corpus",
 ).launch(share=True)

     return transcribe(audio_path)["text"]
+with open('./README.md') as f:
+    md = f.readlines()
+    md = md[11:]
+    md = "\n".join(md)
 iface = gr.Interface(
   fn=main,
+  inputs=[gr.Audio(type='filepath',sources=['microphone','upload'])],
+  description=md,
   outputs="text",
+  title="CoCoCap-beta 日本語声質キャプショニンング with CocoNut Corpus",
 ).launch(share=True)