kimbochen/whisper-small-jp

Automatic Speech Recognition

Generated from Trainer


kimbochen committed on Dec 13, 2022

Commit fa6c669 (1 parent: 375d8f9)

update model card README.md

Files changed (1)

README.md +35 -17

@@ -1,31 +1,38 @@
 ---
-language:
-- ja
 license: apache-2.0
 tags:
-- whisper-event
 - generated_from_trainer
 datasets:
-- mozilla-foundation/common_voice_11_0
 model-index:
-- name: Whisper Small Japanese - Kimbo Chen
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Whisper Small Japanese - Kimbo Chen
-This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 0.2988
-- eval_wer: 74.7340
-- eval_runtime: 1501.9912
-- eval_samples_per_second: 3.065
-- eval_steps_per_second: 0.383
-- epoch: 5.09
-- step: 800
 ## Model description
@@ -46,14 +53,25 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
 - train_batch_size: 64
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 200
 - training_steps: 1000
 - mixed_precision_training: Native AMP
 ### Framework versions
 - Transformers 4.26.0.dev0

 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
 datasets:
+- common_voice_11_0
+metrics:
+- wer
 model-index:
+- name: openai/whisper-small
+  results:
+  - task:
+      name: Automatic Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: common_voice_11_0
+      type: common_voice_11_0
+      config: ja
+      split: test
+      args: ja
+    metrics:
+    - name: Wer
+      type: wer
+      value: 13.970036175005118
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# openai/whisper-small
+This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the common_voice_11_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2814
+- Wer: 13.9700
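
The Wer value above is a word error rate. The card does not show the evaluation code, so the following is only a minimal sketch of how such a score is commonly computed: token-level edit distance divided by the reference length, scaled to a percentage (which appears to match the scale of the numbers reported here). The function name `word_error_rate` and the pre-tokenized inputs are assumptions for illustration. Japanese has no whitespace word boundaries, and the tokenization behind the reported score is not stated in this card.

```python
from typing import Sequence


def word_error_rate(reference: Sequence[str], hypothesis: Sequence[str]) -> float:
    """Token-level Levenshtein distance as a percentage of the reference length.

    Hypothetical helper, not the metric implementation used for this model.
    """
    if not reference:
        raise ValueError("reference must contain at least one token")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis tokens.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_tok in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_tok in enumerate(hypothesis, start=1):
            substitution = prev[j - 1] + (0 if ref_tok == hyp_tok else 1)
            deletion = prev[j] + 1      # reference token missing from hypothesis
            insertion = curr[j - 1] + 1  # extra token in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return 100.0 * prev[-1] / len(reference)


if __name__ == "__main__":
    ref = "a b c d".split()
    hyp = "a x c".split()
    print(word_error_rate(ref, hyp))
```

Because insertions count as errors, a score computed this way can exceed 100 when the hypothesis is much longer than the reference.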
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
 - train_batch_size: 64
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 500
 - training_steps: 1000
 - mixed_precision_training: Native AMP
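
To make the scheduler settings above concrete, here is a minimal sketch of a linear schedule with warmup, assuming the usual shape: the learning rate ramps linearly from 0 to the peak over the warmup steps, then decays linearly to 0 at the final training step. The function name `linear_schedule_lr` is hypothetical, and the defaults are taken from the new version of the hyperparameter list (peak 1e-05, 500 warmup steps, 1000 training steps). The exact scheduler code used by the Trainer is not shown in the card.

```python
def linear_schedule_lr(
    step: int,
    base_lr: float = 1e-5,
    warmup_steps: int = 500,
    total_steps: int = 1000,
) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay.

    Hypothetical helper for illustration only.
    """
    if step < warmup_steps:
        # Warmup phase: scale from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay phase: scale from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for s in (0, 250, 500, 750, 1000):
        print(s, linear_schedule_lr(s))
```

Under these assumptions, half of the 1000 training steps are spent in warmup, so the peak learning rate is reached only at step 500.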
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|
+| 0.2515        | 1.06  | 200  | 0.2881          | 16.9442 |
+| 0.2212        | 2.12  | 400  | 0.2616          | 14.6884 |
+| 0.0774        | 4.04  | 600  | 0.2543          | 13.7687 |
+| 0.0564        | 5.09  | 800  | 0.2731          | 13.9769 |
+| 0.0221        | 7.01  | 1000 | 0.2814          | 13.9700 |
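
The table above can be read programmatically to compare checkpoints. The sketch below copies the five rows verbatim and picks the step with the lowest WER and the lowest validation loss. The `EvalRow` type and its field names are assumptions for illustration, not part of the model card.

```python
from typing import NamedTuple


class EvalRow(NamedTuple):
    """One row of the training results table (hypothetical container)."""
    training_loss: float
    epoch: float
    step: int
    validation_loss: float
    wer: float


# Values copied from the Training results table in the diff.
ROWS = [
    EvalRow(0.2515, 1.06, 200, 0.2881, 16.9442),
    EvalRow(0.2212, 2.12, 400, 0.2616, 14.6884),
    EvalRow(0.0774, 4.04, 600, 0.2543, 13.7687),
    EvalRow(0.0564, 5.09, 800, 0.2731, 13.9769),
    EvalRow(0.0221, 7.01, 1000, 0.2814, 13.9700),
]

best_by_wer = min(ROWS, key=lambda row: row.wer)
best_by_loss = min(ROWS, key=lambda row: row.validation_loss)
final = ROWS[-1]

if __name__ == "__main__":
    print("lowest WER:", best_by_wer.step, best_by_wer.wer)
    print("lowest validation loss:", best_by_loss.step, best_by_loss.validation_loss)
    print("final checkpoint:", final.step, final.wer)
```

In this table the lowest WER and the lowest validation loss both occur at step 600, while the headline figures in the card (Loss 0.2814, Wer 13.9700) match the final row at step 1000. Training loss keeps falling after step 600 as validation loss rises.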
 ### Framework versions
 - Transformers 4.26.0.dev0