Watarungurunnn
/

whisper-large-v3-ja

@@ -1,18 +1,18 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
 datasets:
 - common_voice_16_0
 metrics:
 - wer
-base_model: openai/whisper-large-v3
 model-index:
 - name: whisper-large-v3-ja
   results:
   - task:
-      type: automatic-speech-recognition
       name: Automatic Speech Recognition
     dataset:
       name: common_voice_16_0
       type: common_voice_16_0
@@ -20,9 +20,9 @@ model-index:
       split: validation
       args: ja
     metrics:
-    - type: wer
-      value: 38.775510204081634
-      name: Wer
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the common_voice_16_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.6403
-- Wer: 38.7755
 ## Model description
@@ -61,14 +61,21 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- training_steps: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
-| 1.7023        | 1.0   | 1    | 2.6403          | 38.7755 |
 ### Framework versions

 ---
 license: apache-2.0
+base_model: openai/whisper-large-v3
 tags:
 - generated_from_trainer
 datasets:
 - common_voice_16_0
 metrics:
 - wer
 model-index:
 - name: whisper-large-v3-ja
   results:
   - task:
       name: Automatic Speech Recognition
+      type: automatic-speech-recognition
     dataset:
       name: common_voice_16_0
       type: common_voice_16_0
       split: validation
       args: ja
     metrics:
+    - name: Wer
+      type: wer
+      value: 14.696501005043272
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the common_voice_16_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4210
+- Wer: 14.6965
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- training_steps: 4000
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
+| 0.1542        | 1.69  | 500  | 0.2712          | 15.6149 |
+| 0.0351        | 3.39  | 1000 | 0.3074          | 16.1866 |
+| 0.0081        | 5.08  | 1500 | 0.3475          | 15.3802 |
+| 0.0049        | 6.78  | 2000 | 0.3427          | 15.1804 |
+| 0.001         | 8.47  | 2500 | 0.3851          | 14.7302 |
+| 0.0004        | 10.17 | 3000 | 0.4109          | 14.7254 |
+| 0.0003        | 11.86 | 3500 | 0.4168          | 14.6953 |
+| 0.0003        | 13.56 | 4000 | 0.4210          | 14.6965 |
 ### Framework versions

model-00001-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:08e0005225b3dbaf55dd13ac62926cc7e02c1025d66fa375e6fb305ff79cd4f9
 size 4993448880

 version https://git-lfs.github.com/spec/v1
+oid sha256:a4b80ec7637784a453aae06edac8d3c9dd25c2e6386a54db78a9cd35b6dd59b6
 size 4993448880

model-00002-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:630ca774672856d2e0e39a702e590f635a1cfc5726a64b6578ab46dd367369a9
 size 1180663192

 version https://git-lfs.github.com/spec/v1
+oid sha256:46e27ac4e2f66f534d39d80fee3cf43b9981ad847532e1a2a840d1d72a61e603
 size 1180663192