End of training

Browse files

Files changed (10) hide show

.gitattributes +1 -0
README.md +26 -24
model.safetensors +2 -2
predictions_common_voice_13_en_common_voice_13_en_test_wer19.44.csv +0 -0
predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_hyp.trn +0 -0
predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_hyp.trn.dtl +0 -0
predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_hyp.trn.snt.utt +3 -0
predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_hyp.trn.sys +18 -0
predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_ref.trn +0 -0
training_args.bin +1 -1

.gitattributes CHANGED Viewed

@@ -34,3 +34,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 predictions_common_voice_13_en_common_voice_13_en_test_wer19.46_hyp.trn.snt.utt filter=lfs diff=lfs merge=lfs -text

 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 predictions_common_voice_13_en_common_voice_13_en_test_wer19.46_hyp.trn.snt.utt filter=lfs diff=lfs merge=lfs -text
+predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_hyp.trn.snt.utt filter=lfs diff=lfs merge=lfs -text

README.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 tags:
 - generated_from_trainer
-base_model: Lakoc/DeCRED_small_cv_2
 datasets:
 - common_voice_13_0
 metrics:
@@ -18,16 +18,16 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Lakoc/DeCRED_small_cv_2](https://huggingface.co/Lakoc/DeCRED_small_cv_2) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0590
 - Cer: 0.0632
-- Wer: 0.1471
-- Mer: 0.1444
 - Wil: 0.2408
 - Wip: 0.7592
-- Hits: 23158
-- Substitutions: 2931
-- Deletions: 484
-- Insertions: 494
 ## Model description
@@ -46,7 +46,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.05
 - train_batch_size: 256
 - eval_batch_size: 64
 - seed: 42
@@ -61,18 +61,23 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Cer    | Wer    | Mer    | Wil    | Wip    | Hits  | Substitutions | Deletions | Insertions |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:------:|:------:|:-----:|:-------------:|:---------:|:----------:|
-| 1.185         | 2.67  | 20   | 1.1215          | 0.0646 | 0.1520 | 0.1492 | 0.2479 | 0.7521 | 23030 | 3013          | 530       | 496        |
-| 1.0639        | 5.33  | 40   | 1.0717          | 0.0634 | 0.1480 | 0.1453 | 0.2420 | 0.7580 | 23124 | 2939          | 510       | 483        |
-| 1.0957        | 8.0   | 60   | 1.0612          | 0.0628 | 0.1464 | 0.1438 | 0.2402 | 0.7598 | 23162 | 2929          | 482       | 480        |
-| 1.0936        | 10.67 | 80   | 1.0595          | 0.0631 | 0.1469 | 0.1442 | 0.2407 | 0.7593 | 23158 | 2934          | 481       | 488        |
-| 1.0804        | 13.33 | 100  | 1.0591          | 0.0630 | 0.1468 | 0.1441 | 0.2405 | 0.7595 | 23164 | 2929          | 480       | 492        |
-| 1.1044        | 16.0  | 120  | 1.0591          | 0.0631 | 0.1469 | 0.1442 | 0.2405 | 0.7595 | 23162 | 2929          | 482       | 492        |
-| 1.0836        | 18.67 | 140  | 1.0589          | 0.0631 | 0.1468 | 0.1442 | 0.2405 | 0.7595 | 23163 | 2929          | 481       | 492        |
-| 1.0924        | 21.33 | 160  | 1.0590          | 0.0632 | 0.1471 | 0.1444 | 0.2408 | 0.7592 | 23158 | 2931          | 484       | 494        |
-| 1.1048        | 24.0  | 180  | 1.0590          | 0.0632 | 0.1471 | 0.1444 | 0.2408 | 0.7592 | 23158 | 2931          | 484       | 494        |
-| 1.0858        | 26.67 | 200  | 1.0589          | 0.0631 | 0.1470 | 0.1443 | 0.2407 | 0.7593 | 23160 | 2929          | 484       | 494        |
-| 1.0953        | 29.33 | 220  | 1.0589          | 0.0631 | 0.1468 | 0.1442 | 0.2405 | 0.7595 | 23163 | 2929          | 481       | 492        |
-| 1.1308        | 32.0  | 240  | 1.0590          | 0.0632 | 0.1471 | 0.1444 | 0.2408 | 0.7592 | 23158 | 2931          | 484       | 494        |
 ### Framework versions
@@ -81,6 +86,3 @@ The following hyperparameters were used during training:
 - Pytorch 2.2.0+rocm5.6
 - Datasets 2.18.0
 - Tokenizers 0.15.2
-### Wandb run
-https://wandb.ai/butspeechfit/decred_commonvoice_en/runs/DeCRED_linear_mixing_tuning

 ---
+base_model: Lakoc/DeCRED_small_cv_2
 tags:
 - generated_from_trainer
 datasets:
 - common_voice_13_0
 metrics:
 This model is a fine-tuned version of [Lakoc/DeCRED_small_cv_2](https://huggingface.co/Lakoc/DeCRED_small_cv_2) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0601
 - Cer: 0.0632
+- Wer: 0.1472
+- Mer: 0.1445
 - Wil: 0.2408
 - Wip: 0.7592
+- Hits: 23157
+- Substitutions: 2930
+- Deletions: 486
+- Insertions: 495
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.01
 - train_batch_size: 256
 - eval_batch_size: 64
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss | Cer    | Wer    | Mer    | Wil    | Wip    | Hits  | Substitutions | Deletions | Insertions |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:------:|:------:|:-----:|:-------------:|:---------:|:----------:|
+| 3.4981        | 2.67  | 20   | 3.3391          | 3.8755 | 3.5546 | 0.9635 | 0.9950 | 0.0050 | 3582  | 22099         | 892       | 71466      |
+| 1.2736        | 5.33  | 40   | 1.2175          | 0.0756 | 0.1717 | 0.1678 | 0.2775 | 0.7225 | 22623 | 3423          | 527       | 612        |
+| 1.1073        | 8.0   | 60   | 1.0687          | 0.0647 | 0.1511 | 0.1483 | 0.2464 | 0.7536 | 23059 | 2993          | 521       | 501        |
+| 1.0963        | 10.67 | 80   | 1.0656          | 0.0638 | 0.1492 | 0.1464 | 0.2436 | 0.7564 | 23122 | 2963          | 488       | 514        |
+| 1.0811        | 13.33 | 100  | 1.0630          | 0.0636 | 0.1478 | 0.1451 | 0.2416 | 0.7584 | 23152 | 2937          | 484       | 507        |
+| 1.1036        | 16.0  | 120  | 1.0617          | 0.0634 | 0.1476 | 0.1448 | 0.2410 | 0.7590 | 23160 | 2925          | 488       | 509        |
+| 1.0831        | 18.67 | 140  | 1.0610          | 0.0632 | 0.1474 | 0.1447 | 0.2410 | 0.7590 | 23157 | 2931          | 485       | 501        |
+| 1.0914        | 21.33 | 160  | 1.0607          | 0.0634 | 0.1478 | 0.1451 | 0.2418 | 0.7582 | 23142 | 2941          | 490       | 497        |
+| 1.1033        | 24.0  | 180  | 1.0605          | 0.0631 | 0.1470 | 0.1443 | 0.2405 | 0.7595 | 23162 | 2925          | 486       | 496        |
+| 1.0849        | 26.67 | 200  | 1.0603          | 0.0632 | 0.1472 | 0.1445 | 0.2407 | 0.7593 | 23159 | 2926          | 488       | 498        |
+| 1.0937        | 29.33 | 220  | 1.0603          | 0.0632 | 0.1473 | 0.1445 | 0.2407 | 0.7593 | 23160 | 2925          | 488       | 500        |
+| 1.1295        | 32.0  | 240  | 1.0601          | 0.0632 | 0.1471 | 0.1444 | 0.2406 | 0.7594 | 23162 | 2926          | 485       | 499        |
+| 1.0741        | 34.67 | 260  | 1.0602          | 0.0631 | 0.1471 | 0.1444 | 0.2405 | 0.7595 | 23161 | 2924          | 488       | 496        |
+| 1.073         | 37.33 | 280  | 1.0601          | 0.0631 | 0.1471 | 0.1444 | 0.2407 | 0.7593 | 23159 | 2927          | 487       | 496        |
+| 1.0846        | 40.0  | 300  | 1.0601          | 0.0631 | 0.1471 | 0.1445 | 0.2408 | 0.7592 | 23158 | 2929          | 486       | 495        |
+| 1.0717        | 42.67 | 320  | 1.0601          | 0.0632 | 0.1472 | 0.1445 | 0.2408 | 0.7592 | 23158 | 2929          | 486       | 497        |
+| 1.1017        | 45.33 | 340  | 1.0601          | 0.0632 | 0.1472 | 0.1445 | 0.2408 | 0.7592 | 23157 | 2930          | 486       | 495        |
 ### Framework versions
 - Pytorch 2.2.0+rocm5.6
 - Datasets 2.18.0
 - Tokenizers 0.15.2

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2d38c60c905689cd2eb72d6225b6037af9e159ba63b6684421cdc867bd21ce74
-size 144243304

 version https://git-lfs.github.com/spec/v1
+oid sha256:a99a5ad59f0cc2e78884c6570aadddf02ec9114cf852352d0e079d69a3aac04c
+size 144251296

predictions_common_voice_13_en_common_voice_13_en_test_wer19.44.csv ADDED Viewed

The diff for this file is too large to render. See raw diff

predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_hyp.trn ADDED Viewed

The diff for this file is too large to render. See raw diff

predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_hyp.trn.dtl ADDED Viewed

The diff for this file is too large to render. See raw diff

predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_hyp.trn.snt.utt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:507d3811282389cafe62d8998e5071e19271afc8194cae8425115704eaa3eb1f
+size 11430312

predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_hyp.trn.sys ADDED Viewed

	@@ -0,0 +1,18 @@

+                     SYSTEM SUMMARY PERCENTAGES by SPEAKER
+,-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------.
+|/scratch/project_465000836/ipoloka/huggingface_asr/experiments/decred/commonvoice/DeCRED_linear_mixing_tuning/predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_hyp.trn|
+|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+|       SPKR         |        # Snt                 # Wrd        |       Corr                 Sub                Del                 Ins                Err               S.Err       |
+|--------------------+-------------------------------------------+--------------------------------------------------------------------------------------------------------------------|
+|       utt          |       16365                 144023        |       83.3                14.2                2.5                 2.7               19.4                63.1       |
+|=====================================================================================================================================================================================|
+|       Sum/Avg      |       16365                 144023        |       83.3                14.2                2.5                 2.7               19.4                63.1       |
+|=====================================================================================================================================================================================|
+|        Mean        |      16365.0               144023.0       |       83.3                14.2                2.5                 2.7               19.4                63.1       |
+|        S.D.        |         0.0                   0.0         |        0.0                 0.0                0.0                 0.0                0.0                 0.0       |
+|       Median       |      16365.0               144023.0       |       83.3                14.2                2.5                 2.7               19.4                63.1       |
+`-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------'

predictions_common_voice_13_en_common_voice_13_en_test_wer19.44_ref.trn ADDED Viewed

The diff for this file is too large to render. See raw diff

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8d149e7d20e1596cfcc0fbf07335f4e8275568f7a342982420a9e80721de1adb
 size 5688

 version https://git-lfs.github.com/spec/v1
+oid sha256:315c4a2be87925de586b362d37c1c982b3c51d788cdfddab1839b822d84cfbac
 size 5688