End of training

Files changed (4)

README.md CHANGED

@@ -23,7 +23,7 @@ model-index:
       args: default
     metrics:
     - type: wer
-      value: 47.95539033457249
       name: Wer
 ---
@@ -34,8 +34,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [b-brave/asr_double_training_15-10-2024_merged](https://huggingface.co/b-brave/asr_double_training_15-10-2024_merged) on the ASR_BB_and_EC dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3965
-- Wer: 47.9554
 ## Model description
@@ -63,16 +63,18 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 50
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Wer     |
 |:-------------:|:------:|:----:|:---------------:|:-------:|
-| 0.6316        | 0.8929 | 100  | 0.4368          | 37.1747 |
-| 0.4745        | 1.7857 | 200  | 0.4086          | 48.6989 |
-| 0.4211        | 2.6786 | 300  | 0.3965          | 47.9554 |
 ### Framework versions

       args: default
     metrics:
     - type: wer
+      value: 39.03345724907063
       name: Wer
 ---
 This model is a fine-tuned version of [b-brave/asr_double_training_15-10-2024_merged](https://huggingface.co/b-brave/asr_double_training_15-10-2024_merged) on the ASR_BB_and_EC dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3711
+- Wer: 39.0335
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 50
+- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Wer     |
 |:-------------:|:------:|:----:|:---------------:|:-------:|
+| 0.5871        | 0.8929 | 100  | 0.4197          | 64.5601 |
+| 0.4071        | 1.7857 | 200  | 0.3965          | 47.5836 |
+| 0.3503        | 2.6786 | 300  | 0.3837          | 46.5923 |
+| 0.2778        | 3.5714 | 400  | 0.3777          | 46.2206 |
+| 0.2195        | 4.4643 | 500  | 0.3711          | 39.0335 |
 ### Framework versions
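
As a quick reading aid for the README changes above, the following sketch compares the final-checkpoint WER of the two runs and infers the number of optimizer steps per epoch from the logged `Epoch`/`Step` columns. It uses only numbers that appear in the diff; the variable names are illustrative. It compares final checkpoints only, and the earlier run logged a lower intermediate WER (37.1747 at step 100) than its final value.

```python
# Final-checkpoint WER values taken from the README diff above.
wer_before = 47.9554  # previous run (num_epochs: 3)
wer_after = 39.0335   # this run (num_epochs: 5)

absolute_drop = wer_before - wer_after
relative_drop = absolute_drop / wer_before

# The results table logs epoch 0.8929 at step 100, which implies the
# number of steps per epoch (rounded to the nearest integer).
steps_per_epoch = round(100 / 0.8929)
total_steps = steps_per_epoch * 5  # num_epochs: 5

print(f"absolute WER drop: {absolute_drop:.4f} points")
print(f"relative WER drop: {relative_drop:.1%}")
print(f"steps per epoch: {steps_per_epoch}, steps for 5 epochs: {total_steps}")
```

The inferred 112 steps per epoch is consistent with the other rows, for example 500 / 112 ≈ 4.4643.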

adapter_config.json CHANGED

@@ -13,7 +13,7 @@
   "layers_pattern": null,
   "layers_to_transform": null,
   "loftq_config": {},
-  "lora_alpha": 32,
   "lora_dropout": 0.01,
   "megatron_config": null,
   "megatron_core": "megatron.core",

   "layers_pattern": null,
   "layers_to_transform": null,
   "loftq_config": {},
+  "lora_alpha": 64,
   "lora_dropout": 0.01,
   "megatron_config": null,
   "megatron_core": "megatron.core",
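
To illustrate what the `lora_alpha` change from 32 to 64 means, here is a minimal sketch of the usual LoRA scaling factor, `lora_alpha / r` (or `lora_alpha / sqrt(r)` when rank-stabilized LoRA is enabled). The rank `r` is not visible in this diff, so `ASSUMED_RANK` below is a hypothetical value chosen only for illustration, and `lora_scaling` is a helper written for this sketch rather than a library function.

```python
import math


def lora_scaling(lora_alpha: float, r: int, use_rslora: bool = False) -> float:
    """Return the factor that multiplies the low-rank update B @ A."""
    if use_rslora:
        return lora_alpha / math.sqrt(r)
    return lora_alpha / r


# Hypothetical rank: the actual value of "r" is not shown in this diff.
ASSUMED_RANK = 16

old_scale = lora_scaling(32, ASSUMED_RANK)
new_scale = lora_scaling(64, ASSUMED_RANK)

print(f"old scaling: {old_scale}, new scaling: {new_scale}")
print(f"ratio new/old: {new_scale / old_scale}")
```

Whatever the actual rank is, doubling `lora_alpha` with `r` held fixed doubles the scaling of the adapter update.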

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bed9aedf4567784db6f5be218dea51883b22c36fe559f645eb3e3048a87c197d
 size 37789960

 version https://git-lfs.github.com/spec/v1
+oid sha256:a1ea3bb76f00b5c9572fa7d6a46d97acf06f30d4072b5334b08b7931c45c4d60
 size 37789960

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e38cca5d1e211828e034e2b518002731563038434ea35002737f016bc0385a69
 size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:d74f173f5d8a8747d397daf447cd1cdc6705eeb46951c58adcc17396241a6bc5
 size 5368
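
The two binary files above are stored as Git LFS pointers, so the diff shows only a changed `oid` with an unchanged `size`. That pattern would be expected when the contents are retrained or regenerated but the file layout stays the same. The sketch below parses a pointer and checks a downloaded file's bytes against it; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Check that raw file bytes have the size and SHA-256 the pointer records."""
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["oid"]
    )


# New pointer for adapter_model.safetensors, copied from the diff above.
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:a1ea3bb76f00b5c9572fa7d6a46d97acf06f30d4072b5334b08b7931c45c4d60\n"
    "size 37789960\n"
)
print(new_pointer["algo"], new_pointer["size"])
```

With the real file on disk, `matches_pointer(open(path, "rb").read(), new_pointer)` would confirm that the download corresponds to this commit; `path` here stands for wherever the file was saved.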