jerrychatz
/

wav2vec2-large-xls-r-300m-greek

@@ -15,8 +15,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on the common_voice dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4860
-- Wer: 0.3464
 ## Model description
@@ -35,7 +35,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0003
 - train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
@@ -43,7 +43,6 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 500
 - num_epochs: 30
 - mixed_precision_training: Native AMP
@@ -51,23 +50,23 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 3.1726        | 1.72  | 200  | 0.6130          | 0.4181 |
-| 0.1165        | 3.45  | 400  | 0.6819          | 0.4588 |
-| 0.1272        | 5.17  | 600  | 0.5812          | 0.4385 |
-| 0.1253        | 6.89  | 800  | 0.5866          | 0.4723 |
-| 0.1136        | 8.62  | 1000 | 0.5289          | 0.4361 |
-| 0.0936        | 10.34 | 1200 | 0.5785          | 0.4265 |
-| 0.0853        | 12.07 | 1400 | 0.5314          | 0.4365 |
-| 0.0761        | 13.79 | 1600 | 0.5065          | 0.4056 |
-| 0.0707        | 15.52 | 1800 | 0.5427          | 0.3948 |
-| 0.0632        | 17.24 | 2000 | 0.5227          | 0.3840 |
-| 0.0501        | 18.96 | 2200 | 0.5150          | 0.3875 |
-| 0.0462        | 20.69 | 2400 | 0.4949          | 0.3747 |
-| 0.0402        | 22.41 | 2600 | 0.4773          | 0.3720 |
-| 0.0339        | 24.14 | 2800 | 0.4719          | 0.3593 |
-| 0.0302        | 25.86 | 3000 | 0.4995          | 0.3540 |
-| 0.028         | 27.58 | 3200 | 0.4840          | 0.3510 |
-| 0.0244        | 29.31 | 3400 | 0.4860          | 0.3464 |
 ### Framework versions

 This model was trained from scratch on the common_voice dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4823
+- Wer: 0.3338
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 3e-05
 - train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 30
 - mixed_precision_training: Native AMP
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.0106        | 1.72  | 200  | 0.5519          | 0.3537 |
+| 0.0249        | 3.45  | 400  | 0.5174          | 0.3465 |
+| 0.0206        | 5.17  | 600  | 0.4721          | 0.3323 |
+| 0.0221        | 6.89  | 800  | 0.4652          | 0.3373 |
+| 0.0204        | 8.62  | 1000 | 0.4883          | 0.3389 |
+| 0.0192        | 10.34 | 1200 | 0.4785          | 0.3389 |
+| 0.0186        | 12.07 | 1400 | 0.4789          | 0.3378 |
+| 0.0172        | 13.79 | 1600 | 0.4915          | 0.3347 |
+| 0.0184        | 15.52 | 1800 | 0.4759          | 0.3440 |
+| 0.0168        | 17.24 | 2000 | 0.4891          | 0.3371 |
+| 0.0155        | 18.96 | 2200 | 0.4928          | 0.3394 |
+| 0.0146        | 20.69 | 2400 | 0.4834          | 0.3357 |
+| 0.0146        | 22.41 | 2600 | 0.4814          | 0.3362 |
+| 0.0151        | 24.14 | 2800 | 0.4791          | 0.3345 |
+| 0.0136        | 25.86 | 3000 | 0.4825          | 0.3356 |
+| 0.0136        | 27.58 | 3200 | 0.4850          | 0.3351 |
+| 0.0127        | 29.31 | 3400 | 0.4823          | 0.3338 |
 ### Framework versions