Update README.md
README.md CHANGED
@@ -63,7 +63,7 @@ It achieves the following results:
 - WER on ShEMO dev set: 32.85
 - WER on Common Voice 13 test set: 19.21
 
-## Evaluation results
+## Evaluation results 🧪
 | Checkpoint Name | WER on ShEMO dev set | WER on Common Voice 13 test set | Max WER :) |
 | :---------------------------------------------------------------------------------------------------------------: | :------: | :-------: | :---: |
 | [m3hrdadfi/wav2vec2-large-xlsr-persian-v3](https://huggingface.co/m3hrdadfi/wav2vec2-large-xlsr-persian-v3) | 46.55 | **17.43** | 46.55 |
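For context on the WER numbers above: a comparable figure can be computed with the 🤗 `evaluate` library. This is a minimal sketch with made-up transcripts, not the evaluation script behind the reported results:

```python
# Minimal WER sketch using the Hugging Face `evaluate` library (wraps jiwer).
# The example transcripts are illustrative, not from ShEMO or Common Voice.
import evaluate

wer_metric = evaluate.load("wer")

references = ["a sample reference transcript"]    # ground-truth text
predictions = ["a sample reference transcript."]  # model output

# `compute` returns a fraction; multiply by 100 to match the tables above.
wer = 100 * wer_metric.compute(predictions=predictions, references=references)
print(f"WER: {wer:.2f}")
```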
@@ -73,7 +73,7 @@ It achieves the following results:
 
 As you can see, my model performs better in the maximum case :D
 
-## Training procedure
+## Training procedure 🏋️
 
 #### Training hyperparameters
 
@@ -92,7 +92,7 @@ The following hyperparameters were used during training:
 
 You may need *gradient_accumulation* to get a larger effective batch size.
 
-#### Training
+#### Training log
 
 | Training Loss | Epoch | Step | Validation Loss | Wer |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
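As an aside on the *gradient_accumulation* note in this hunk: in 🤗 `transformers` it is the `gradient_accumulation_steps` argument of `TrainingArguments`. A sketch with illustrative numbers, not this model's actual configuration:

```python
# Gradient accumulation sketch with transformers' TrainingArguments.
# The values below are illustrative assumptions, not this model's config:
# gradients from 4 consecutive batches are accumulated before each optimizer
# step, giving an effective batch size of 8 * 4 = 32.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="wav2vec2-finetune",  # hypothetical output path
    per_device_train_batch_size=8,   # what fits in GPU memory
    gradient_accumulation_steps=4,   # optimizer steps once every 4 batches
)
```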
@@ -117,7 +117,7 @@ You may need *gradient_accumulation* to get a larger effective batch size.
 | 0.8238 | 11.88 | 1900 | 0.6735 | 0.3297 |
 | 0.7618 | 12.5 | 2000 | 0.6728 | 0.3286 |
 
-#### Hyperparameter tuning
+#### Hyperparameter tuning 🔧
 Several models with different hyperparameters were trained. The following figures show the training process for three of them.
 ![wer](wandb-wer.png)
 ![loss](wandb-loss.png)
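The `wandb-*.png` figures referenced in this hunk come from Weights & Biases. One way to get equivalent curves from the `Trainer` is sketched below; the project name and logging interval are assumptions:

```python
# Sketch: stream Trainer metrics to Weights & Biases, as in the figures above.
import os
from transformers import TrainingArguments

os.environ["WANDB_PROJECT"] = "wav2vec2-persian"  # hypothetical project name

training_args = TrainingArguments(
    output_dir="wav2vec2-finetune",  # hypothetical output path
    report_to="wandb",               # send training/eval logs to W&B
    logging_steps=100,               # granularity of the logged curves
)
```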
@@ -182,5 +182,7 @@ Check out [this blog](https://huggingface.co/blog/fine-tune-xlsr-wav2vec2) for m
 
 ## Contact us 🤝
 If you have any technical questions regarding the model, pretraining, code, or publication, please create an issue in the repository. This is the *best* way to reach us.
 
-## Citation
-*TO DO!*
+## Citation 🖇
+*TO DO!*
+
+**Fine-tuned with ❤️ without ☕︎**