Ashegh-Sad-Warrior/my_awesome_opus_books_model

@@ -3,6 +3,8 @@ license: apache-2.0
 base_model: google-t5/t5-small
 tags:
 - generated_from_trainer
+metrics:
+- bleu
 model-index:
 - name: my_awesome_opus_books_model
   results: []
@@ -17,9 +19,14 @@ should probably proofread and complete it, then remove this comment. -->
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>]()
 # my_awesome_opus_books_model
 This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.2630
+- Bleu: 11.5935
+- Gen Len: 11.9413
 ## Model description
@@ -47,6 +54,14 @@ The following hyperparameters were used during training:
 - num_epochs: 2
 - mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| 2.578         | 1.0   | 3178 | 2.3047          | 11.3146 | 11.7909 |
+| 2.484         | 2.0   | 6356 | 2.2630          | 11.5935 | 11.9413 |
 ### Framework versions
 - Transformers 4.42.3
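
The training results table added in this hunk can be read programmatically to check the per-epoch numbers. The following is a minimal sketch, assuming the table is copied out of the diff without the leading `+` markers; `parse_markdown_table` is a hypothetical helper written for this illustration and is not part of any library.

```python
# Rows copied from the "Training results" table in the README hunk,
# with the leading "+" diff markers removed.
TABLE = """\
| Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
| 2.578         | 1.0   | 3178 | 2.3047          | 11.3146 | 11.7909 |
| 2.484         | 2.0   | 6356 | 2.2630          | 11.5935 | 11.9413 |
"""


def parse_markdown_table(text):
    """Parse a simple pipe-delimited Markdown table into a list of dicts.

    Hypothetical helper: assumes every data cell is numeric.
    """
    lines = [ln.strip() for ln in text.strip().splitlines()]
    header = [cell.strip() for cell in lines[0].strip("|").split("|")]
    rows = []
    for ln in lines[2:]:  # skip the header row and the alignment row
        cells = [float(cell.strip()) for cell in ln.strip("|").split("|")]
        rows.append(dict(zip(header, cells)))
    return rows


rows = parse_markdown_table(TABLE)
first, last = rows[0], rows[-1]

# Optimizer steps between consecutive epoch checkpoints.
steps_per_epoch = (last["Step"] - first["Step"]) / (last["Epoch"] - first["Epoch"])
# Change in the reported metrics from epoch 1 to epoch 2.
bleu_gain = last["Bleu"] - first["Bleu"]
val_loss_drop = first["Validation Loss"] - last["Validation Loss"]

print(f"steps per epoch: {steps_per_epoch:.0f}")
print(f"BLEU gain: {bleu_gain:.4f}")
print(f"validation loss drop: {val_loss_drop:.4f}")
```

The last row of the table matches the summary lines added near the top of the card (Loss 2.2630, Bleu 11.5935, Gen Len 11.9413), so the summary appears to report the final epoch.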

generation_config.json

@@ -1,5 +1,4 @@
 {
-  "_from_model_config": true,
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
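
The only change visible in this hunk is the removal of the `_from_model_config` key; the three token ids are untouched. A minimal sketch of that comparison follows, assuming only the keys shown in the hunk (the remainder of the file is not visible here and is deliberately left out). `diff_keys` is a hypothetical helper for this illustration.

```python
import json

# Only the keys visible in the hunk; anything else in the file is not shown
# in the diff and is intentionally omitted.
old_visible = {
    "_from_model_config": True,
    "decoder_start_token_id": 0,
    "eos_token_id": 1,
    "pad_token_id": 0,
}
new_visible = {
    "decoder_start_token_id": 0,
    "eos_token_id": 1,
    "pad_token_id": 0,
}


def diff_keys(old, new):
    """Return (removed, added, changed) key lists between two flat dicts."""
    removed = sorted(set(old) - set(new))
    added = sorted(set(new) - set(old))
    changed = sorted(k for k in set(old) & set(new) if old[k] != new[k])
    return removed, added, changed


removed, added, changed = diff_keys(old_visible, new_visible)
print(json.dumps({"removed": removed, "added": added, "changed": changed}))
```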

runs/Aug04_11-09-44_e8185cfad283/events.out.tfevents.1722769809.e8185cfad283.34.12

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8ba6ff423bc1202da369eab386a655da797cd6b2438d2bc43d47894431325db4
-size 8771
+oid sha256:5dea29f4f6f3aa3a2cb274e0510ba4030e9c6ce7890b2d92d2f127bcfe1348f6
+size 9495
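
The event file is stored through Git LFS, so the diff only shows its pointer: a `version` line, an `oid` line carrying a SHA-256 digest, and a `size` line in bytes. A minimal sketch of reading the two pointers and comparing them follows; `parse_lfs_pointer` is a hypothetical helper for this illustration, not a Git LFS API.

```python
import re

# Pointer contents before and after the commit, copied from the hunk
# without the diff markers.
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8ba6ff423bc1202da369eab386a655da797cd6b2438d2bc43d47894431325db4
size 8771
"""
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:5dea29f4f6f3aa3a2cb274e0510ba4030e9c6ce7890b2d92d2f127bcfe1348f6
size 9495
"""


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, digest, and size."""
    # Each line is "<key> <value>"; split on the first space only.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    if algo != "sha256" or not re.fullmatch(r"[0-9a-f]{64}", digest):
        raise ValueError("unexpected oid format")
    return {
        "version": fields["version"],
        "sha256": digest,
        "size": int(fields["size"]),
    }


old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

# The log grew and its content hash changed, which is what one would expect
# if further events were appended to the same run.
size_delta = new["size"] - old["size"]
print(f"size change: {size_delta} bytes")
print(f"content changed: {old['sha256'] != new['sha256']}")
```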