End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -20,9 +20,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/madlad400-3b-mt](https://huggingface.co/google/madlad400-3b-mt) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.4321
-- Bleu: 3.9461
-- Gen Len: 7.2243
 ## Model description
@@ -41,9 +41,9 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0003
 - train_batch_size: 18
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -53,9 +53,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:-------:|
-| 2.1442        | 1.0   | 4997  | 4.4262          | 3.0781 | 7.4202  |
-| 2.0066        | 2.0   | 9994  | 4.4181          | 3.8977 | 7.2596  |
-| 1.948         | 3.0   | 14991 | 4.4321          | 3.9461 | 7.2243  |
 ### Framework versions

 This model is a fine-tuned version of [google/madlad400-3b-mt](https://huggingface.co/google/madlad400-3b-mt) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.1804
+- Bleu: 4.0833
+- Gen Len: 7.1314
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.00016
 - train_batch_size: 18
+- eval_batch_size: 18
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step  | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|:-------:|
+| 2.1185        | 1.0   | 4957  | 4.1772          | 3.9348 | 7.1858  |
+| 2.0246        | 2.0   | 9914  | 4.1778          | 4.1343 | 7.1091  |
+| 1.9755        | 3.0   | 14871 | 4.1804          | 4.0833 | 7.1314  |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:85b61cce3c62cc93fd32755c046f0b5bf8607a3bfed3f67187ec10af1a83c9da
 size 37803600

 version https://git-lfs.github.com/spec/v1
+oid sha256:f66d3d5a00637884dc5f28f0b9d33e3e5e5e6cb8283d0eebda00b55c154ad782
 size 37803600

runs/Jun28_04-16-41_bd278a7072d8/events.out.tfevents.1719548908.bd278a7072d8.25.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:b11620ea0908fda6324512a0630b6fe80fd8c4226a9ac35d8d73b035bb3abed2
+size 13536

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d206e08e489493864d895ce783997572623b08184be85173c05289750cb983c3
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:3d2922f57801518c2b1e92a3a7edcff96cb329bb4f3c66b1da4fad9e390a0565
 size 5304