meoo225/FLANT5_base

@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0872
-- Bleu Score: 25.2243
-- Gen Len: 18.8053
 ## Model description
@@ -37,21 +37,22 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0002
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:----------:|:-------:|
-| 0.2           | 1.0   | 838  | 0.1229          | 24.1188    | 18.8076 |
-| 0.1099        | 2.0   | 1676 | 0.0951          | 24.7563    | 18.8017 |
-| 0.0841        | 3.0   | 2514 | 0.0872          | 25.2243    | 18.8053 |
 ### Framework versions

 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0941
+- Bleu Score: 24.8094
+- Gen Len: 18.8172
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0001
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 4
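To make the effect of `lr_scheduler_type: linear` concrete, here is a minimal sketch of how the learning rate would decay over the updated run. It assumes zero warmup steps (the card lists none) and takes the 838 steps per epoch from the Step column of the training results table; the function name `linear_lr` is hypothetical and not part of any library.

```python
# Hyperparameters copied from the updated card.
LEARNING_RATE = 1e-4
NUM_EPOCHS = 4
STEPS_PER_EPOCH = 838  # taken from the Step column of the training results table

# Assumption: no warmup steps, since the card does not list any.
WARMUP_STEPS = 0
TOTAL_STEPS = NUM_EPOCHS * STEPS_PER_EPOCH


def linear_lr(step, base_lr=LEARNING_RATE, total_steps=TOTAL_STEPS,
              warmup_steps=WARMUP_STEPS):
    """Learning rate after `step` updates: linear warmup, then linear decay to 0."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Print the learning rate at the start of training and at each epoch boundary.
    for epoch in range(NUM_EPOCHS + 1):
        step = epoch * STEPS_PER_EPOCH
        print(f"epoch {epoch}: step {step:4d}, lr {linear_lr(step):.2e}")
```

Under these assumptions the rate falls to half its initial value by the end of epoch 2 and reaches zero at step 3352, so the fourth epoch trains with the smallest updates of the run.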
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:----------:|:-------:|
+| 0.2195        | 1.0   | 838  | 0.1354          | 23.689     | 18.7945 |
+| 0.1289        | 2.0   | 1676 | 0.1085          | 24.4546    | 18.8053 |
+| 0.104         | 3.0   | 2514 | 0.0969          | 24.8285    | 18.8112 |
+| 0.0906        | 4.0   | 3352 | 0.0941          | 24.8094    | 18.8172 |
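The table rows can be compared programmatically. The sketch below copies the four rows of the updated run and the final row of the previous run (learning rate 0.0002, 3 epochs) and reports which epoch is best under each metric. The helper names are hypothetical, and the comparison only restates the numbers shown in this diff.

```python
# Rows copied from the updated training results table:
# (epoch, step, validation_loss, bleu)
NEW_RUN = [
    (1, 838, 0.1354, 23.689),
    (2, 1676, 0.1085, 24.4546),
    (3, 2514, 0.0969, 24.8285),
    (4, 3352, 0.0941, 24.8094),
]

# Final row of the previous run (learning_rate 0.0002, num_epochs 3).
OLD_FINAL = (3, 2514, 0.0872, 25.2243)


def best_by_loss(rows):
    """Row with the lowest validation loss."""
    return min(rows, key=lambda row: row[2])


def best_by_bleu(rows):
    """Row with the highest BLEU score."""
    return max(rows, key=lambda row: row[3])


if __name__ == "__main__":
    new_final = NEW_RUN[-1]
    print("best epoch by validation loss:", best_by_loss(NEW_RUN)[0])
    print("best epoch by BLEU:", best_by_bleu(NEW_RUN)[0])
    # Positive loss delta and negative BLEU delta both mean the new run is worse.
    print(f"final loss delta vs previous run: {new_final[2] - OLD_FINAL[2]:+.4f}")
    print(f"final BLEU delta vs previous run: {new_final[3] - OLD_FINAL[3]:+.4f}")
```

In the updated run the two metrics disagree on the best checkpoint: validation loss is lowest at epoch 4, while BLEU peaks at epoch 3.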
 ### Framework versions

logs/events.out.tfevents.1731560328.ea42bc39d353.1329.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c66f20a59f73a8f08ff5b22187694ef12ab6d3d08f4be394e98646c960c83503
-size 7835

 version https://git-lfs.github.com/spec/v1
+oid sha256:ea45aca115275b0e86cf808aecfd5ea22651c9c2161713a23f09d17bf920536f
+size 8776
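The three lines above form a Git LFS pointer: a spec version, a SHA-256 object id, and the size in bytes of the real file. As a rough sketch of how such a pointer could be checked against downloaded file contents, the following uses only the Python standard library. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of Git LFS or any Hugging Face tooling.

```python
import hashlib

# Updated pointer text as shown in this diff.
NEW_POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ea45aca115275b0e86cf808aecfd5ea22651c9c2161713a23f09d17bf920536f
size 8776
"""


def parse_lfs_pointer(text):
    """Parse 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(pointer, payload):
    """Return True if `payload` (bytes) has the size and SHA-256 the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(payload) == pointer["size"]
        and hashlib.sha256(payload).hexdigest() == pointer["digest"]
    )


if __name__ == "__main__":
    pointer = parse_lfs_pointer(NEW_POINTER_TEXT)
    print(pointer["algo"], pointer["size"])
    # With the real event file on disk (the path is an assumption), one could run:
    # with open("logs/events.out.tfevents.1731560328.ea42bc39d353.1329.0", "rb") as f:
    #     print(matches_pointer(pointer, f.read()))
```

The size grows from 7835 to 8776 bytes between the two pointers, which is consistent with the event log recording one more epoch, although the diff itself does not state the cause.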