tiagoblima/t5_base-qg-ap-oficial

Text2Text Generation

Generated from Trainer


tiagoblima committed on Jan 25

Commit e3d4861 • 1 Parent(s): d162079

Model save

Files changed (2)

README.md +11 -15
model.safetensors +1 -1

README.md CHANGED

@@ -3,8 +3,6 @@ license: mit
 base_model: unicamp-dl/ptt5-base-portuguese-vocab
 tags:
 - generated_from_trainer
-datasets:
-- tiagoblima/du-qg-squadv1_pt
 model-index:
 - name: t5_base-qg-ap-oficial
   results: []
@@ -15,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # t5_base-qg-ap-oficial
-This model is a fine-tuned version of [unicamp-dl/ptt5-base-portuguese-vocab](https://huggingface.co/unicamp-dl/ptt5-base-portuguese-vocab) on the tiagoblima/du-qg-squadv1_pt dataset.
+This model is a fine-tuned version of [unicamp-dl/ptt5-base-portuguese-vocab](https://huggingface.co/unicamp-dl/ptt5-base-portuguese-vocab) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9945
+- Loss: 1.7673
 ## Model description
@@ -37,11 +35,9 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0001
-- train_batch_size: 8
-- eval_batch_size: 4
+- train_batch_size: 32
+- eval_batch_size: 64
 - seed: 42
-- gradient_accumulation_steps: 4
-- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-06
 - lr_scheduler_type: linear
 - num_epochs: 5.0
@@ -50,16 +46,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 2.1593        | 1.0   | 2366  | 2.0509          |
-| 1.9907        | 2.0   | 4733  | 2.0158          |
-| 1.9408        | 3.0   | 7099  | 2.0011          |
-| 1.8321        | 4.0   | 9466  | 1.9945          |
-| 1.7928        | 5.0   | 11830 | 1.9992          |
+| 1.928         | 1.0   | 2367  | 1.8312          |
+| 1.7467        | 2.0   | 4734  | 1.7875          |
+| 1.6658        | 3.0   | 7101  | 1.7681          |
+| 1.5892        | 4.0   | 9468  | 1.7632          |
+| 1.5483        | 5.0   | 11835 | 1.7673          |
 ### Framework versions
 - Transformers 4.35.2
-- Pytorch 2.0.0
+- Pytorch 2.1.0+cu121
 - Datasets 2.15.0
-- Tokenizers 0.15.0
+- Tokenizers 0.15.1
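
The hyperparameter lines in this diff change how each optimizer step's batch is assembled, so it may help to check whether the overall batch size moved. The sketch below works through that arithmetic using only the numbers from the diff. It assumes a single training device (the card does not state the device count) and assumes the new run used no gradient accumulation, since that field is absent from the new card. The function name `effective_batch_size` is hypothetical, not part of any library.

```python
def effective_batch_size(per_device: int, grad_accum: int = 1, num_devices: int = 1) -> int:
    """Number of examples contributing to one optimizer step."""
    return per_device * grad_accum * num_devices


# Old card: train_batch_size 8 with gradient_accumulation_steps 4.
old_total = effective_batch_size(8, grad_accum=4)

# New card: train_batch_size 32, accumulation assumed to be 1 (not listed).
new_total = effective_batch_size(32)

# Steps per epoch, taken from the final row of each results table (5 epochs).
old_steps_per_epoch = 11830 // 5
new_steps_per_epoch = 11835 // 5

print(old_total, new_total)                      # examples per optimizer step
print(old_steps_per_epoch, new_steps_per_epoch)  # optimizer steps per epoch
```

Under those assumptions both runs take 32 examples per optimizer step, which is consistent with the nearly identical step counts per epoch in the two tables. The lower validation loss in the new table therefore cannot be attributed to a different effective batch size on this evidence alone.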

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bd14265aac0435771eae311887b5e52c81da1f11b0d5510b208a55e950ec469e
+oid sha256:54dc671bbc7016070224e672cbd10a575b40c747646ff60d23615c79a4d1d41f
 size 891644712
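
The `model.safetensors` change above is a Git LFS pointer update: the `oid` differs while `size` stays at 891644712 bytes. One way to confirm that a downloaded weights file corresponds to this commit is to hash it and compare against the pointer. The following is a minimal sketch using only the Python standard library; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local path in the usage note is an assumption.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and hash named in the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer content for the new revision, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:54dc671bbc7016070224e672cbd10a575b40c747646ff60d23615c79a4d1d41f
size 891644712
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

With a local copy of the weights, `matches_pointer("model.safetensors", pointer)` would report whether that file is the one referenced by this commit.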