license: apache-2.0
base_model: badokorach/flan-t5-small-qa
tags:
  - generated_from_trainer
model-index:
  - name: flan-t5-small-qa
    results: []

flan-t5-small-qa

This model is a fine-tuned version of badokorach/flan-t5-small-qa on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.0862 (validation loss after the final epoch, epoch 20)

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 4
  • eval_batch_size: 4
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 20
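
The linear learning-rate schedule listed above can be illustrated in plain Python. This is a minimal sketch, not the training script: it assumes zero warmup steps (the card lists none) and a total of 12180 optimizer steps, taken from the final row of the training results. `linear_lr` is a hypothetical helper written for this illustration, not a library function.

```python
BASE_LR = 2e-05       # learning_rate from the hyperparameter list
STEPS_PER_EPOCH = 609 # step count at epoch 1.0 in the training results
TOTAL_STEPS = 12180   # final step in the training results (20 epochs x 609 steps)
WARMUP_STEPS = 0      # assumption: the card does not list any warmup


def linear_lr(step: int) -> float:
    """Learning rate at an optimizer step: linear warmup, then linear decay to zero."""
    if step < WARMUP_STEPS:
        return BASE_LR * step / max(1, WARMUP_STEPS)
    remaining = max(0, TOTAL_STEPS - step)
    return BASE_LR * remaining / max(1, TOTAL_STEPS - WARMUP_STEPS)


# Print the learning rate at a few epoch boundaries.
for epoch in (0, 5, 10, 15, 20):
    step = epoch * STEPS_PER_EPOCH
    print(f"epoch {epoch:>2}  step {step:>5}  lr {linear_lr(step):.2e}")
```

Under these assumptions the learning rate starts at 2e-05, is halved by the end of epoch 10, and reaches zero at the last step.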

Training results

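| Training Loss | Epoch | Step  | Validation Loss |
|---------------|-------|-------|-----------------|
| 0.0486        | 1.0   | 609   | 0.0792          |
| 0.0487        | 2.0   | 1218  | 0.0810          |
| 0.0461        | 3.0   | 1827  | 0.0837          |
| 0.0454        | 4.0   | 2436  | 0.0850          |
| 0.0450        | 5.0   | 3045  | 0.0834          |
| 0.0442        | 6.0   | 3654  | 0.0852          |
| 0.0440        | 7.0   | 4263  | 0.0836          |
| 0.0435        | 8.0   | 4872  | 0.0836          |
| 0.0442        | 9.0   | 5481  | 0.0840          |
| 0.0437        | 10.0  | 6090  | 0.0858          |
| 0.0423        | 11.0  | 6699  | 0.0850          |
| 0.0435        | 12.0  | 7308  | 0.0857          |
| 0.0433        | 13.0  | 7917  | 0.0855          |
| 0.0423        | 14.0  | 8526  | 0.0856          |
| 0.0425        | 15.0  | 9135  | 0.0859          |
| 0.0423        | 16.0  | 9744  | 0.0860          |
| 0.0412        | 17.0  | 10353 | 0.0861          |
| 0.0426        | 18.0  | 10962 | 0.0861          |
| 0.0419        | 19.0  | 11571 | 0.0861          |
| 0.0414        | 20.0  | 12180 | 0.0862          |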
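
The training results can be summarized with a few lines of Python. The sketch below copies the rows from the table and finds the epoch with the lowest validation loss. It also estimates the training-set size from the step count, under the assumption (not stated in the card) that training ran on a single device without gradient accumulation, so one step equals one batch of 4 examples.

```python
# (epoch, step, training_loss, validation_loss), copied from the training results
rows = [
    (1, 609, 0.0486, 0.0792), (2, 1218, 0.0487, 0.0810),
    (3, 1827, 0.0461, 0.0837), (4, 2436, 0.0454, 0.0850),
    (5, 3045, 0.0450, 0.0834), (6, 3654, 0.0442, 0.0852),
    (7, 4263, 0.0440, 0.0836), (8, 4872, 0.0435, 0.0836),
    (9, 5481, 0.0442, 0.0840), (10, 6090, 0.0437, 0.0858),
    (11, 6699, 0.0423, 0.0850), (12, 7308, 0.0435, 0.0857),
    (13, 7917, 0.0433, 0.0855), (14, 8526, 0.0423, 0.0856),
    (15, 9135, 0.0425, 0.0859), (16, 9744, 0.0423, 0.0860),
    (17, 10353, 0.0412, 0.0861), (18, 10962, 0.0426, 0.0861),
    (19, 11571, 0.0419, 0.0861), (20, 12180, 0.0414, 0.0862),
]

TRAIN_BATCH_SIZE = 4  # train_batch_size from the hyperparameter list

# Row with the lowest validation loss (the earliest row wins on ties).
best = min(rows, key=lambda r: r[3])
final = rows[-1]

# Steps per epoch, and an upper bound on the number of training examples.
# Assumption: single device, no gradient accumulation; the last batch may be partial.
steps_per_epoch = rows[0][1]
approx_examples = steps_per_epoch * TRAIN_BATCH_SIZE

print(f"best epoch: {best[0]} (validation loss {best[3]:.4f})")
print(f"final epoch: {final[0]} (validation loss {final[3]:.4f})")
print(f"steps per epoch: {steps_per_epoch}, at most {approx_examples} training examples")
```

On these numbers the lowest validation loss (0.0792) occurs at epoch 1, while training loss keeps falling through epoch 20. That pattern suggests the later epochs may be overfitting, so an earlier checkpoint, if one was saved, could be worth comparing against the final one.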

Framework versions

  • Transformers 4.33.2
  • PyTorch 2.0.1+cu118
  • Tokenizers 0.13.3