mlath123/flan-t5-base-samsum

flan-t5-base-samsum

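This model is a fine-tuned version of google/flan-t5-base. The training dataset is not recorded in the card metadata. The results below are from the evaluation set and match the epoch 4 row of the training results table, which has the lowest validation loss: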

Loss: 1.3707
Rouge1: 47.3426
Rouge2: 23.8703
Rougel: 40.0537
Rougelsum: 43.5879
Gen Len: 17.2063
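
The card gives no usage instructions, so the following is only a minimal sketch of how a checkpoint like this could be loaded with the standard Transformers seq2seq classes. The repository id is taken from the card header. The summarization use case is an assumption inferred from the model name (the card lists the dataset as unknown), and the helper name `summarize`, the `max_new_tokens` default, and the sample dialogue are hypothetical.

```python
MODEL_ID = "mlath123/flan-t5-base-samsum"  # repository id from the card header


def summarize(text: str, max_new_tokens: int = 40) -> str:
    """Generate a short output for `text` with the fine-tuned checkpoint.

    Hypothetical helper: the card does not document an input format,
    so the text is passed to the model without any task prefix.
    """
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

    # Truncate inputs that exceed the tokenizer's maximum length.
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the weights on first use):
# print(summarize("Anna: Are we still on for lunch?\nBen: Yes, 12:30 at the usual place."))
```

The reported Gen Len of about 17 tokens suggests the outputs are short, so a 40-token cap leaves some headroom; that value is a guess, not a setting from the card.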

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 8
eval_batch_size: 8
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 5
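
To make the `lr_scheduler_type: linear` entry concrete, here is a small sketch of the learning rate such a schedule would produce at a given step. The total of 9210 steps comes from the training results table. The card does not list any warmup, so `WARMUP_STEPS = 0` is an assumption, and the function name `linear_lr` is illustrative rather than a library API.

```python
BASE_LR = 5e-5       # learning_rate from the card
TOTAL_STEPS = 9210   # final step in the training results table (5 epochs)
WARMUP_STEPS = 0     # assumption: no warmup is listed in the card


def linear_lr(step: int,
              base_lr: float = BASE_LR,
              total_steps: int = TOTAL_STEPS,
              warmup_steps: int = WARMUP_STEPS) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the end of each epoch (1842 steps per epoch).
for epoch in range(1, 6):
    print(epoch, linear_lr(epoch * 1842))
```

Under these assumptions the rate starts at 5e-05, is halved by the midpoint of training, and reaches zero at step 9210.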

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
1.4525	1.0	1842	1.3837	46.3005	22.8797	39.0659	42.773	17.2149
1.3436	2.0	3684	1.3725	47.0672	23.547	39.8291	43.3576	17.1954
1.2821	3.0	5526	1.3708	47.2477	23.6592	39.7661	43.4389	17.2295
1.2307	4.0	7368	1.3707	47.3426	23.8703	40.0537	43.5879	17.2063
1.1985	5.0	9210	1.3762	47.4705	23.9801	40.0948	43.7244	17.2833
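
The table can be read in two ways, and a short sketch helps show the difference: validation loss is lowest at epoch 4, while every ROUGE score is highest at epoch 5. The same rows also bound the training set size, assuming a single device, no gradient accumulation, and a kept final partial batch (none of which the card states). The variable names below are illustrative.

```python
# Rows copied from the training results table:
# (train_loss, epoch, step, val_loss, rouge1, rouge2, rougeL, rougeLsum, gen_len)
ROWS = [
    (1.4525, 1, 1842, 1.3837, 46.3005, 22.8797, 39.0659, 42.773, 17.2149),
    (1.3436, 2, 3684, 1.3725, 47.0672, 23.547, 39.8291, 43.3576, 17.1954),
    (1.2821, 3, 5526, 1.3708, 47.2477, 23.6592, 39.7661, 43.4389, 17.2295),
    (1.2307, 4, 7368, 1.3707, 47.3426, 23.8703, 40.0537, 43.5879, 17.2063),
    (1.1985, 5, 9210, 1.3762, 47.4705, 23.9801, 40.0948, 43.7244, 17.2833),
]
EPOCH, STEP, VAL_LOSS, ROUGE1 = 1, 2, 3, 4  # column indices used below

# Epoch with the lowest validation loss versus the highest Rouge1.
best_by_loss = min(ROWS, key=lambda row: row[VAL_LOSS])
best_by_rouge1 = max(ROWS, key=lambda row: row[ROUGE1])
print("lowest validation loss at epoch", best_by_loss[EPOCH])
print("highest Rouge1 at epoch", best_by_rouge1[EPOCH])

# Bound the number of training examples from steps per epoch and batch size.
# Assumes steps_per_epoch == ceil(num_examples / batch_size).
steps_per_epoch = ROWS[0][STEP]
train_batch_size = 8  # from the hyperparameter list
max_examples = steps_per_epoch * train_batch_size
min_examples = (steps_per_epoch - 1) * train_batch_size + 1
print("training examples between", min_examples, "and", max_examples)
```

The headline metrics at the top of the card correspond to the epoch 4 row, which suggests the checkpoint was selected by validation loss, although the card does not say so explicitly.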

Framework versions

Transformers 4.35.2
PyTorch 2.1.0+cu121
Datasets 2.17.0
Tokenizers 0.15.1


Model size (Safetensors): 248M params, tensor type F32
