metadata

license: apache-2.0
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: t5-small-github-repo-tag-generation
    results: []

t5-small-github-repo-tag-generation

This model is a fine-tuned version of t5-small on the None dataset. It achieves the following results on the evaluation set:

Loss: 0.6488
Rouge1: 25.2912
Rouge2: 9.5617
Rougel: 22.6455
Rougelsum: 22.617
Gen Len: 19.0

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 8
eval_batch_size: 8
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 30
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum	Gen Len
1.7857	1.0	66	1.2317	2.609	0.0	2.5996	2.6132	19.0
1.2147	2.0	132	1.0105	1.8041	0.0866	1.7432	1.7451	18.9848
1.0625	3.0	198	0.9154	2.5794	0.5266	2.4272	2.4305	19.0
0.9849	4.0	264	0.8615	19.2823	4.6729	17.639	17.6209	19.0
0.9363	5.0	330	0.8248	22.4371	5.4177	20.0806	20.1164	19.0
0.8995	6.0	396	0.7943	24.162	6.1729	21.3733	21.3621	19.0
0.8774	7.0	462	0.7736	24.0765	6.4219	21.2588	21.2715	19.0
0.8544	8.0	528	0.7558	24.4842	6.7685	21.8275	21.8459	19.0
0.8334	9.0	594	0.7416	25.009	7.8025	22.3227	22.3454	19.0
0.8212	10.0	660	0.7300	24.9532	7.9013	22.4275	22.4138	19.0
0.8118	11.0	726	0.7208	25.4191	7.8727	22.696	22.6894	19.0
0.7994	12.0	792	0.7114	25.4852	8.1776	22.4479	22.4522	19.0
0.7904	13.0	858	0.7020	25.4509	8.7603	22.7333	22.7213	19.0
0.7829	14.0	924	0.6958	25.0587	8.9197	22.6393	22.6207	19.0
0.7764	15.0	990	0.6897	25.0867	9.0392	22.6598	22.6808	19.0
0.7703	16.0	1056	0.6841	25.2402	9.3991	22.6384	22.6226	19.0
0.7633	17.0	1122	0.6781	25.7124	9.5485	23.0809	23.0677	19.0
0.7591	18.0	1188	0.6744	25.0679	9.4176	22.5225	22.4913	19.0
0.7553	19.0	1254	0.6695	25.3046	9.2343	22.931	22.8948	19.0
0.7514	20.0	1320	0.6661	25.3134	9.3234	22.8281	22.8198	19.0
0.746	21.0	1386	0.6630	25.3837	9.2876	22.806	22.7907	19.0
0.741	22.0	1452	0.6592	25.4751	9.3792	22.9321	22.9158	19.0
0.7404	23.0	1518	0.6566	25.5734	9.4539	23.0627	23.063	19.0
0.735	24.0	1584	0.6555	25.2529	9.5285	22.6775	22.6504	19.0
0.7334	25.0	1650	0.6536	25.2281	9.4984	22.3494	22.3364	19.0
0.7352	26.0	1716	0.6514	25.3464	9.7302	22.6918	22.6786	19.0
0.7322	27.0	1782	0.6502	25.2349	9.6516	22.6298	22.6005	19.0
0.7333	28.0	1848	0.6492	25.288	9.5646	22.6836	22.6629	19.0
0.7291	29.0	1914	0.6488	25.2912	9.5617	22.6455	22.617	19.0

Framework versions

Transformers 4.26.1
Pytorch 1.13.1+cu116
Datasets 2.10.0
Tokenizers 0.13.2