metadata

license: mit
base_model: prajjwal1/bert-tiny
tags:
  - generated_from_trainer
metrics:
  - accuracy
model-index:
  - name: tinybert_29_med_intents
    results: []

tinybert_29_med_intents

This model is a fine-tuned version of prajjwal1/bert-tiny on the None dataset. It achieves the following results on the evaluation set:

Loss: 0.4559
Accuracy: 0.9122

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 16
eval_batch_size: 16
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 50

Training results

Training Loss	Epoch	Step	Validation Loss	Accuracy
No log	1.0	378	3.0359	0.3448
3.1662	2.0	756	2.7596	0.4953
2.7937	3.0	1134	2.4944	0.5141
2.4474	4.0	1512	2.2497	0.5674
2.4474	5.0	1890	2.0280	0.6207
2.1416	6.0	2268	1.8382	0.6646
1.8743	7.0	2646	1.6716	0.6740
1.6483	8.0	3024	1.5295	0.6959
1.6483	9.0	3402	1.4096	0.7304
1.4578	10.0	3780	1.3064	0.7304
1.3078	11.0	4158	1.2158	0.7524
1.1745	12.0	4536	1.1396	0.7555
1.1745	13.0	4914	1.0636	0.7837
1.0674	14.0	5292	1.0014	0.7931
0.9794	15.0	5670	0.9418	0.8119
0.8783	16.0	6048	0.8938	0.8307
0.8783	17.0	6426	0.8488	0.8401
0.8241	18.0	6804	0.8048	0.8370
0.7575	19.0	7182	0.7750	0.8401
0.7055	20.0	7560	0.7406	0.8433
0.7055	21.0	7938	0.7063	0.8589
0.6492	22.0	8316	0.6821	0.8527
0.6121	23.0	8694	0.6619	0.8589
0.5644	24.0	9072	0.6393	0.8683
0.5644	25.0	9450	0.6200	0.8683
0.5406	26.0	9828	0.5992	0.8746
0.5148	27.0	10206	0.5846	0.8809
0.4723	28.0	10584	0.5659	0.8934
0.4723	29.0	10962	0.5566	0.8934
0.4653	30.0	11340	0.5447	0.8966
0.4386	31.0	11718	0.5358	0.8997
0.4163	32.0	12096	0.5242	0.8997
0.4163	33.0	12474	0.5183	0.9028
0.404	34.0	12852	0.5113	0.9028
0.3849	35.0	13230	0.5005	0.9028
0.3677	36.0	13608	0.4966	0.9060
0.3677	37.0	13986	0.4908	0.9091
0.3652	38.0	14364	0.4843	0.9091
0.3533	39.0	14742	0.4784	0.9060
0.3362	40.0	15120	0.4733	0.9091
0.3362	41.0	15498	0.4703	0.9091
0.3403	42.0	15876	0.4668	0.9091
0.3268	43.0	16254	0.4642	0.9122
0.3229	44.0	16632	0.4642	0.9091
0.3177	45.0	17010	0.4606	0.9154
0.3177	46.0	17388	0.4575	0.9122
0.3137	47.0	17766	0.4574	0.9122
0.3067	48.0	18144	0.4562	0.9122
0.3054	49.0	18522	0.4561	0.9122
0.3054	50.0	18900	0.4559	0.9122

Framework versions

Transformers 4.34.1
Pytorch 2.1.0+cu118
Datasets 2.14.6
Tokenizers 0.14.1