# all-distilroberta-v1-20240102
This model is a fine-tuned version of sentence-transformers/all-distilroberta-v1 on an unspecified dataset. It achieves the following results on the evaluation set:
- Loss: 0.1413
- F1: 0.8025
- Roc Auc: 0.8744
- Accuracy: 0.7687
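
Because the card reports F1, ROC AUC, and (subset) accuracy together, the checkpoint most likely carries a multi-label classification head; the label set itself is not documented here. A minimal inference sketch under that assumption:

```python
# Minimal inference sketch. ASSUMPTION: the checkpoint is a multi-label
# text classifier (suggested by the F1 / ROC AUC / accuracy metrics);
# the actual labels are not documented in this card.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

repo = "Kevinger/all-distilroberta-v1-20240102"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForSequenceClassification.from_pretrained(repo)

inputs = tokenizer("Example input text", return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits

# For a multi-label head, apply a sigmoid and threshold each label independently.
probs = torch.sigmoid(logits)[0]
predicted = [model.config.id2label[i] for i, p in enumerate(probs) if p > 0.5]
print(predicted)
```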
## Model description
More information needed
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training (a sketch mapping them onto `TrainingArguments` follows the list):
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 11
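
A sketch of how the hyperparameters above map onto `transformers.TrainingArguments`; the output directory name is a placeholder, and model/dataset/Trainer wiring is omitted:

```python
# Sketch: the documented hyperparameters expressed as TrainingArguments.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="all-distilroberta-v1-20240102",  # placeholder name
    learning_rate=2e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    lr_scheduler_type="linear",
    num_train_epochs=11,
    evaluation_strategy="epoch",  # the results table shows one eval per epoch
)
# The Adam betas/epsilon listed above are the transformers defaults
# (adam_beta1=0.9, adam_beta2=0.999, adam_epsilon=1e-8), so no override is needed.
```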
### Training results
| Training Loss | Epoch | Step | Validation Loss | F1     | Roc Auc | Accuracy |
|:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|:--------:|
| No log        | 1.0   | 109  | 0.2310          | 0.5875 | 0.7128  | 0.4305   |
| No log        | 2.0   | 218  | 0.1775          | 0.7192 | 0.7950  | 0.5991   |
| No log        | 3.0   | 327  | 0.1571          | 0.7654 | 0.8255  | 0.6604   |
| No log        | 4.0   | 436  | 0.1499          | 0.7854 | 0.8516  | 0.7192   |
| 0.2013        | 5.0   | 545  | 0.1410          | 0.8000 | 0.8705  | 0.7593   |
| 0.2013        | 6.0   | 654  | 0.1413          | 0.8025 | 0.8744  | 0.7687   |
| 0.2013        | 7.0   | 763  | 0.1408          | 0.8018 | 0.8762  | 0.7731   |
| 0.2013        | 8.0   | 872  | 0.1454          | 0.7973 | 0.8753  | 0.7736   |
| 0.2013        | 9.0   | 981  | 0.1467          | 0.7966 | 0.8736  | 0.7692   |
| 0.0634        | 10.0  | 1090 | 0.1481          | 0.7928 | 0.8721  | 0.7672   |
| 0.0634        | 11.0  | 1199 | 0.1471          | 0.7945 | 0.8730  | 0.7692   |
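
The headline metrics at the top of the card match the epoch-6 row. The card does not document how the metric columns were computed; a common recipe for multi-label fine-tuning (sigmoid, per-label threshold at 0.5, micro averaging, subset accuracy) is sketched below as an assumption, not a confirmed reproduction:

```python
# Hedged sketch of a compute_metrics function that could produce the
# F1 / Roc Auc / Accuracy columns above. ASSUMPTIONS: multi-label task,
# 0.5 threshold, micro averaging — none of these are stated in the card.
import numpy as np
from sklearn.metrics import accuracy_score, f1_score, roc_auc_score

def compute_metrics(eval_pred):
    logits, labels = eval_pred
    probs = 1 / (1 + np.exp(-logits))   # sigmoid over each label independently
    preds = (probs >= 0.5).astype(int)  # per-label threshold at 0.5
    return {
        "f1": f1_score(labels, preds, average="micro"),
        "roc_auc": roc_auc_score(labels, probs, average="micro"),
        "accuracy": accuracy_score(labels, preds),  # subset (exact-match) accuracy
    }
```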
### Framework versions
- Transformers 4.35.2
- Pytorch 2.1.0+cu121
- Datasets 2.16.1
- Tokenizers 0.15.0