eysharaazia
/

cyber_deberta

@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [MoritzLaurer/mDeBERTa-v3-base-xnli-multilingual-nli-2mil7](https://huggingface.co/MoritzLaurer/mDeBERTa-v3-base-xnli-multilingual-nli-2mil7) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4646
-- Accuracy: 0.8273
-- F1: 0.8125
-- Precision: 0.8068
-- Recall: 0.8207
 ## Model description
@@ -47,8 +47,11 @@ The following hyperparameters were used during training:
 - train_batch_size: 64
 - eval_batch_size: 64
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: linear
 - num_epochs: 5
 - mixed_precision_training: Native AMP
@@ -56,11 +59,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
-| 0.3747        | 1.0   | 277  | 0.4398          | 0.7981   | 0.7899 | 0.7874    | 0.8177 |
-| 0.2971        | 2.0   | 554  | 0.4022          | 0.8226   | 0.8101 | 0.8031    | 0.8241 |
-| 0.2659        | 3.0   | 831  | 0.4262          | 0.8258   | 0.8135 | 0.8065    | 0.8280 |
-| 0.2387        | 4.0   | 1108 | 0.4502          | 0.8320   | 0.8168 | 0.8118    | 0.8235 |
-| 0.268         | 5.0   | 1385 | 0.4646          | 0.8273   | 0.8125 | 0.8068    | 0.8207 |
 ### Framework versions

 This model is a fine-tuned version of [MoritzLaurer/mDeBERTa-v3-base-xnli-multilingual-nli-2mil7](https://huggingface.co/MoritzLaurer/mDeBERTa-v3-base-xnli-multilingual-nli-2mil7) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3811
+- Accuracy: 0.8357
+- F1: 0.8180
+- Precision: 0.8167
+- Recall: 0.8193
 ## Model description
 - train_batch_size: 64
 - eval_batch_size: 64
 - seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 500
 - num_epochs: 5
 - mixed_precision_training: Native AMP
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:---------:|:------:|
+| 0.6162        | 1.0   | 105  | 0.6158          | 0.6573   | 0.3966 | 0.3292    | 0.4988 |
+| 0.4929        | 2.0   | 210  | 0.4845          | 0.7621   | 0.7338 | 0.7353    | 0.7325 |
+| 0.4092        | 3.0   | 315  | 0.4153          | 0.8044   | 0.7827 | 0.7824    | 0.7830 |
+| 0.3707        | 4.0   | 420  | 0.3846          | 0.8206   | 0.7986 | 0.8015    | 0.7960 |
+| 0.3046        | 5.0   | 525  | 0.3811          | 0.8357   | 0.8180 | 0.8167    | 0.8193 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:be81b4d62a52e62cdc175eab3bab0db532fecfde7751ca63e5de1810a7e50ba6
 size 1115268200

 version https://git-lfs.github.com/spec/v1
+oid sha256:3e3d78e6a0ec3ff63c47cc5a99cdb2f29ca5b488f39f45476fa9bdfef115046e
 size 1115268200

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7b06cfc114d7acf6808c55d2d25cac269166e7d5e899592aa2dab29327b053c0
-size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:49dcaa7482fed2158c1abd1129fcd16d8f4e37bcefd022a489d704af4cc934b6
+size 5048