sohamtiwari3120
/

deberta-v3-base-finetuned-ner

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5136
 - Overall Precision: 0.0
 - Overall Recall: 0.0
 - Overall F1: 0.0
@@ -47,27 +47,22 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.001
-- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy | Datasetname F1 | Hyperparametername F1 | Hyperparametervalue F1 | Methodname F1 | Metricname F1 | Metricvalue F1 | Taskname F1 |
 |:-------------:|:-----:|:----:|:---------------:|:-----------------:|:--------------:|:----------:|:----------------:|:--------------:|:---------------------:|:----------------------:|:-------------:|:-------------:|:--------------:|:-----------:|
-| No log        | 1.0   | 71   | 0.5821          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
-| No log        | 2.0   | 142  | 0.5116          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
-| No log        | 3.0   | 213  | 0.5100          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
-| No log        | 4.0   | 284  | 0.5116          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
-| No log        | 5.0   | 355  | 0.5181          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
-| No log        | 6.0   | 426  | 0.5098          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
-| No log        | 7.0   | 497  | 0.5060          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
-| 0.5155        | 8.0   | 568  | 0.5263          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
-| 0.5155        | 9.0   | 639  | 0.5109          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
-| 0.5155        | 10.0  | 710  | 0.5136          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1356
 - Overall Precision: 0.0
 - Overall Recall: 0.0
 - Overall F1: 0.0
 The following hyperparameters were used during training:
 - learning_rate: 0.001
+- train_batch_size: 2
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 100
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy | Datasetname F1 | Hyperparametername F1 | Hyperparametervalue F1 | Methodname F1 | Metricname F1 | Metricvalue F1 | Taskname F1 |
 |:-------------:|:-----:|:----:|:---------------:|:-----------------:|:--------------:|:----------:|:----------------:|:--------------:|:---------------------:|:----------------------:|:-------------:|:-------------:|:--------------:|:-----------:|
+| No log        | 1.0   | 282  | 1.2013          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
+| 0.9367        | 2.0   | 564  | 1.0731          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
+| 0.9367        | 3.0   | 846  | 1.0889          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
+| 0.9128        | 4.0   | 1128 | 1.0884          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
+| 0.9128        | 5.0   | 1410 | 1.1356          | 0.0               | 0.0            | 0.0        | 0.9116           | 0.0            | 0.0                   | 0.0                    | 0.0           | 0.0           | 0.0            | 0.0         |
 ### Framework versions