callmesan/twitter-roberta-large-hate-latest-roman-urdu-binary

@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [cardiffnlp/twitter-roberta-large-hate-latest](https://huggingface.co/cardiffnlp/twitter-roberta-large-hate-latest) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3062
-- Accuracy: 0.9021
-- Precision: 0.9014
-- Recall: 0.9028
-- F1: 0.9019
 ## Model description
@@ -44,7 +44,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
 - train_batch_size: 32
 - eval_batch_size: 128
 - seed: 42
@@ -52,17 +52,22 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
-| 0.4413        | 0.9912 | 56   | 0.3491          | 0.8489   | 0.8505    | 0.8461 | 0.8474 |
-| 0.2844        | 2.0    | 113  | 0.3017          | 0.8677   | 0.8697    | 0.8648 | 0.8663 |
-| 0.2437        | 2.9912 | 169  | 0.3157          | 0.8739   | 0.8731    | 0.8739 | 0.8735 |
-| 0.1735        | 4.0    | 226  | 0.3252          | 0.8752   | 0.8759    | 0.8732 | 0.8742 |
-| 0.1438        | 4.9558 | 280  | 0.3455          | 0.8777   | 0.8769    | 0.8778 | 0.8772 |
 ### Framework versions

 This model is a fine-tuned version of [cardiffnlp/twitter-roberta-large-hate-latest](https://huggingface.co/cardiffnlp/twitter-roberta-large-hate-latest) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4537
+- Accuracy: 0.9016
+- Precision: 0.9015
+- Recall: 0.9007
+- F1: 0.9011
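The summary metrics can be sanity-checked against each other. The sketch below is an illustration, not the author's evaluation code: it tests whether a reported F1 equals the harmonic mean of the reported precision and recall. The helper name `harmonic_f1` is hypothetical; the numbers are copied from this diff.

```python
def harmonic_f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, rounded to 4 decimals like the card."""
    return round(2 * precision * recall / (precision + recall), 4)

# Updated summary metrics: precision 0.9015, recall 0.9007, reported F1 0.9011.
summary_f1 = harmonic_f1(0.9015, 0.9007)

# Updated results table, step 113: precision 0.8699, recall 0.8611, reported F1 0.8632.
step_113_f1 = harmonic_f1(0.8699, 0.8611)

print("summary:", summary_f1, "reported: 0.9011")
print("step 113:", step_113_f1, "reported: 0.8632")
```

The summary row agrees with the harmonic mean, but the step 113 row does not (0.8655 computed against 0.8632 reported). One possible explanation is that the metrics are averaged per class before being reported, in which case F1 need not equal the harmonic mean of the averaged precision and recall. The diff does not state the averaging mode.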
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 32
 - eval_batch_size: 128
 - seed: 42
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
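The fractional epoch values logged in the results table follow from the batch sizes listed here. A minimal sketch of that arithmetic, under two assumptions that the diff does not confirm: the factor of 4 between `train_batch_size` and `total_train_batch_size` comes from gradient accumulation or multiple devices (the relevant line falls outside the hunks), and one epoch contains 226 batches of 32 examples, a value inferred from the logged epochs rather than stated in the card.

```python
# Values stated in the model card diff.
TRAIN_BATCH_SIZE = 32
TOTAL_TRAIN_BATCH_SIZE = 128

# Assumption: inferred from the logged step/epoch pairs, not stated in the card.
BATCHES_PER_EPOCH = 226

def scale_factor() -> int:
    """Number of 32-example batches consumed per optimizer step."""
    return TOTAL_TRAIN_BATCH_SIZE // TRAIN_BATCH_SIZE

def epoch_at_step(step: int) -> float:
    """Fraction of the data seen after `step` optimizer steps, in epochs."""
    return round(step * scale_factor() / BATCHES_PER_EPOCH, 4)

# (step, epoch) pairs copied from the old and new results tables.
logged = [(56, 0.9912), (113, 2.0), (280, 4.9558), (282, 4.9912), (560, 9.9115)]
for step, epoch in logged:
    print(step, epoch_at_step(step), epoch)
```

Under these assumptions every logged pair is reproduced, including step 280 from the earlier 5-epoch run and step 560 from the 10-epoch run.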
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy | Precision | Recall | F1     |
 |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
+| 0.4956        | 0.9912 | 56   | 0.4878          | 0.8215   | 0.8208    | 0.8202 | 0.8205 |
+| 0.3268        | 2.0    | 113  | 0.3194          | 0.8652   | 0.8699    | 0.8611 | 0.8632 |
+| 0.2555        | 2.9912 | 169  | 0.3241          | 0.8839   | 0.8894    | 0.8798 | 0.8822 |
+| 0.1832        | 4.0    | 226  | 0.3184          | 0.8914   | 0.8930    | 0.8891 | 0.8904 |
+| 0.1283        | 4.9912 | 282  | 0.3375          | 0.8976   | 0.9028    | 0.8939 | 0.8962 |
+| 0.0815        | 6.0    | 339  | 0.3645          | 0.8939   | 0.8937    | 0.8955 | 0.8937 |
+| 0.0961        | 6.9912 | 395  | 0.3691          | 0.9001   | 0.9004    | 0.8988 | 0.8995 |
+| 0.0544        | 8.0    | 452  | 0.4781          | 0.8901   | 0.8895    | 0.8910 | 0.8899 |
+| 0.0404        | 8.9912 | 508  | 0.4209          | 0.9089   | 0.9104    | 0.9068 | 0.9081 |
+| 0.0359        | 9.9115 | 560  | 0.4259          | 0.9051   | 0.9051    | 0.9042 | 0.9046 |
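To compare checkpoints from the updated run, the rows above can be ranked by either validation loss or F1. The sketch below copies the step, validation loss, accuracy, and F1 columns from the table; the helper names are hypothetical, and nothing in the diff says which criterion the author used to choose the published weights.

```python
# Rows copied from the updated "Training results" table:
# (step, validation_loss, accuracy, f1)
ROWS = [
    (56, 0.4878, 0.8215, 0.8205),
    (113, 0.3194, 0.8652, 0.8632),
    (169, 0.3241, 0.8839, 0.8822),
    (226, 0.3184, 0.8914, 0.8904),
    (282, 0.3375, 0.8976, 0.8962),
    (339, 0.3645, 0.8939, 0.8937),
    (395, 0.3691, 0.9001, 0.8995),
    (452, 0.4781, 0.8901, 0.8899),
    (508, 0.4209, 0.9089, 0.9081),
    (560, 0.4259, 0.9051, 0.9046),
]

def best_by_val_loss(rows):
    """Row with the lowest validation loss."""
    return min(rows, key=lambda row: row[1])

def best_by_f1(rows):
    """Row with the highest F1."""
    return max(rows, key=lambda row: row[3])

print("lowest validation loss:", best_by_val_loss(ROWS))
print("highest F1:", best_by_f1(ROWS))
```

The two criteria disagree here: validation loss is lowest at step 226, while F1 peaks at step 508, after training loss has dropped well below validation loss. The summary metrics at the top of the card (loss 0.4537, F1 0.9011) do not match any row of this table, so they may come from a separate evaluation.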
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a5ac7e8a0415cc18edcb4653abe427fcedd56aab6cf6e85cf64b7127ed76babc
 size 1421495416

 version https://git-lfs.github.com/spec/v1
+oid sha256:55dd784a0cb2ba00bb8db7d8c9cf1c43238555ae064b253604e8182717e8cfe1
 size 1421495416