RogerB committed on
Commit 1522a78 · 1 Parent(s): ea923a6

End of training

Files changed (2)
  1. README.md +6 -4
  2. config.json +6 -6
README.md CHANGED
@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [RogerB/afro-xlmr-large-kinte-domain-kinte-task](https://huggingface.co/RogerB/afro-xlmr-large-kinte-domain-kinte-task) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7683
-- F1: 0.6774
+- Loss: 0.9205
+- F1: 0.6949
 
 ## Model description
 
@@ -43,13 +43,15 @@ The following hyperparameters were used during training:
 - seed: 270136
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
+- num_epochs: 3
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | F1 |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 0.8698 | 1.0 | 1013 | 0.6868 | 0.7212 |
+| 0.9046 | 1.0 | 1013 | 0.6601 | 0.7333 |
+| 0.7362 | 2.0 | 2026 | 0.5438 | 0.7930 |
+| 0.6502 | 3.0 | 3039 | 0.5326 | 0.8121 |
 
 
 ### Framework versions
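
Reviewer note: raising `num_epochs` from 1 to 3 with `lr_scheduler_type: linear` stretches the decay over 3039 steps (1013 per epoch, per the Step column) instead of 1013. The following is a minimal sketch of such a linear schedule, not the trainer's actual implementation. The base learning rate and warmup length are not visible in this hunk, so `BASE_LR` and `warmup_steps` below are hypothetical placeholders.

```python
def linear_lr(step: int, total_steps: int, base_lr: float, warmup_steps: int = 0) -> float:
    """Linear warmup (optional) followed by linear decay to zero."""
    if step < warmup_steps:
        return base_lr * (step / max(1, warmup_steps))
    remaining = max(0, total_steps - step)
    return base_lr * (remaining / max(1, total_steps - warmup_steps))


STEPS_PER_EPOCH = 1013                      # from the Step column of the results table
NUM_EPOCHS = 3                              # new value of num_epochs in this commit
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS  # 3039, matching the last table row
BASE_LR = 2e-5                              # hypothetical: not shown in this diff hunk

# Print the learning rate at each epoch boundary.
for epoch in range(NUM_EPOCHS + 1):
    step = epoch * STEPS_PER_EPOCH
    print(f"epoch {epoch} (step {step}): lr = {linear_lr(step, TOTAL_STEPS, BASE_LR):.3e}")
```

Under these assumptions the rate after the first epoch is still two thirds of its starting value, whereas in the earlier one-epoch run it had already decayed to zero by step 1013.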
config.json CHANGED
@@ -11,16 +11,16 @@
   "hidden_dropout_prob": 0.1,
   "hidden_size": 1024,
   "id2label": {
-    "0": "LABEL_0",
-    "1": "LABEL_1",
-    "2": "LABEL_2"
+    "0": "positive",
+    "1": "neutral",
+    "2": "negative"
   },
   "initializer_range": 0.02,
   "intermediate_size": 4096,
   "label2id": {
-    "LABEL_0": 0,
-    "LABEL_1": 1,
-    "LABEL_2": 2
+    "negative": 2,
+    "neutral": 1,
+    "positive": 0
   },
   "layer_norm_eps": 1e-05,
   "max_position_embeddings": 514,
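
Reviewer note: the new `id2label` / `label2id` entries replace the generic `LABEL_n` names with sentiment labels, so downstream code can report readable classes. The sketch below shows, in plain Python, how a class index could be mapped back to a label using the values from this diff. The `decode` helper and the example logits are hypothetical and only illustrate the lookup; they are not part of the repository.

```python
import math

# Mappings copied from the new config.json (keys written as ints for convenience).
ID2LABEL = {0: "positive", 1: "neutral", 2: "negative"}
LABEL2ID = {"negative": 2, "neutral": 1, "positive": 0}


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def decode(logits):
    """Return (label, probability) for the highest-scoring class."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return ID2LABEL[best], probs[best]


# The two mappings should be exact inverses of each other.
assert all(LABEL2ID[label] == idx for idx, label in ID2LABEL.items())

label, score = decode([0.2, -1.3, 2.1])  # hypothetical logits for one input
print(label, round(score, 3))
```

Checking that the two dictionaries are inverses is a cheap guard against a label being renamed in one mapping but not the other.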