Commit 16e2468 by meoo225
Parent: bd9271f

End of training

README.md CHANGED
@@ -19,12 +19,12 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4030
-- Bleu Score: 45.1405
-- Precision: 13.3811
-- Recall: 13.3811
-- Gen Len: 16.8196
-- Err: 13.3811
+- Loss: 0.3417
+- Bleu Score: 47.0526
+- Precision: 17.2043
+- Recall: 17.2043
+- Gen Len: 16.8315
+- Err: 17.2043
 
 ## Model description
 
@@ -49,15 +49,16 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
+- num_epochs: 4
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Precision | Recall | Gen Len | Err |
 |:-------------:|:-----:|:----:|:---------------:|:----------:|:---------:|:-------:|:-------:|:-------:|
-| 2.8374 | 1.0 | 838 | 0.5595 | 41.3763 | 7.1685 | 7.1685 | 16.7802 | 7.1685 |
-| 0.7564 | 2.0 | 1676 | 0.4503 | 44.4878 | 12.0669 | 12.0669 | 16.8124 | 12.0669 |
-| 0.6103 | 3.0 | 2514 | 0.4030 | 45.1405 | 13.3811 | 13.3811 | 16.8196 | 13.3811 |
+| 2.798 | 1.0 | 838 | 0.5495 | 41.8683 | 7.7658 | 7.7658 | 16.7766 | 7.7658 |
+| 0.7216 | 2.0 | 1676 | 0.4311 | 44.9002 | 13.0227 | 13.0227 | 16.8148 | 13.0227 |
+| 0.5551 | 3.0 | 2514 | 0.3565 | 46.5247 | 16.0096 | 16.0096 | 16.816 | 16.0096 |
+| 0.4951 | 4.0 | 3352 | 0.3417 | 47.0526 | 17.2043 | 17.2043 | 16.8315 | 17.2043 |
 
 
 
 ### Framework versions
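
The updated training results in README.md can be sanity-checked with a few lines of Python. The sketch below is not part of the commit. It copies the four new rows by hand, confirms that each logged step is a whole multiple of the first epoch's step count (838), and reports how validation loss and BLEU change from one epoch to the next. The names `rows` and `epoch_deltas` are hypothetical and chosen for illustration only.

```python
# Rows copied by hand from the updated "Training results" table:
# (epoch, step, training loss, validation loss, BLEU score)
rows = [
    (1, 838, 2.798, 0.5495, 41.8683),
    (2, 1676, 0.7216, 0.4311, 44.9002),
    (3, 2514, 0.5551, 0.3565, 46.5247),
    (4, 3352, 0.4951, 0.3417, 47.0526),
]

# The step logged at the end of the first epoch gives the steps per epoch.
steps_per_epoch = rows[0][1]

# Every later row should sit on a whole multiple of that value.
steps_consistent = all(step == epoch * steps_per_epoch for epoch, step, *_ in rows)


def epoch_deltas(table):
    """Return (epoch, validation-loss change, BLEU change) versus the previous epoch."""
    deltas = []
    for prev, cur in zip(table, table[1:]):
        loss_change = round(cur[3] - prev[3], 4)
        bleu_change = round(cur[4] - prev[4], 4)
        deltas.append((cur[0], loss_change, bleu_change))
    return deltas


print("steps consistent:", steps_consistent)
for epoch, loss_change, bleu_change in epoch_deltas(rows):
    print(f"epoch {epoch}: validation loss {loss_change:+.4f}, BLEU {bleu_change:+.4f}")
```

On these rows the per-epoch improvements shrink with each additional epoch, which is one way to read the effect of raising `num_epochs` from 3 to 4.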
logs/events.out.tfevents.1728140915.56571babd82e.1261.2 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:800f0af24beb9bad7b0a446189f5cccd1b50ebbd19139e6f8c808f60ed24599a
-size 7396
+oid sha256:f129f0b20b6e0c5e1aa38cf53db5b7518b7fb1ab507c30112a2e91ee0db00184
+size 8487
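
The event log is tracked with Git LFS, so the diff only shows its pointer file: a `version` line, an `oid` line carrying the hash algorithm and digest, and a `size` line in bytes. As a minimal sketch (not part of the commit), the Python below parses the old and new pointers as they appear in this diff and compares their sizes. The helper name `parse_lfs_pointer` is hypothetical, and the parser assumes only the three keys seen here.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the old and new sides of the diff.
old_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:800f0af24beb9bad7b0a446189f5cccd1b50ebbd19139e6f8c808f60ed24599a\n"
    "size 7396\n"
)
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:f129f0b20b6e0c5e1aa38cf53db5b7518b7fb1ab507c30112a2e91ee0db00184\n"
    "size 8487\n"
)

size_growth = new_pointer["size"] - old_pointer["size"]
print(f"event log grew by {size_growth} bytes")
```

The digest changes whenever the underlying file changes, so the pointer diff alone confirms that the log was rewritten, and the size fields show by how much it grew.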