IAmSkyDra commited on
Commit
5e480a1
·
verified ·
1 Parent(s): adb50f5

End of training

Browse files
Files changed (1) hide show
  1. README.md +11 -13
README.md CHANGED
@@ -1,11 +1,11 @@
1
  ---
2
- base_model: IAmSkyDra/BARTBana_v4
3
  library_name: transformers
4
  license: mit
5
- metrics:
6
- - sacrebleu
7
  tags:
8
  - generated_from_trainer
 
 
9
  model-index:
10
  - name: BARTBana_Translation_ReplaceSymWord_v0
11
  results: []
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [IAmSkyDra/BARTBana_v4](https://huggingface.co/IAmSkyDra/BARTBana_v4) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.0261
22
- - Sacrebleu: 20.4115
23
 
24
  ## Model description
25
 
@@ -44,23 +44,21 @@ The following hyperparameters were used during training:
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
- - num_epochs: 5
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
53
  |:-------------:|:-----:|:-----:|:---------------:|:---------:|
54
- | 0.2981 | 1.0 | 7111 | 0.2214 | 9.2406 |
55
- | 0.1405 | 2.0 | 14222 | 0.0835 | 15.4247 |
56
- | 0.0822 | 3.0 | 21333 | 0.0436 | 18.4992 |
57
- | 0.0591 | 4.0 | 28444 | 0.0299 | 19.9525 |
58
- | 0.0498 | 5.0 | 35555 | 0.0261 | 20.4115 |
59
 
60
 
61
  ### Framework versions
62
 
63
- - Transformers 4.48.1
64
- - Pytorch 2.5.1+cu124
65
  - Datasets 3.2.0
66
  - Tokenizers 0.21.0
 
1
  ---
 
2
  library_name: transformers
3
  license: mit
4
+ base_model: IAmSkyDra/BARTBana_v4
 
5
  tags:
6
  - generated_from_trainer
7
+ metrics:
8
+ - sacrebleu
9
  model-index:
10
  - name: BARTBana_Translation_ReplaceSymWord_v0
11
  results: []
 
18
 
19
  This model is a fine-tuned version of [IAmSkyDra/BARTBana_v4](https://huggingface.co/IAmSkyDra/BARTBana_v4) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.0861
22
+ - Sacrebleu: 15.3102
23
 
24
  ## Model description
25
 
 
44
  - seed: 42
45
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
+ - num_epochs: 3
48
  - mixed_precision_training: Native AMP
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
53
  |:-------------:|:-----:|:-----:|:---------------:|:---------:|
54
+ | 0.3205 | 1.0 | 7111 | 0.2434 | 8.5066 |
55
+ | 0.1779 | 2.0 | 14222 | 0.1138 | 13.6308 |
56
+ | 0.1384 | 3.0 | 21333 | 0.0861 | 15.3102 |
 
 
57
 
58
 
59
  ### Framework versions
60
 
61
+ - Transformers 4.48.2
62
+ - Pytorch 2.6.0+cu124
63
  - Datasets 3.2.0
64
  - Tokenizers 0.21.0