hugo-albert commited on
Commit
e580162
1 Parent(s): 45536b4

Training complete

Browse files
Files changed (2) hide show
  1. README.md +16 -11
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.2267
21
- - Bleu: 55.4437
22
- - Gen Len: 74.5273
23
 
24
  ## Model description
25
 
@@ -44,17 +44,22 @@ The following hyperparameters were used during training:
44
  - seed: 42
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
- - num_epochs: 5
48
 
49
  ### Training results
50
 
51
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
52
- |:-------------:|:-----:|:----:|:---------------:|:-------:|:--------:|
53
- | No log | 1.0 | 67 | 2.7602 | 30.0843 | 94.7818 |
54
- | No log | 2.0 | 134 | 1.8176 | 11.874 | 139.8182 |
55
- | No log | 3.0 | 201 | 1.4276 | 43.5045 | 85.8182 |
56
- | No log | 4.0 | 268 | 1.2682 | 54.7173 | 72.7273 |
57
- | No log | 5.0 | 335 | 1.2267 | 55.4437 | 74.5273 |
 
 
 
 
 
58
 
59
 
60
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.7738
21
+ - Bleu: 67.4647
22
+ - Gen Len: 75.9455
23
 
24
  ## Model description
25
 
 
44
  - seed: 42
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
+ - num_epochs: 10
48
 
49
  ### Training results
50
 
51
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
52
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
53
+ | No log | 1.0 | 67 | 2.6896 | 29.0389 | 96.5455 |
54
+ | No log | 2.0 | 134 | 1.6534 | 30.4693 | 96.6727 |
55
+ | No log | 3.0 | 201 | 1.2046 | 55.0467 | 76.7455 |
56
+ | No log | 4.0 | 268 | 1.0048 | 59.5519 | 76.9091 |
57
+ | No log | 5.0 | 335 | 0.9176 | 64.2229 | 75.5455 |
58
+ | No log | 6.0 | 402 | 0.8610 | 65.8311 | 73.6909 |
59
+ | No log | 7.0 | 469 | 0.8160 | 65.5771 | 76.4727 |
60
+ | 1.5731 | 8.0 | 536 | 0.7968 | 67.9558 | 74.7636 |
61
+ | 1.5731 | 9.0 | 603 | 0.7794 | 67.5994 | 75.8 |
62
+ | 1.5731 | 10.0 | 670 | 0.7738 | 67.4647 | 75.9455 |
63
 
64
 
65
  ### Framework versions
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6949a136301d4fc6684121edf89734ace70b50322747c2464971bfe39c804f6b
3
  size 9490378
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d95e076dd55aa2db5b8d4f5e24176646443ea33ff2c8cbff3ca1d8b09db968d2
3
  size 9490378