Ransaka commited on
Commit
f69a1ef
1 Parent(s): a0b0b0d

End of training

Browse files
README.md CHANGED
@@ -12,11 +12,6 @@ should probably proofread and complete it, then remove this comment. -->
12
  # sinhala-roman-transformer
13
 
14
  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
15
- It achieves the following results on the evaluation set:
16
- - Loss: 8.1028
17
- - Rouge2 Precision: 0.0
18
- - Rouge2 Recall: 0.0
19
- - Rouge2 Fmeasure: 0.0
20
 
21
  ## Model description
22
 
@@ -41,14 +36,12 @@ The following hyperparameters were used during training:
41
  - seed: 42
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
 
44
  - num_epochs: 3.0
45
  - mixed_precision_training: Native AMP
46
 
47
  ### Training results
48
 
49
- | Training Loss | Epoch | Step | Validation Loss | Rouge2 Precision | Rouge2 Recall | Rouge2 Fmeasure |
50
- |:-------------:|:-----:|:----:|:---------------:|:----------------:|:-------------:|:---------------:|
51
- | 8.182 | 2.0 | 4 | 8.1028 | 0.0 | 0.0 | 0.0 |
52
 
53
 
54
  ### Framework versions
 
12
  # sinhala-roman-transformer
13
 
14
  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 
 
 
 
 
15
 
16
  ## Model description
17
 
 
36
  - seed: 42
37
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
38
  - lr_scheduler_type: linear
39
+ - lr_scheduler_warmup_steps: 2000
40
  - num_epochs: 3.0
41
  - mixed_precision_training: Native AMP
42
 
43
  ### Training results
44
 
 
 
 
45
 
46
 
47
  ### Framework versions
config.json CHANGED
@@ -163,6 +163,8 @@
163
  "eos_token_id": 3,
164
  "is_encoder_decoder": true,
165
  "length_penalty": 2.0,
 
 
166
  "model_type": "encoder-decoder",
167
  "no_repeat_ngram_size": 3,
168
  "num_beams": 4,
 
163
  "eos_token_id": 3,
164
  "is_encoder_decoder": true,
165
  "length_penalty": 2.0,
166
+ "max_length": 142,
167
+ "min_length": 56,
168
  "model_type": "encoder-decoder",
169
  "no_repeat_ngram_size": 3,
170
  "num_beams": 4,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a52ee247a0c8efe4a594efd0d9762f3ecc619c7fcb788bde9ee7d2df546262ae
3
  size 132506536
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9f51540746a6f3d27d5c07b72a8b6fa4b48f9fbffea353a035cd3f9b6f99fc15
3
  size 132506536
runs/Jun23_10-43-28_970b0707b08f/events.out.tfevents.1719139412.970b0707b08f.317.5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:46ad872ed9e85ff61aec67f1204fba0f1511cf07326bdd8fd927da47fe268db9
3
- size 17431
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b84d194ae1f9d608f42150f82c5cbbb14c8c796365bc9b58e8f822ae1b459923
3
+ size 18418
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:95c1b0823b88f7f1d84ef295d5dde620f76f61b4847b26c5592f1eb5be7acb4d
3
- size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3169c99acca6520dde776a047dc39e352627a074cef0e630de1c341aaacd1e93
3
+ size 5304