huseinzol05 commited on
Commit
9886323
1 Parent(s): 1612feb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -3
README.md CHANGED
@@ -17,10 +17,11 @@ Wandb at https://wandb.ai/huseinzol05/nanot5-small-malaysian-cased-translation-v
17
 
18
  ## how we trained it?
19
 
20
- We done 2 phases,
21
 
22
- 1. First phase, trained on 6B tokens noisy translation dataset.
23
- 2. Second phase, trained on 1B tokens higher quality translation dataset.
 
24
 
25
  ## Supported prefix
26
 
 
17
 
18
  ## how we trained it?
19
 
20
+ We done 3 phases,
21
 
22
+ 1. First phase, trained on 5% of the 6B tokens noisy translation dataset that include all prefixes on padding based training to improve attention bias.
23
+ 1. Second phase, trained on 6B tokens noisy translation dataset on packing based and this required to freeze attention bias to speed up the training.
24
+ 2. Third phase, trained on 1B tokens higher quality translation dataset on packing based and this required to freeze attention bias to speed up the training.
25
 
26
  ## Supported prefix
27