huseinzol05
commited on
Commit
•
9886323
1
Parent(s):
1612feb
Update README.md
Browse files
README.md
CHANGED
@@ -17,10 +17,11 @@ Wandb at https://wandb.ai/huseinzol05/nanot5-small-malaysian-cased-translation-v
|
|
17 |
|
18 |
## how we trained it?
|
19 |
|
20 |
-
We done
|
21 |
|
22 |
-
1. First phase, trained on 6B tokens noisy translation dataset.
|
23 |
-
|
|
|
24 |
|
25 |
## Supported prefix
|
26 |
|
|
|
17 |
|
18 |
## how we trained it?
|
19 |
|
20 |
+
We done 3 phases,
|
21 |
|
22 |
+
1. First phase, trained on 5% of the 6B tokens noisy translation dataset that include all prefixes on padding based training to improve attention bias.
|
23 |
+
1. Second phase, trained on 6B tokens noisy translation dataset on packing based and this required to freeze attention bias to speed up the training.
|
24 |
+
2. Third phase, trained on 1B tokens higher quality translation dataset on packing based and this required to freeze attention bias to speed up the training.
|
25 |
|
26 |
## Supported prefix
|
27 |
|