Commit 1612feb by huseinzol05
1 Parent(s): c28cf33

Update README.md

README.md CHANGED
@@ -13,7 +13,14 @@ Finetuned https://huggingface.co/mesolitica/nanot5-small-malaysian-cased using 2
 - This model natively code switching.
 - This model should maintain `\n`, `\t`, `\r` as it is.
 
-
+Wandb at https://wandb.ai/huseinzol05/nanot5-small-malaysian-cased-translation-v4-multipack-post-v3
+
+## how we trained it?
+
+We done 2 phases,
+
+1. First phase, trained on 6B tokens noisy translation dataset.
+2. Second phase, trained on 1B tokens higher quality translation dataset.
 
 ## Supported prefix
 