jaswanthrk committed on
Commit 0ddb6ec • 1 Parent(s): a5400e1
Update README.md
README.md CHANGED
@@ -1,3 +1,10 @@
 ---
 license: mit
 ---
+
+posterior_KaTeMaTa_llama_llama.model
+- This is an SP-format tokenizer obtained by merging the Kannada, Telugu, Malayalam, Tamil, and Llama-2 tokenizers.
+
+posterior_dr_llama_15_32k_balanced.model
+posterior_dr_llama_15_32k_balanced.vocab
+- These are the SP-format tokenizer files obtained by training an SP tokenizer on data from the four languages.
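The README describes one tokenizer as a merge of several vocabularies. The commit does not show how that merge was done, so the following is only a minimal sketch of the general idea, assuming "SP" means SentencePiece and that each vocabulary is available as `.vocab` text with one `piece<TAB>score` entry per line. The helper names `parse_vocab` and `merge_vocabs` and the sample entries are hypothetical. A real merge would more likely edit the SentencePiece model file itself rather than plain-text vocabularies.

```python
from typing import Iterable, List, Tuple

Entry = Tuple[str, float]


def parse_vocab(text: str) -> List[Entry]:
    """Parse .vocab-style text: one 'piece<TAB>score' entry per line."""
    entries: List[Entry] = []
    for line in text.splitlines():
        if not line:
            continue
        # Split on the last tab so the score is always the final field.
        piece, sep, score = line.rpartition("\t")
        if not sep:
            # No tab on this line: treat the whole line as a piece with score 0.
            piece, score = line, "0"
        entries.append((piece, float(score)))
    return entries


def merge_vocabs(base: List[Entry], extras: Iterable[List[Entry]]) -> List[Entry]:
    """Append pieces missing from `base`, keeping base order (and thus base ids)."""
    merged = list(base)
    seen = {piece for piece, _ in base}
    for vocab in extras:
        for piece, score in vocab:
            if piece not in seen:
                seen.add(piece)
                merged.append((piece, score))
    return merged


# Hypothetical sample data, not taken from the repository files.
base = parse_vocab("<unk>\t0\n<s>\t0\n</s>\t0\n▁the\t-3.5\n")
telugu = parse_vocab("<unk>\t0\n▁తెలుగు\t-4.2\n▁the\t-3.9\n")
merged = merge_vocabs(base, [telugu])

for index, (piece, score) in enumerate(merged):
    print(index, piece, score)
```

Keeping the base entries first means existing token ids stay stable, which matters when the base model's embedding table is reused. If the `sentencepiece` package is installed, a `.model` file such as the ones listed in the README can be loaded with `sentencepiece.SentencePieceProcessor(model_file=...)`.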