jaswanthrk commited on
Commit
0ddb6ec
1 Parent(s): a5400e1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -1,3 +1,10 @@
1
  ---
2
  license: mit
3
  ---
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
  ---
4
+
5
+ posterior_KaTeMaTa_llama_llama.model
6
+ - This is SP format tokenizer obtained by merging Kannada, Telugu, Malayalam, Tamil and Llama-2 tokenizers.
7
+
8
+ posterior_dr_llama_15_32k_balanced.model
9
+ posterior_dr_llama_15_32k_balanced.vocab
10
+ - These is SP format tokenizer obtained by training the SP tokenizer using the four languages data.