# Summary
A multilingual tokenizer trained on multilingual text with the SentencePiece library, using the BPE (byte-pair encoding) algorithm.
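
To illustrate what BPE does, here is a minimal pure-Python sketch of the merge-learning step. It is a toy illustration only and not how this tokenizer was built: SentencePiece's own implementation works on raw text, marks whitespace with `▁`, and is far more optimized. The function names (`learn_bpe_merges`, `apply_merge`, `encode`), the tie-breaking rule, and the sample corpus are all assumptions made for the example.

```python
from collections import Counter


def apply_merge(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    merged = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            merged.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            merged.append(symbols[i])
            i += 1
    return tuple(merged)


def learn_bpe_merges(words, num_merges):
    """Learn up to `num_merges` merge rules from a list of words (toy BPE)."""
    # Each distinct word starts as a tuple of single characters, with its frequency.
    vocab = Counter(tuple(word) for word in words)
    merges = []
    for _ in range(num_merges):
        # Count how often each adjacent symbol pair occurs, weighted by word frequency.
        pair_counts = Counter()
        for symbols, freq in vocab.items():
            for pair in zip(symbols, symbols[1:]):
                pair_counts[pair] += freq
        if not pair_counts:
            break
        # Most frequent pair wins; ties are broken lexicographically
        # (an assumption of this sketch, chosen only to be deterministic).
        best = min(pair_counts, key=lambda p: (-pair_counts[p], p))
        merges.append(best)
        # Rewrite the vocabulary with the chosen pair merged.
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[apply_merge(symbols, best)] += freq
        vocab = new_vocab
    return merges


def encode(word, merges):
    """Tokenize `word` by applying the learned merges in the order they were learned."""
    symbols = tuple(word)
    for pair in merges:
        symbols = apply_merge(symbols, pair)
    return list(symbols)
```

For example, `learn_bpe_merges(["low", "low", "lower", "lowest"], 2)` learns merges that build the subword `low`, after which `encode("lowest", merges)` splits the word into `low` plus the remaining single characters.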