Arnold's picture
add tokenizer
6538682
raw
history blame
328 Bytes
{"p": 0, "ƙ": 1, "b": 2, "y": 3, "k": 4, "x": 5, "c": 6, "n": 7, "j": 8, "d": 9, "w": 10, "'": 11, "i": 12, "u": 13, "r": 14, "v": 15, "m": 16, "ƴ": 17, "l": 18, "ɗ": 19, "s": 20, "o": 21, "ʻ": 22, "a": 23, "í": 24, "f": 25, "ɓ": 26, "h": 27, "z": 28, "q": 29, "t": 30, "g": 31, "e": 33, "|": 32, "[UNK]": 34, "[PAD]": 35}