Spaces: yhavinga/dutch-tokenizer-arena
2 contributors
History: 58 commits
Latest commit bce41d0 by eson (4 months ago): fix unicode error: 'unicodeescape' codec can't decode bytes in position 602-608: unknown Unicode character name
Name              Size       Last updated   Last commit message
css               -          10 months ago  update
images            -          10 months ago  update
js                -          5 months ago   fix chatglm; new feature about add_special_tokens;
tokenizer         -          4 months ago   fix unicode error: 'unicodeescape' codec can't decode bytes in position 602-608: unknown Unicode character name
utils             -          5 months ago   add more tokenizer
vocab             -          4 months ago   fix unicode error: 'unicodeescape' codec can't decode bytes in position 602-608: unknown Unicode character name
.gitattributes    1.83 kB    4 months ago   add gemma_7b
.gitignore        171 Bytes  10 months ago  update
README.md         330 Bytes  5 months ago   update
app.py            7.08 kB    4 months ago   update
config.py         45 Bytes   5 months ago   fix chatglm; new feature about add_special_tokens;
evaluation.md     58 Bytes   10 months ago  update
examples.py       2.93 kB    4 months ago   update
requirements.txt  72 Bytes   4 months ago   add olmo tokenizer
util.py           6.22 kB    4 months ago   fix tiktoken