Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
BEE-spoke-data
's Collections
smol llama
finetuned smol 220M
Pretrained Encoders
Bee Models 🍯
book genre classifiers
tokenizers
FineWeb Concept Datasets
tokenizers
updated
Aug 7
trained and adapted tokenizers - various
Upvote
-
BEE-spoke-data/claude-tokenizer
Updated
Apr 20
BEE-spoke-data/claude-tokenizer-forT5
Updated
Jul 28
BEE-spoke-data/slimpajama_tok-48128-BPE-forT5
Updated
Aug 7
BEE-spoke-data/BeeTokenizer
Updated
Jul 20
•
1
BEE-spoke-data/MiniTokenizer-20480
Updated
Jul 21
sail/scaling-with-vocab-trained-tokenizers
Updated
Aug 2
•
2
pszemraj/claude-tokenizer-mlm
Updated
Mar 14
Upvote
-
Share collection
View history
Collection guide
Browse collections