🚧"raw" pretrained smol_llama checkpoints - WIP 🚧
![BEEspoke Data's profile picture](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEEspoke Data
community
AI & ML interests
'an LLM is only as good as the dataset it was trained on' - Sun Tzu
Organization Card
About org cards
🐝📊💁
Collections
6
smol_llama 220M fine-tunes we did
-
BEE-spoke-data/smol_llama-220M-openhermes
Text Generation • Updated • 3.31k • 3 -
BEE-spoke-data/smol_llama-220M-open_instruct
Text Generation • Updated • 389 • 1 -
BEE-spoke-data/beecoder-220M-python
Text Generation • Updated • 14 • 2 -
BEE-spoke-data/zephyr-220m-sft-full
Text Generation • Updated • 1.56k • 1
spaces
1
models
40
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEE-spoke-data/smol_llama-220M-GQA
Text Generation
•
Updated
•
6.24k
•
10
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEE-spoke-data/smol_llama-220M-GQA-fineweb_edu
Text Generation
•
Updated
•
167
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEE-spoke-data/Jamba-900M-doc-writer
Text Generation
•
Updated
•
108
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEE-spoke-data/mega-ar-350m-L3t-v0.08-ultraTBfw
Text Generation
•
Updated
•
3
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEE-spoke-data/Meta-Llama-3-8Bee
Text Generation
•
Updated
•
201
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEE-spoke-data/claude-tokenizer
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEE-spoke-data/TinyLlama-3T-1.1bee
Text Generation
•
Updated
•
135
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEE-spoke-data/bert-plus-L8-v1.0-allNLI_matryoshka
Sentence Similarity
•
Updated
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEE-spoke-data/bert-plus-L8-v1.0-synthSTSv3-4k
Sentence Similarity
•
Updated
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bccec062080d33f875cd0c/FDQwEj3IBUG_fa377Ej1U.jpeg)
BEE-spoke-data/mega-encoder-small-16k-v1
Fill-Mask
•
Updated
•
7
•
4
datasets
55
BEE-spoke-data/fineweb-1000_64k
Viewer
•
Updated
•
2k
•
67
BEE-spoke-data/govdocs1-image
Viewer
•
Updated
•
199k
•
25
BEE-spoke-data/sarcasm-scrolls
Viewer
•
Updated
•
8.76k
•
33
•
1
BEE-spoke-data/fineweb-edu-10BT-mincols
Viewer
•
Updated
•
9.67M
•
40
•
1
BEE-spoke-data/the-stack-smol-xl-readable
Viewer
•
Updated
•
424k
•
14
•
1
BEE-spoke-data/SaunaWeb-50k
Viewer
•
Updated
•
50k
•
3
BEE-spoke-data/UltraTextbooks-2.1-fw_mix
Viewer
•
Updated
•
7.27M
•
7
•
2
BEE-spoke-data/fineweb-literature-100k
Viewer
•
Updated
•
100k
•
23
•
1
BEE-spoke-data/fineweb-cryptid-5k
Viewer
•
Updated
•
5k
•
36
BEE-spoke-data/MoistWeb-25k
Viewer
•
Updated
•
25k
•
3