Hugging Face
HuggingFaceTB's Collections
SmolLM2
💻 Local SmolLMs
🪐 SmolLM
Instruct datasets
Cosmopedia
Find textbooks in FineWeb with a classifier
FineWeb clustering & synthetic generations
Other: Stanford, OpenStax, khanAcademy, wikihow...
FW generation prompts
Wikipedia Science topics
Wikipedia textbooks
SFT Experiments
Decay mixture experiments
models
Cosmopedia
Updated Aug 18
Resources for the Cosmopedia dataset
HuggingFaceTB/cosmopedia • Updated Aug 12 • 31.1M rows • 5.35k downloads • 566 likes
HuggingFaceTB/cosmo-1b • Text Generation • Updated Jul 8 • 833 downloads • 129 likes
Web clusters • Running • 5 likes
HuggingFaceTB/cosmopedia-100k • Updated Feb 19 • 100k rows • 428 downloads • 40 likes
HuggingFaceTB/cosmopedia-meta • Updated Feb 20 • 31.1M rows • 47 downloads • 2 likes
HuggingFaceTB/smollm-corpus • Updated Sep 6 • 237M rows • 12.9k downloads • 246 likes