Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
PleIAs
's Collections
Common Artifacts
Common Models
Common Corpus
Toxic Commons
Finance Commons
Bad Data Toolbox
OpenCulture
Common Corpus
updated
Nov 13
Largest multilingual pretraining data.
Upvote
8
PleIAs/common_corpus
Viewer
•
Updated
26 days ago
•
397M
•
52.9k
•
190
Upvote
8
+4
Share collection
View history
Collection guide
Browse collections