Hugging Face
RickBrannan's Collections
Hallucinations
Datasets
Machine Translation
Text Classification
Long Context
Low Resource Languages
Multimodal RAG
Datasets
Updated Jan 17
Towards Best Practices for Open Datasets for LLM Training
Paper • 2501.08365 • Published Jan 14 • 56 upvotes