Collection of some GPT-4 generated datasets. It may be useful for those looking for the best-quality datasets to train competitive LLMs.
Leon Lee
Leon-Leee
AI & ML interests
LLMs, code generation, chatbot, workflows
Recent Activity
liked
a dataset
about 23 hours ago
bigcode/the-stack-v2-train-smol-ids
liked
a dataset
4 days ago
bigcode/the-stack-v2-dedup
upvoted
a
collection
4 days ago
Tulu 3 Datasets
Organizations
Collections
5
models
None public yet