🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 11 items • Updated 7 days ago • 68
OpenRLHF/preference_dataset_mixture2_and_safe_pku Viewer • Updated Jun 14, 2024 • 555k • 866 • 4
Cornell-AGI/REFUEL-Ultrainteract-Llama-3-Armo-iter_1 Viewer • Updated Oct 8, 2024 • 64.6k • 75 • 2
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated 9 days ago • 826k • • 1.1k
MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning Paper • 2410.18035 • Published Oct 23, 2024 • 1
Gemma-2-9B-it-Advanced Collection Merges of the advanced research fine tunes of gemma-2 9B it • 3 items • Updated Oct 20, 2024 • 3