SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities Paper β’ 2502.12025 β’ Published 20 days ago
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding Paper β’ 2503.02951 β’ Published 5 days ago β’ 25
Small Models Struggle to Learn from Strong Reasoners Paper β’ 2502.12143 β’ Published 20 days ago β’ 28
Small Models Struggle to Learn from Strong Reasoners Paper β’ 2502.12143 β’ Published 20 days ago β’ 28
Small Models Struggle to Learn from Strong Reasoners Paper β’ 2502.12143 β’ Published 20 days ago β’ 28
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper β’ 2502.01100 β’ Published Feb 3 β’ 17
Magpie-Align/Magpie-Reasoning-V1-150K-CoT-Skywork-O1-Llama-3.1-8B Viewer β’ Updated Jan 27 β’ 150k β’ 254 β’ 1
Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Skywork-O1-Llama-3.1-8B Viewer β’ Updated Jan 27 β’ 250k β’ 352 β’ 5
Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B Viewer β’ Updated Jan 27 β’ 250k β’ 5.63k β’ 91
Magpie-Align/Magpie-Reasoning-V1-150K-CoT-Deepseek-R1-Llama-70B Viewer β’ Updated Jan 27 β’ 150k β’ 736 β’ 16
Magpie Reasoning Datasets Collection Reasoning datasets built by Magpie and its friends! β’ 8 items β’ Updated Jan 27 β’ 10
Magpie-Align/Magpie-Reasoning-V1-150K-CoT-Deepseek-R1-Llama-70B Viewer β’ Updated Jan 27 β’ 150k β’ 736 β’ 16
Magpie Reasoning Datasets Collection Reasoning datasets built by Magpie and its friends! β’ 8 items β’ Updated Jan 27 β’ 10
Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B Viewer β’ Updated Jan 27 β’ 250k β’ 5.63k β’ 91