MiniMax-01: Scaling Foundation Models with Lightning Attention Paper โข 2501.08313 โข Published 18 days ago โข 271
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. โข 27 items โข Updated about 3 hours ago โข 129
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation โข Updated about 23 hours ago โข 238k โข โข 811
๐ FineMath Collection FineMath datasets and ablation models โข 14 items โข Updated 27 days ago โข 19