Deepthink and Reasoning Collection Best for Deepthink and Reasoning • 12 items • Updated about 15 hours ago • 11
Unsloth 4-bit Dynamic Quants Collection Unsloth's Dynamic 4-bit Quants selectively skip quantizing certain parameters, greatly improving accuracy while using only <10% more VRAM than BnB 4-bit • 7 items • Updated 11 days ago • 21
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases Paper • 2412.04862 • Published 30 days ago • 49
Article 🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 75
Skywork-Reward-Data-Collection Collection Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12, 2024 • 12
Skywork-Reward-Model Collection Skywork reward model series • 6 items • Updated Nov 26, 2024 • 5
Skywork-o1-Open Collection Skywork o1 open model collections • 3 items • Updated Nov 27, 2024 • 18
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 9 items • Updated Nov 28, 2024 • 59
Vortex Collection ModelCloud optimized and validated quants that pass/meet strict quality assurance on multiple benchmarks. • 10 items • Updated 6 days ago • 7
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published Nov 25, 2024 • 41
Speculative Decoding Draft Models Collection Collection of OpenVINO optimized efficient draft models for speculative decoding • 2 items • Updated 13 days ago • 7
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Paper • 2411.14432 • Published Nov 21, 2024 • 22
Insight-V Collection Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models • 5 items • Updated Nov 22, 2024 • 9
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published Nov 21, 2024 • 58
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 79