Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. β’ 8 items β’ Updated 23 days ago β’ 47
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram β’ Dec 4, 2024 β’ 75
BloombergGPT: A Large Language Model for Finance Paper β’ 2303.17564 β’ Published Mar 30, 2023 β’ 21
π Ichigo v0.4 Collection The experimental family designed to train LLMs to understand sound natively. β’ 2 items β’ Updated Nov 11, 2024 β’ 7
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. β’ 8 items β’ Updated Nov 23, 2024 β’ 79
Llama 3.2 3B & 1B GGUF Quants Collection Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models. β’ 4 items β’ Updated Sep 26, 2024 β’ 46
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi β’ 13 items β’ Updated Sep 18, 2024 β’ 225
Jamba-1.5 Collection The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models β’ 2 items β’ Updated Aug 22, 2024 β’ 84
Transformer Explainer: Interactive Learning of Text-Generative Models Paper β’ 2408.04619 β’ Published Aug 8, 2024 β’ 156
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper β’ 2407.16741 β’ Published Jul 23, 2024 β’ 69
Agentless: Demystifying LLM-based Software Engineering Agents Paper β’ 2407.01489 β’ Published Jul 1, 2024 β’ 42
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Paper β’ 2406.11931 β’ Published Jun 17, 2024 β’ 58
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper β’ 2405.09818 β’ Published May 16, 2024 β’ 127
πGGUF Collection Llama.cpp compatible models, can be used on CPUs and GPUs! β’ 1002 items β’ Updated 1 day ago β’ 35