C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. β’ 4 items β’ Updated 20 days ago β’ 50
Yi 1.5 GGUFs Collection Collection of Yi 1.5 GGUFs made with gguf-my-repo β’ 15 items β’ Updated May 20 β’ 5
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 26 items β’ Updated Nov 14 β’ 534
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs Paper β’ 2402.15627 β’ Published Feb 23 β’ 34
C4AI Command R Collection C4AI Command-R is a research release of a 35 billion parameter highly performant generative model. Command-R is a large language model with open weigh β’ 4 items β’ Updated 20 days ago β’ 19
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper β’ 2402.17764 β’ Published Feb 27 β’ 603
Frankenmodels Collection They're not supposed to be that size! Neat, right? β’ 8 items β’ Updated Dec 12, 2023 β’ 3