Falcon Mamba: The First Competitive Attention-free 7B Language Model Paper ā¢ 2410.05355 ā¢ Published Oct 7 ā¢ 29
Searching for Better ViT Baselines Collection Exploring ViT hparams and model shapes for the GPU poor (between tiny and base). ā¢ 25 items ā¢ Updated Aug 21 ā¢ 13
view article Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation Apr 29 ā¢ 75
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper ā¢ 2404.14619 ā¢ Published Apr 22 ā¢ 126
T2I-Adapter-SDXL Collection The smallest and most efficient control models for SDXL! ā¢ 8 items ā¢ Updated Sep 8, 2023 ā¢ 32
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper ā¢ 2307.09288 ā¢ Published Jul 18, 2023 ā¢ 242
DIY AI For Journalists Collection Compiling resources useful for journalists building prototypes with AI ā¢ 8 items ā¢ Updated Sep 18, 2023 ā¢ 11
OpenChat Collection OpenChat: Advancing Open-source Language Models with Mixed-Quality Data ā¢ 7 items ā¢ Updated Jul 31 ā¢ 33
FinGPT: Large Generative Models for a Small Language Paper ā¢ 2311.05640 ā¢ Published Nov 3, 2023 ā¢ 27
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper ā¢ 2211.05100 ā¢ Published Nov 9, 2022 ā¢ 28