Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. β’ 29 items β’ Updated 23 days ago β’ 218
K2 Collection K2-65B is a fully reproducible LLM outperforming Llama 2 70B using 35% less compute. β’ 7 items β’ Updated 17 days ago β’ 6
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma β’ 16 items β’ Updated 2 days ago β’ 118
OLMo Suite Collection Artifacts for the first set of OLMo models. β’ 14 items β’ Updated 4 days ago β’ 37
Mantis Collection Mantis model family optimized for multi-image reasoning with interleaved text/image format β’ 10 items β’ Updated May 23 β’ 7
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. β’ 20 items β’ Updated about 14 hours ago β’ 145
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 22 items β’ Updated 30 days ago β’ 346
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Apr 18 β’ 612
Idefics2 πΆ Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. β’ 11 items β’ Updated May 6 β’ 85
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 β’ 144
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin β’ 63 items β’ Updated Apr 17 β’ 51
Leaderboards and benchmarks β¨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... β’ 64 items β’ Updated 18 days ago β’ 69
Aurora-M models Collection Aurora-M models (base, biden-harris redteams and instruct) β’ 5 items β’ Updated May 6 β’ 17
LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 264 items β’ Updated 7 days ago β’ 335
Paloma Collection Dataset and baseline models for Paloma, a benchmark of language model fit to 585 textual domains β’ 8 items β’ Updated 19 days ago β’ 13
π Daily Picks in Interpretability & Analysis of LMs Collection Outstanding research in interpretability and evaluation of language models, summarized β’ 57 items β’ Updated 3 days ago β’ 60
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. β’ 55 items β’ Updated 23 days ago β’ 198
OpenChat Collection OpenChat: Advancing Open-source Language Models with Mixed-Quality Data β’ 7 items β’ Updated Jan 10 β’ 33
Tulu V2 Suite Collection The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" β’ 19 items β’ Updated 19 days ago β’ 43
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research Paper β’ 2402.00159 β’ Published Jan 31 β’ 55
β StarCoder Collection All models, datasets, and demos related to StarCoder! β’ 11 items β’ Updated Feb 27 β’ 20