ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated 1 day ago β’ 67
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data β’ 8 items β’ Updated 3 days ago β’ 14
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. β’ 40 items β’ Updated 1 day ago β’ 68
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. β’ 23 items β’ Updated 7 days ago β’ 118
π± Sailor2 Language Models Collection Sailing in South-East Asia with Inclusive Multilingual LLMs β’ 9 items β’ Updated 17 days ago β’ 21
Hymba Collection A series of Hybrid Small Language Models. β’ 2 items β’ Updated 29 days ago β’ 24
Quantization Spaces on the Hub β‘ Collection A collection of spaces that allow you to quantize on the Hub β’ 4 items β’ Updated Nov 4 β’ 5
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 7 items β’ Updated 23 days ago β’ 29
Thinking LLMs: General Instruction Following with Thought Generation Paper β’ 2410.10630 β’ Published Oct 14 β’ 17
Qwen 2.5 Coder Collection Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. β’ 35 items β’ Updated 14 days ago β’ 20
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 β’ 40 items β’ Updated 23 days ago β’ 255
AMD-OLMo Collection AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinctβ’ MI250 GPUs based on OLMo. β’ 4 items β’ Updated Oct 31 β’ 17