Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 9 items • Updated 5 days ago • 33
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 9 days ago • 203
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 14 items • Updated 3 days ago • 66
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 models • 11 items • Updated 2 days ago • 259
Jina Reranker v2 Collection A collection of state-of-the-art multilingual neural rerankers • 1 item • Updated 11 days ago • 7
jina-embeddings-v3 Collection Multilingual multi-task general text embedding model • 6 items • Updated 9 days ago • 11
jina-embeddings-v2 Collection The V2 family of Jina Embeddings supports encoding large documents with 8k sequence length. • 8 items • Updated 11 days ago • 14
Qwen2-Audio Collection Audio-language model series based on Qwen2 • 4 items • Updated 10 days ago • 41
Qwen2-VL Collection Vision-language model series based on Qwen2 • 15 items • Updated 10 days ago • 125
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12 • 114
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Jul 31 • 325
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 2 days ago • 583
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12 • 144
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 10 days ago • 206
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models in 5 sizes: 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 10 days ago • 339
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated about 19 hours ago • 156
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 9 days ago • 467
Llama 2 Family Collection This collection hosts the transformers and original repos of the Llama 2 and Llama Guard releases • 13 items • Updated 2 days ago • 63
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 2 days ago • 676