Qwen2.5 Collection Qwen2.5 language models: pretrained and instruction-tuned models in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B). • 45 items • Updated Nov 28, 2024 • 452
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 20 days ago • 135
Prompt Order Experiment Collection Prompt Order Experiment shows how to run a simple experiment on the Hub and leverage tools like AutoTrain and Inference Endpoints. • 16 items • Updated 21 days ago • 2
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published 29 days ago • 119
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 11 days ago • 195
Article Releasing Outlines-core 0.1.0: structured generation in Rust and Python Oct 22, 2024 • 44
AutoTrain: No-code training for state-of-the-art models Paper • 2410.15735 • Published Oct 21, 2024 • 59
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation Paper • 2410.07170 • Published Oct 9, 2024 • 15
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Nov 27, 2024 • 289
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 27 days ago • 551
ReLiK: Retrieve, Read and LinK Collection A blazing fast and lightweight Information Extraction model for Entity Linking and Relation Extraction. • 20 items • Updated 29 days ago • 23
jina-embeddings-v3: Multilingual Embeddings With Task LoRA Paper • 2409.10173 • Published Sep 16, 2024 • 28
Llama 3.1 GPTQ, AWQ, and BNB Quants Collection Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗 • 9 items • Updated Sep 26, 2024 • 56
NIM Serverless Inference API Collection Models in this collection are available for inference via a serverless API powered by NVIDIA NIM. • 8 items • Updated Oct 14, 2024 • 22
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3, 2024 • 83