Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2410.20011

about 2 hours ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 143
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20 • 11
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24 • 50
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24 • 44

Let’s try this out

GRS-QA -- Graph Reasoning-Structured Question Answering Dataset

Paper • 2411.00369 • Published 5 days ago • 6
A Survey of Small Language Models

Paper • 2410.20011 • Published 11 days ago • 36

A Survey of Small Language Models

Paper • 2410.20011 • Published 11 days ago • 36

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published 13 days ago • 29
A Survey of Small Language Models

Paper • 2410.20011 • Published 11 days ago • 36
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines

Paper • 2410.21220 • Published 9 days ago • 8

GPT-4o System Card

Paper • 2410.21276 • Published 11 days ago • 75
A Survey of Small Language Models

Paper • 2410.20011 • Published 11 days ago • 36

SLM - small language models

A Survey of Small Language Models

Paper • 2410.20011 • Published 11 days ago • 36
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 124
HuggingFaceTB/SmolLM2-1.7B-Instruct-GGUF

Text Generation • Updated about 15 hours ago • 2.64k • 22
OpenGVLab/Mini-InternVL-Chat-2B-V1-5

Image-Text-to-Text • Updated Sep 24 • 1.76k • 66

LLM Technical Reports

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31 • 105
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18 • 73
Baichuan Alignment Technical Report

Paper • 2410.14940 • Published 18 days ago • 47
A Survey of Small Language Models

Paper • 2410.20011 • Published 11 days ago • 36

MotionLLM: Understanding Human Behaviors from Human Motions and Videos

Paper • 2405.20340 • Published May 30 • 19
Spectrally Pruned Gaussian Fields with Neural Compensation

Paper • 2405.00676 • Published May 1 • 8
Paint by Inpaint: Learning to Add Image Objects by Removing Them First

Paper • 2404.18212 • Published Apr 28 • 27
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15 • 87
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 25
Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms

Paper • 2410.18967 • Published 12 days ago

To read... eventually

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 124
Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19 • 50
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

Paper • 2402.03766 • Published Feb 6 • 12
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 65

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs