Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2408.04034

about 11 hours ago

LinFusion: 1 GPU, 1 Minute, 16K Image

Paper • 2409.02097 • Published Sep 3 • 31
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17 • 25
Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27 • 121
Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17 • 21

Task-oriented Sequential Grounding in 3D Scenes

Paper • 2408.04034 • Published Aug 7 • 8

Next-Gen Robotics

Collection for myself to compile everything I thing is or will be related to Robotics

Achieving Human Level Competitive Robot Table Tennis

Paper • 2408.03906 • Published Aug 7 • 26
openbmb/MiniCPM-V-2_6

Image-Text-to-Text • Updated 5 days ago • 126k • 823
apple/OpenELM-270M-Instruct

Text Generation • Updated Jul 18 • 2.86k • 133
google/gemma-2-2b

Text Generation • Updated Aug 7 • 7.76M • 428

GECO: Generative Image-to-3D within a SECOnd

Paper • 2405.20327 • Published May 30 • 9
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion

Paper • 2406.03184 • Published Jun 5 • 19
NPGA: Neural Parametric Gaussian Avatars

Paper • 2405.19331 • Published May 29 • 10
Unified Text-to-Image Generation and Retrieval

Paper • 2406.05814 • Published Jun 9 • 11

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Paper • 2312.16862 • Published Dec 28, 2023 • 30
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Paper • 2312.17172 • Published Dec 28, 2023 • 26
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers

Paper • 2401.01974 • Published Jan 3 • 5
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations

Paper • 2401.01885 • Published Jan 3 • 27

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs