Transformers.js demos Collection A collection of my favorite WebML demos, built with Transformers.js! • 23 items • Updated May 8 • 53
4M Models Collection Multimodal models from https://4m.epfl.ch/ • 14 items • Updated 12 days ago • 28
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion Paper • 2406.03184 • Published 21 days ago • 18
Learning Temporally Consistent Video Depth from Video Diffusion Priors Paper • 2406.01493 • Published 23 days ago • 17
CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner Paper • 2405.14979 • Published May 23 • 14
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 22 items • Updated 27 days ago • 345
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 17 items • Updated 20 days ago • 203
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 85
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 144
view article Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation Apr 29 • 70
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1 • 58
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 18 items • Updated 27 days ago • 145
view article Article 🧑⚖️ "Replacing Judges with Juries" using distilabel By alvarobartt • May 3 • 15
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 12 days ago • 37
view article Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19 • 76
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 64 items • Updated 15 days ago • 69
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 176
ControlRoom3D: Room Generation using Semantic Proxy Rooms Paper • 2312.05208 • Published Dec 8, 2023 • 8
Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis Paper • 2312.08782 • Published Dec 14, 2023 • 5
Seamless Communication Collection A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16 • 129
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer Paper • 2308.06873 • Published Aug 14, 2023 • 24
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft Paper • 2306.00937 • Published Jun 1, 2023 • 8