Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models 5 days ago • 106
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer Paper • 2203.03466 • Published Mar 7, 2022 • 1
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels Paper • 2406.09415 • Published 16 days ago • 47
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers Paper • 2406.05370 • Published 21 days ago • 12
Mixture-of-Agents Enhances Large Language Model Capabilities Paper • 2406.04692 • Published 22 days ago • 49
Guiding a Diffusion Model with a Bad Version of Itself Paper • 2406.02507 • Published 25 days ago • 14
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models Paper • 2406.02430 • Published 25 days ago • 27
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5 • 47
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality Paper • 2405.21060 • Published 29 days ago • 60
mistralai_hackathon Collection Synthetic datasets and fine-tuned Mistral models used in MistralAI Hackathon • 21 items • Updated about 18 hours ago • 4
FIFO-Diffusion: Generating Infinite Videos from Text without Training Paper • 2405.11473 • Published May 19 • 53
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts Paper • 2405.11273 • Published May 18 • 17
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models Paper • 2404.07839 • Published Apr 11 • 40
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities Paper • 2305.11000 • Published May 18, 2023 • 3
An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition Paper • 2312.03668 • Published Dec 6, 2023 • 1
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Paper • 2405.01434 • Published May 2 • 49
— UI is a good thing 💅 — Collection cool spaces with a cool UI, what could be better? • 5 items • Updated 11 days ago • 12
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community • 17 items • Updated 23 days ago • 205
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Apr 18 • 612
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 2 days ago • 317
Lumiere: A Space-Time Diffusion Model for Video Generation Paper • 2401.12945 • Published Jan 23 • 84
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data Paper • 2401.10891 • Published Jan 19 • 54
🛰️🌍 Geospatial Datasets Collection A curated collection of diverse geospatial and satellite imagery datasets. • 54 items • Updated Mar 6 • 11
Model Merging Collection Model merging is a very popular technique in LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated 16 days ago • 189
Seamless Communication Collection A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16 • 129
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models Paper • 2309.14717 • Published Sep 26, 2023 • 43
3D Gaussian Splatting Collection Tools to create or visualize gaussian splatting scenes • 4 items • Updated Sep 28, 2023 • 3
🎧AI Podcasts and Talks! Collection 🤗Cool stuff to listen to at any time! • 10 items • Updated Oct 6, 2023 • 4
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 65
Latent Consistency Models LoRAs Collection Latent Consistency Models for Stable Diffusion - LoRAs and full fine-tuned weights • 4 items • Updated Nov 10, 2023 • 96
zephyr story Collection sources mentioned by hf.co/thomwolf tweet: x.com/Thom_Wolf/status/1720503998518640703 • 8 items • Updated Jan 24 • 15
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing Paper • 2311.00571 • Published Nov 1, 2023 • 39
Eureka: Human-Level Reward Design via Coding Large Language Models Paper • 2310.12931 • Published Oct 19, 2023 • 26
LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 264 items • Updated 6 days ago • 335
💙 Favorites Spaces Collection My handpicked favourite Spaces of all time! • 10 items • Updated Feb 6 • 4