MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data Paper • 2406.18790 • Published 2 days ago • 18
MotionBooth: Motion-Aware Customized Text-to-Video Generation Paper • 2406.17758 • Published 4 days ago • 15
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published 4 days ago • 66
Jina Reranker v1 Collection Neural Reranker models for English language • 3 items • Updated 2 days ago • 1
LongVA Collection Long Context Transfer From Text To Vision: https://lmms-lab.github.io/posts/longva/ • 5 items • Updated 3 days ago • 9
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models 5 days ago • 106
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation Paper • 2311.17117 • Published Nov 28, 2023 • 5
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 65
MatFuse: Controllable Material Generation with Diffusion Models Paper • 2308.11408 • Published Aug 22, 2023 • 3
VideoGPT+ Collection VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding • 10 items • Updated 18 days ago • 3
Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs Paper • 2406.08657 • Published 16 days ago • 9
Explore the Limits of Omni-modal Pretraining at Scale Paper • 2406.09412 • Published 16 days ago • 10
TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation Paper • 2406.08656 • Published 16 days ago • 7
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning Paper • 2406.09170 • Published 16 days ago • 22
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery Paper • 2406.08587 • Published 17 days ago • 14
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models Paper • 2406.09403 • Published 16 days ago • 17
HelpSteer2: Open-source dataset for training top-performing reward models Paper • 2406.08673 • Published 16 days ago • 14
Interpreting the Weight Space of Customized Diffusion Models Paper • 2406.09413 • Published 16 days ago • 18
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels Paper • 2406.09415 • Published 16 days ago • 47
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models Paper • 2406.09416 • Published 16 days ago • 28
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models Paper • 2405.20541 • Published 29 days ago • 18
Terminus XL Collection v-prediction SDXL clone with zero-terminal SNR noise schedule • 8 items • Updated Apr 24 • 6
🎭 Avatars Collection The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 44 items • Updated 1 day ago • 58
Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning Paper • 2305.18424 • Published May 28, 2023 • 1
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy Paper • 2310.01334 • Published Oct 2, 2023 • 3
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models Paper • 2310.02998 • Published Oct 4, 2023 • 1
Unraveling the Key Components of OOD Generalization via Diversification Paper • 2312.16313 • Published Dec 26, 2023 • 1
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization Paper • 2401.15914 • Published Jan 29 • 7
Ferret: Refer and Ground Anything Anywhere at Any Granularity Paper • 2310.07704 • Published Oct 11, 2023 • 11
ConjNorm: Tractable Density Estimation for Out-of-Distribution Detection Paper • 2402.17888 • Published Feb 27 • 1
Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes Paper • 2310.01840 • Published Oct 3, 2023 • 1
Protein Design & Protein Structure Prediction Collection Interactive Demos that can be used for protein structure prediction using AlphaFold2 or RoseTTAfold2, prediction of small metal ions • 7 items • Updated Sep 18, 2023 • 4
Spaces of the Week Collection My spaces or spaces I worked featured on Spaces of the Week! Ones at the top are the oldest, newest at the bottom 🤗 • 6 items • Updated Apr 29 • 2
🚂 SD-XL Training Suite Collection All the steps to train your own SD-XL custom model • 7 items • Updated 18 days ago • 14
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated Apr 26 • 23
Experimental Projects Collection Spaces that are too early or cutting edge for mainstream usage 🙂 • 4 items • Updated Nov 16, 2023 • 5