Autoregressive Video Generation without Vector Quantization Paper • 2412.14169 • Published 3 days ago • 12
Arbitrary-steps Image Super-resolution via Diffusion Inversion Paper • 2412.09013 • Published 10 days ago • 10
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Paper • 2412.09626 • Published 9 days ago • 19
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training Paper • 2412.09619 • Published 9 days ago • 20
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Paper • 2412.07774 • Published 11 days ago • 24
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published 12 days ago • 45
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion Paper • 2412.04424 • Published 16 days ago • 54
TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models Paper • 2411.18350 • Published 25 days ago • 22
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Paper • 2411.18613 • Published 24 days ago • 50
Pathways on the Image Manifold: Image Editing via Video Generation Paper • 2411.16819 • Published 27 days ago • 30
VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement Paper • 2411.15115 • Published 30 days ago • 9
Stylecodes: Encoding Stylistic Information For Image Generation Paper • 2411.12811 • Published Nov 19 • 11
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing Paper • 2411.11045 • Published Nov 17 • 11
MagicQuill: An Intelligent Interactive Image Editing System Paper • 2411.09703 • Published Nov 14 • 57
Adaptive Caching for Faster Video Generation with Diffusion Transformers Paper • 2411.02397 • Published Nov 4 • 23
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control Paper • 2410.13830 • Published Oct 17 • 23
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation Paper • 2410.08159 • Published Oct 10 • 25