DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation Paper • 2412.15200 • Published 1 day ago • 8
Autoregressive Video Generation without Vector Quantization Paper • 2412.14169 • Published 2 days ago • 11
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints Paper • 2412.07760 • Published 10 days ago • 49
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes Paper • 2412.11457 • Published 5 days ago • 5
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Paper • 2412.09626 • Published 8 days ago • 19
FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction Paper • 2412.09573 • Published 8 days ago • 7
POINTS1.5: Building a Vision-Language Model towards Real World Applications Paper • 2412.08443 • Published 10 days ago • 38
ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Paper • 2412.07720 • Published 10 days ago • 30
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance Paper • 2412.05355 • Published 14 days ago • 7
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published 14 days ago • 111
Improved Distribution Matching Distillation for Fast Image Synthesis Paper • 2405.14867 • Published May 23 • 12
LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment Paper • 2412.04814 • Published 15 days ago • 44
AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models Paper • 2412.04146 • Published 16 days ago • 21
Mimir: Improving Video Diffusion Models for Precise Text Understanding Paper • 2412.03085 • Published 17 days ago • 12
SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance Paper • 2412.02687 • Published 17 days ago • 109
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation Paper • 2412.02259 • Published 18 days ago • 59
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability Paper • 2411.19943 • Published 21 days ago • 55
Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling Paper • 2411.18664 • Published 24 days ago • 23