Stable Flow: Vital Layers for Training-Free Image Editing Paper • 2411.14430 • Published about 24 hours ago • 7
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper • 2411.10442 • Published 7 days ago • 40
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 7 days ago • 91
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models Paper • 2411.09595 • Published 8 days ago • 65
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion Paper • 2411.04928 • Published 15 days ago • 47
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning Paper • 2411.05003 • Published 15 days ago • 67
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation Paper • 2411.04709 • Published 17 days ago • 25
DreamPolish: Domain Score Distillation With Progressive Geometry Generation Paper • 2411.01602 • Published 19 days ago • 9
Learning Video Representations without Natural Videos Paper • 2410.24213 • Published 22 days ago • 14
LLaMo: Large Language Model-based Molecular Graph Assistant Paper • 2411.00871 • Published 23 days ago • 21
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D Paper • 2411.02336 • Published 18 days ago • 23
Training-free Regional Prompting for Diffusion Transformers Paper • 2411.02395 • Published 18 days ago • 23
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models Paper • 2410.23266 • Published 23 days ago • 19
Adaptive Caching for Faster Video Generation with Diffusion Transformers Paper • 2411.02397 • Published 18 days ago • 20
How Far is Video Generation from World Model: A Physical Law Perspective Paper • 2411.02385 • Published 18 days ago • 32
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models Paper • 2411.00836 • Published 24 days ago • 15
Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation Paper • 2411.00412 • Published 21 days ago • 9