DisPose: Disentangling Pose Guidance for Controllable Human Image Animation Paper • 2412.09349 • Published 10 days ago • 6
Identity-Preserving Text-to-Video Generation by Frequency Decomposition Paper • 2411.17440 • Published 26 days ago • 34
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference Paper • 2406.18139 • Published Jun 26 • 2
Distilling an End-to-End Voice Assistant Without Instruction Training Data Paper • 2410.02678 • Published Oct 3 • 22
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators Paper • 2404.05014 • Published Apr 7 • 31
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach Paper • 2401.15652 • Published Jan 28
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation Paper • 2406.18522 • Published Jun 26 • 18
DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories Paper • 2405.19856 • Published May 30 • 8
DevEval: Evaluating Code Generation in Practical Software Projects Paper • 2401.06401 • Published Jan 12
EvoCodeBench: An Evolving Code Generation Benchmark Aligned with Real-World Code Repositories Paper • 2404.00599 • Published Mar 31 • 1
FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation Paper • 2403.06775 • Published Mar 11 • 3