MotionClone: Training-Free Motion Cloning for Controllable Video Generation Paper • 2406.05338 • Published 23 days ago • 39
NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing Paper • 2406.06523 • Published 21 days ago • 48
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published May 24 • 43
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models Paper • 2405.15574 • Published May 24 • 52
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper • 2405.09818 • Published May 16 • 111
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 4 days ago • 317