Pippo: High-Resolution Multi-View Humans from a Single Image Paper • 2502.07785 • Published 6 days ago • 9
VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation Paper • 2502.07531 • Published 6 days ago • 12
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation Paper • 2502.05179 • Published 10 days ago • 22
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 11 days ago • 48
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 15 days ago • 177
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 13 days ago • 55
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published 24 days ago • 30
People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text Paper • 2501.15654 • Published 22 days ago • 11
Histoires Morales: A French Dataset for Assessing Moral Alignment Paper • 2501.17117 • Published 20 days ago • 3
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published 26 days ago • 67
OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer Paper • 2406.16620 • Published Jun 24, 2024 • 2
NeuralSVG: An Implicit Representation for Text-to-Vector Generation Paper • 2501.03992 • Published Jan 7 • 1
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Paper • 2501.12375 • Published 27 days ago • 22