Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis Paper ā¢ 2412.15322 ā¢ Published 13 days ago ā¢ 16
google/siglip-so400m-patch14-224 Zero-Shot Image Classification ā¢ Updated Aug 23, 2024 ā¢ 28.9k ā¢ 51
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models Paper ā¢ 2411.07126 ā¢ Published Nov 11, 2024 ā¢ 28
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. ā¢ 8 items ā¢ Updated Nov 23, 2024 ā¢ 78
PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance Paper ā¢ 2411.02327 ā¢ Published Nov 4, 2024 ā¢ 11