Learning Video Representations without Natural Videos Paper • 2410.24213 • Published 26 days ago • 14
GhostNetV3: Exploring the Training Strategies for Compact Models Paper • 2404.11202 • Published Apr 17
ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning Paper • 2410.17779 • Published Oct 23 • 7
ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning Paper • 2410.17779 • Published Oct 23 • 7
ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning Paper • 2410.17779 • Published Oct 23 • 7 • 2
SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution Paper • 2402.17133 • Published Feb 27 • 1
Data-efficient Large Vision Models through Sequential Autoregression Paper • 2402.04841 • Published Feb 7
One-for-All: Bridge the Gap Between Heterogeneous Architectures in Knowledge Distillation Paper • 2310.19444 • Published Oct 30, 2023