EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling Paper • 2502.09509 • Published 24 days ago • 7
YOLOv12: Attention-Centric Real-Time Object Detectors Paper • 2502.12524 • Published 20 days ago • 10
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 17 days ago • 128