Vivim: a Video Vision Mamba for Medical Video Object Segmentation Paper • 2401.14168 • Published Jan 25 • 2
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated 17 days ago • 180
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5 • 60