PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published 18 days ago • 118
VisionZip: Longer is Better but Not Necessary in Vision Language Models Paper • 2412.04467 • Published 17 days ago • 103
Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • 2410.19008 • Published Oct 21 • 23
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published Oct 22 • 89
RadRotator: 3D Rotation of Radiographs with Diffusion Models Paper • 2404.13000 • Published Apr 19 • 25
Understanding LLMs: A Comprehensive Overview from Training to Inference Paper • 2401.02038 • Published Jan 4 • 62
PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns Paper • 2312.04534 • Published Dec 7, 2023 • 6
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 257
timm Top-20 Fastest Models Collection Not the most accurate, but the highest throughput image classification models in timm • 20 items • Updated Jun 12 • 14
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation Paper • 2309.06380 • Published Sep 12, 2023 • 32