view article Article OCR Processing and Text in Image Analysis with Florence-2-base and Qwen2-VL-2B By PandorAI1995 • Oct 18, 2024 • 17
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 23 days ago • 65
Qwen2.5-VL (All Versions) Collection All versions of Qwen2.5-VL including 4-bit, 16-bit and more! • 9 items • Updated 2 days ago • 8
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated 22 days ago • 50
HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution Paper • 2501.10045 • Published Jan 17 • 9
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published Dec 23, 2024 • 46