microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 1 day ago • 441k • 1.12k
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 8 items • Updated 18 days ago • 396
openai/whisper-large-v3-turbo Automatic Speech Recognition • Updated Oct 4, 2024 • 7.84M • • 2.11k