microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated about 14 hours ago • 441k • 1.12k
VideoLLaMA3 Collection Frontier Multimodal Foundation Models for Video Understanding • 14 items • Updated 2 days ago • 13