Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
Phi-4-multimodal-instruct
like
328
Follow
Microsoft
9.37k
Automatic Speech Recognition
Transformers
Safetensors
multilingual
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
arxiv:
2407.13833
License:
mit
Model card
Files
Files and versions
Community
6
Train
Use this model
Update README.md
#2
by
fasdfgaer
- opened
about 10 hours ago
base:
refs/heads/main
←
from:
refs/pr/2
Discussion
Files changed
+1
-1
fasdfgaer
about 10 hours ago
•
edited about 9 hours ago
Corrected the typo "Audio Uniderstanding" to "Audio Understanding".
See translation
Update README.md
faf353bc
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Ready to merge
This branch is ready to get merged automatically.
Comment
·
Sign up
or
log in
to comment