Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
espnet
/
owsm_ctc_v3.2_ft_1B
like
1
Follow
ESPnet
188
Automatic Speech Recognition
ESPnet
owsm_v3.2_ctc
multilingual
audio
speech-translation
language-identification
arxiv:
2401.16658
arxiv:
2406.09282
License:
cc-by-4.0
Model card
Files
Files and versions
Community
Use this model
main
owsm_ctc_v3.2_ft_1B
/
exp
2 contributors
History:
1 commit
Yifan Peng
add files
b0add0d
3 months ago
s2t_train_s2t_multitask-ctc_ebf27_conv2d8_size1024_init3.1_raw_bpe50000
add files
3 months ago