---
library_name: transformers
license: mit
datasets:
- AlienKevin/mixed_cantonese_and_english_speech
- mozilla-foundation/common_voice_17_0
metrics:
- cer
base_model:
- openai/whisper-large-v3-turbo
---
CER: 15.2% <br/>
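
For readers unfamiliar with the metric, the sketch below shows how a character error rate (CER) is commonly computed: total character-level edit distance divided by the total number of reference characters. This is only an illustration. The card does not state which evaluation set or which implementation produced the 15.2% figure, and the function names `edit_distance` and `cer` as well as the sample sentences are hypothetical.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def cer(references: list[str], hypotheses: list[str]) -> float:
    """Corpus-level CER: total edits divided by total reference characters."""
    total_edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references must contain at least one character")
    return total_edits / total_chars


# Made-up example pairs (not taken from the training or evaluation data).
refs = ["我哋去食飯", "hello"]
hyps = ["我地去食飯", "hallo"]
print(f"CER: {cer(refs, hyps):.1%}")
```

Each pair above differs by one substituted character out of five, so the sketch reports 2 edits over 10 reference characters.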
Transformers version: 4.46.3<br/>
Train Args:<br/>
per_device_train_batch_size=16,<br/>
gradient_accumulation_steps=1,<br/>
learning_rate=2e-5,<br/>
gradient_checkpointing=True,<br/>
per_device_eval_batch_size=16,<br/>
generation_max_length=225,<br/>
Hardware:<br/>
4 × NVIDIA Tesla V100 (16 GB each)<br/>
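
The listed training arguments and hardware can be combined to estimate the effective global batch size. The sketch below collects the values from this card in a plain dictionary and does the arithmetic. It assumes all four GPUs were used in data-parallel training, which the card does not state explicitly; `NUM_GPUS` and `effective_batch_size` are names introduced here for illustration.

```python
# Values copied from the "Train Args" section of this card.
train_args = {
    "per_device_train_batch_size": 16,
    "gradient_accumulation_steps": 1,
    "learning_rate": 2e-5,
    "gradient_checkpointing": True,
    "per_device_eval_batch_size": 16,
    "generation_max_length": 225,
}

# Assumption: all four V100 GPUs took part in data-parallel training.
NUM_GPUS = 4

# Samples contributing to each optimizer step across all devices.
effective_batch_size = (
    train_args["per_device_train_batch_size"]
    * train_args["gradient_accumulation_steps"]
    * NUM_GPUS
)
print(f"Effective batch size: {effective_batch_size}")

# These keys match parameter names of transformers.Seq2SeqTrainingArguments,
# so a setup along the lines of
#   Seq2SeqTrainingArguments(output_dir="...", **train_args)
# is a plausible reconstruction; the output directory and any arguments not
# listed on this card are unknown.
```

Under that assumption each optimizer step sees 16 × 1 × 4 = 64 samples, which is the figure to preserve (by adjusting `gradient_accumulation_steps`) when reproducing the run on fewer GPUs.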