This model is a fine-tune of openai/whisper-small, trained on approximately 750 hours of general conversational audio from Part 3 of the National Speech Corpus and then converted to CTranslate2 format for faster inference. These are the final results on the evaluation set (~95 hours of audio):

  • Validation Loss: 0.386770
  • WER: 14.257934
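
Because the weights are in CTranslate2 format, they can most likely be loaded with the faster-whisper library. The following is a minimal sketch, not an official usage snippet. It assumes that `faster-whisper` is installed and that the repository ID is `Xycone/faster-whisper-SGspeech-finetune`. The helper names `format_segments` and `transcribe_file`, and the `device` and `compute_type` defaults, are illustrative choices rather than anything prescribed by this model card.

```python
from typing import Iterable, List

# Assumed Hugging Face repository ID; replace with a local path if you have one.
MODEL_ID = "Xycone/faster-whisper-SGspeech-finetune"


def format_segments(segments: Iterable) -> List[str]:
    """Render segments (objects with .start, .end, .text) as timestamped lines."""
    return [
        f"[{seg.start:.2f}s -> {seg.end:.2f}s] {seg.text.strip()}"
        for seg in segments
    ]


def transcribe_file(
    audio_path: str,
    device: str = "cpu",         # illustrative default; "cuda" if a GPU is available
    compute_type: str = "int8",  # illustrative default; e.g. "float16" on GPU
) -> List[str]:
    """Transcribe one audio file and return timestamped text lines."""
    # Imported lazily so this file can be loaded without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(MODEL_ID, device=device, compute_type=compute_type)
    # transcribe() returns a lazy generator of segments plus an info object;
    # decoding happens as the generator is consumed in format_segments().
    segments, _info = model.transcribe(audio_path, beam_size=5)
    return format_segments(segments)


if __name__ == "__main__":
    import sys

    # Hypothetical invocation: python transcribe.py path/to/audio.wav
    if len(sys.argv) > 1:
        for line in transcribe_file(sys.argv[1]):
            print(line)
```

The first call downloads the model files, so it needs network access unless `MODEL_ID` points to a local directory.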
