This is model is a finetune of the openai/whisper-small model using approximately 750 hours of general conversational audio from Part 3 of the National Speech Corpus converted to CTranslate2 format for faster inference. These are the final results on the evaluation set (~95 hours of audio):

Validation Loss: 0.386770
WER: 14.257934

Downloads last month: 221

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for Xycone/faster-whisper-SGspeech-finetune

Base model

openai/whisper-small

Finetuned

(2294)

this model