techiaith
/

whisper-large-v3-ft-cv-cy

Automatic Speech Recognition

Generated from Trainer

Model card Files Files and versions Metrics Training metrics

whisper-large-v3-ft-cv-cy

This model is a version of openai/whisper-large-v3 fine-tuned with the train_all and other_with_excluded custom splits from techiaith/commonvoice_18_0_cy

It achieves the following results on the Common Voice for Welsh release 18's standard test set:

WER: 18.50
CER: 5.32

N.B. this model performs considerably worse on English language speech, but better on Welsh than a bilingual model

Usage

from transformers import pipeline

transcriber = pipeline("automatic-speech-recognition", model="techiaith/whisper-large-v3-ft-cv-cy")
result = transcriber(<path or url to soundfile>)
print (result)

{'text': 'Mae hen wlad fy nhadau yn annwyl i mi.'}

Downloads last month: 28

Safetensors

Model size

1.54B params

Tensor type

F32

·

Inference Examples

Automatic Speech Recognition

Unable to determine this model's library. Check the docs .

Model tree for techiaith/whisper-large-v3-ft-cv-cy

Base model

openai/whisper-large-v3

Finetuned

(349)

this model

Finetunes

1 model

Dataset used to train techiaith/whisper-large-v3-ft-cv-cy

Collection including techiaith/whisper-large-v3-ft-cv-cy

Speech Recognition Models

Models for Welsh language and bilingual speech recognition • 14 items • Updated Nov 8

Evaluation results

Wer on DewiBrynJones/commonvoice_18_0_cy default
self-reported

0.185

View on Papers With Code